Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training on ADE20K #162

Open
loveis98 opened this issue Apr 27, 2019 · 5 comments
Open

Training on ADE20K #162

loveis98 opened this issue Apr 27, 2019 · 5 comments

Comments

@loveis98
Copy link

loveis98 commented Apr 27, 2019

Hello! I try to train default models from train.py on ADE20K, but I see such error:

Input arguments:
weights_decoder  
batch_size_per_gpu 2
weights_encoder  
workers          16
lr_pow           0.9
seed             304
epoch_iters      5000
weight_decay     0.0001
root_dataset     ./data/
list_train       ./data/train.odgt
optim            SGD
num_epoch        20
ckpt             ./ckpt
list_val         ./data/validation.odgt
id               baseline
imgMaxSize       1000
deep_sup_scale   0.4
fix_bn           False
lr_encoder       0.02
gpus             0
beta1            0.9
random_flip      True
num_class        150
start_epoch      1
imgSize          [300, 375, 450, 525, 600]
arch_decoder     ppm_deepsup
disp_iter        20
padding_constant 8
segm_downsampling_rate 8
lr_decoder       0.02
fc_dim           2048
arch_encoder     resnet50dilated
Model ID: baseline-resnet50dilated-ppm_deepsup-ngpus1-batchSize2-imgMaxSize1000-paddingConst8-segmDownsampleRate8-LR_encoder0.02-LR_decoder0.02-epoch20
# samples: 20210
1 Epoch = 5000 iters
Traceback (most recent call last):
  File "train.py", line 325, in <module>
    main(args)
  File "train.py", line 200, in main
    train(segmentation_module, iterator_train, optimizers, history, epoch, args)
  File "train.py", line 37, in train
    loss, acc = segmentation_module(batch_data)
  File "/home/ponomareva/.local/lib/python3.5/site-packages/torch/nn/modules/module.py", line 477, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/ponomareva/semantic-segmentation-pytorch/models/models.py", line 34, in forward
    (pred, pred_deepsup) = self.decoder(self.encoder(feed_dict['img_data'], return_feature_maps=True))
TypeError: list indices must be integers or slices, not str

@loveis98 loveis98 changed the title Training Training on customer dataset Apr 27, 2019
@loveis98 loveis98 changed the title Training on customer dataset Training on ADE20K Apr 28, 2019
@zhukaii
Copy link

zhukaii commented Apr 29, 2019

I also encountered the same problem. Is there any solution?

@komms
Copy link

komms commented May 15, 2019

Training should be done on mutiple GPU's. Refer #41

@hangzhaomit
Copy link
Collaborator

@loveis98
Our customized DataParallel only supports multi-gpu training. But in the upcoming version (distributed branch), single-gpu training will be supported.

@ShihuaiXu
Copy link

ShihuaiXu commented Sep 12, 2019

@hangzhaomit
thanks your help and now this project supports single gpu?

@adityashrm21
Copy link

adityashrm21 commented May 20, 2020

@loveis98 You might want to try this:
#203 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants