
Problems with model training #1

Open
anothersin opened this issue Jul 28, 2021 · 3 comments

Comments

@anothersin
Thanks a lot for your contributions. We are very interested in your work, but after retraining the model (with everything set to match the parameters in the paper), we found the new model's output unacceptable: it appears to be outputting the wavelet components. It is worth mentioning that we found no problem in the output of the provided pre-trained model, which confused us. We then switched to another dataset and the issue persisted. We are sure it is not a problem with the model's input and output. We are at a loss now; can you help us?
Our training environment is PyTorch 1.7, CUDA 10.1, on a Tesla V100.
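An output that looks like raw wavelet sub-bands usually suggests the inverse wavelet transform was never applied to the network's prediction. As a minimal sanity-check sketch (plain NumPy, not the repository's code), a one-level 2D Haar decomposition and its inverse should round-trip exactly; a correctly recombined output must satisfy this:

```python
import numpy as np

def haar_dwt2(x):
    """One-level 2D Haar transform of an even-sized array into four sub-bands."""
    a = x[0::2, 0::2]; b = x[0::2, 1::2]
    c = x[1::2, 0::2]; d = x[1::2, 1::2]
    ll = (a + b + c + d) / 2  # low-low (approximation)
    lh = (a - b + c - d) / 2  # horizontal detail
    hl = (a + b - c - d) / 2  # vertical detail
    hh = (a - b - c + d) / 2  # diagonal detail
    return ll, lh, hl, hh

def haar_idwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2: recombine the four sub-bands into the image."""
    h, w = ll.shape
    x = np.empty((2 * h, 2 * w))
    x[0::2, 0::2] = (ll + lh + hl + hh) / 2
    x[0::2, 1::2] = (ll - lh + hl - hh) / 2
    x[1::2, 0::2] = (ll + lh - hl - hh) / 2
    x[1::2, 1::2] = (ll - lh - hl + hh) / 2
    return x
```

If the trained model's output resembles a grid of `ll`/`lh`/`hl`/`hh` tiles rather than an image, the recombination step (the `haar_idwt2` analogue in the actual code) is likely being skipped during training or saving.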


hhb072 (Owner) commented Aug 2, 2021

I am happy you are interested in our work. May I ask when this problem occurs: during early training, or later?

I have retrained the code on the DDN dataset and do not find the problem.

@anothersin (Author)

Thank you very much for your reply. To be precise, we encountered this problem during training: we trained on a new dataset, tested checkpoints at 10, 100, and 500 epochs, and the output at each checkpoint looked like the attached image. To rule out the training data, we trained directly on Rain100L, with similar results. To rule out the test code, we tested with the model you provided, and those results were normal.
The only change we made to the code was in the DataLoader, and we double-checked that the data was not loaded incorrectly.
The specific change is as follows: we removed 'args.trainfiles' and instead traverse the input folder to get the input image names.

In `main.py`, in the dataset-loading part:

```python
# opt.trainroot = './Deraining/Datasets/train/input/'
trainfiles = os.listdir(opt.trainroot)
# train_list = readlinesFromFile(trainfiles)  # original: read names from args.trainfiles
train_list = trainfiles
assert len(train_list) > 0

train_set = ImageDatasetFromFile(train_list, opt.trainroot, crop_height=opt.output_height,
                                 output_height=opt.output_height, is_random_crop=True, is_mirror=True,
                                 normalize=None)
```

In `dataset.py`, in the `load_image` function:

```python
imgR = Image.open(file_path)
# pair each rainy input with the clean target of the same basename
image_L_path = os.path.join('./Deraining/Datasets/train/target', os.path.basename(file_path))
imgL = Image.open(image_L_path)
```
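One pitfall with `os.listdir` is that it returns entries in arbitrary order, may include non-image files, and gives no guarantee that every input has a matching target. A hypothetical sanity check (not part of the repository; `build_pairs` is an illustrative helper) that builds explicit (input, target) pairs before training could look like this:

```python
import os

def build_pairs(input_dir, target_dir):
    """Pair each input image with the target of the same basename,
    failing loudly if a target is missing."""
    names = sorted(os.listdir(input_dir))  # sort for deterministic order
    pairs = []
    for name in names:
        target = os.path.join(target_dir, name)
        if not os.path.isfile(target):
            raise FileNotFoundError(f"no target image for {name}")
        pairs.append((os.path.join(input_dir, name), target))
    return pairs
```

Running this once on the train folders would quickly confirm whether the rainy/clean pairs line up, independently of the DataLoader change.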


ghost commented Sep 24, 2021


I also encountered the same problem. Have you solved it?
