We attempted to train the model by following the instructions provided in the Train Diffusion Texture Painting guide. However, the results we obtained were significantly below the quality of the pre-trained checkpoints provided in the repository.
Experiment Setup
Here is a detailed description of our training setup:
🤗Accelerate Configuration: Default settings with single GPU training
Observations and Issues
Below are the key issues we encountered during training:
Validation Results: The model struggles to inpaint certain samples effectively. We suspect that the random mask generation algorithm is a contributing factor, since it sometimes produces masks in which large areas, or even the entire image, are black or white.
[Image: Final Validation Result]
Training Loss: The training loss does not decrease as expected, which suggests a potential issue with the training process or configuration.
[Image: Train Loss]
Drawing Results: The generated stamps do not connect properly, and the previews fail to reproduce the intended brush strokes.
[Image: Drawing Result]
Request for Assistance
Given these issues, we kindly ask the authors or maintainers for guidance on the following points:
Are there specific considerations or adjustments to the random mask generation algorithm that could improve performance, particularly for large regions of missing pixels?
Could the abnormal training loss behavior be related to the dataset, optimizer settings, or any overlooked configuration details?
Are there any additional settings, debugging tips, or known limitations of the current implementation that might help us achieve better results?
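For the first question, a check along the following lines could quantify how often degenerate masks occur. This is only a minimal sketch and not the repository's mask generator: the helper names (`mask_coverage`, `is_degenerate`, `degenerate_rate`), the 5%/95% thresholds, and the convention that 1 marks a pixel to inpaint are all assumptions made for illustration.

```python
from typing import Sequence

# Assumed convention: a mask is a 2D grid of 0/1 values, 1 = pixel to inpaint.
Mask = Sequence[Sequence[int]]


def mask_coverage(mask: Mask) -> float:
    """Return the fraction of pixels that are marked for inpainting."""
    total = sum(len(row) for row in mask)
    if total == 0:
        raise ValueError("mask is empty")
    return sum(sum(row) for row in mask) / total


def is_degenerate(mask: Mask, low: float = 0.05, high: float = 0.95) -> bool:
    """True when the mask is almost entirely 0 or almost entirely 1.

    The thresholds are hypothetical and would need tuning for a real dataset.
    """
    coverage = mask_coverage(mask)
    return coverage < low or coverage > high


def degenerate_rate(masks: Sequence[Mask], low: float = 0.05, high: float = 0.95) -> float:
    """Fraction of masks in a sample that are flagged as degenerate."""
    if not masks:
        raise ValueError("no masks given")
    flagged = sum(1 for m in masks if is_degenerate(m, low, high))
    return flagged / len(masks)
```

Running `degenerate_rate` over a few thousand sampled masks would show whether nearly empty or nearly full masks are rare outliers or a large share of the training data.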
We would greatly appreciate your insights or suggestions. Thank you for your work on this project, and we look forward to your response.
This training loss curve is expected for diffusion models. The model predicts the noise at a randomly sampled timestep at every training step, so the per-step loss fluctuates with the timestep that was drawn: at earlier timesteps it is harder to denoise (higher loss at that training step), while at later timesteps it is easier (lower loss at that training step).
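One way to make such a curve easier to read is to group logged losses by the timestep they were computed at, rather than plotting them in training order. The sketch below is an illustration only: `mean_loss_per_timestep_bin` is a hypothetical helper, the default of 1000 training timesteps is an assumption about the scheduler, and it presumes you log a `(timestep, loss)` pair per sample, which the training script may not do out of the box.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def mean_loss_per_timestep_bin(
    records: Iterable[Tuple[int, float]],
    num_train_timesteps: int = 1000,  # assumed scheduler setting
    num_bins: int = 10,
) -> Dict[int, float]:
    """Average the loss inside equally sized timestep bins.

    `records` holds (timestep, loss) pairs collected during training.
    Bin 0 covers the lowest timesteps, bin `num_bins - 1` the highest.
    """
    if num_train_timesteps <= 0 or num_bins <= 0:
        raise ValueError("num_train_timesteps and num_bins must be positive")
    sums: Dict[int, float] = defaultdict(float)
    counts: Dict[int, int] = defaultdict(int)
    for timestep, loss in records:
        if not 0 <= timestep < num_train_timesteps:
            raise ValueError(f"timestep {timestep} is out of range")
        bin_index = timestep * num_bins // num_train_timesteps
        sums[bin_index] += loss
        counts[bin_index] += 1
    # Only bins that received at least one record are returned.
    return {b: sums[b] / counts[b] for b in sorted(sums)}
```

If the per-bin averages drop over the course of training while the raw curve stays noisy, the fluctuation comes from timestep sampling rather than from a failure to learn.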
The released model weights were trained for 100 epochs on 8 GPUs with a batch size of 32 per GPU. On a single GPU, you can try training with gradient accumulation to reach the same total training batch size of 256.
This was my setup:
09/07/2023 02:51:23 - INFO - __main__ - ***** Running training *****
09/07/2023 02:51:23 - INFO - __main__ - Num examples = 5640
09/07/2023 02:51:23 - INFO - __main__ - Num Epochs = 100
09/07/2023 02:51:23 - INFO - __main__ - Instantaneous batch size per device = 32
09/07/2023 02:51:23 - INFO - __main__ - Total train batch size (w. parallel, distributed & accumulation) = 256
09/07/2023 02:51:23 - INFO - __main__ - Gradient Accumulation steps = 1
09/07/2023 02:51:23 - INFO - __main__ - Total optimization steps = 2300
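As a rough sanity check, the numbers in this log can be reproduced with a few lines of arithmetic, and the same arithmetic shows which gradient accumulation setting matches it on a single GPU. This is a sketch under assumptions: the function names are hypothetical, and the step-counting formula mirrors how many Accelerate-based training scripts derive the step count, which may differ in detail from this repository's script.

```python
import math


def total_batch_size(per_device_batch_size: int, num_devices: int, accumulation_steps: int) -> int:
    """Samples that contribute to a single optimizer update."""
    return per_device_batch_size * num_devices * accumulation_steps


def total_optimization_steps(
    num_examples: int,
    per_device_batch_size: int,
    num_devices: int,
    accumulation_steps: int,
    num_epochs: int,
) -> int:
    """Number of optimizer updates over the whole run (assumed formula)."""
    # Batches in the dataset, then batches seen by each device per epoch.
    batches = math.ceil(num_examples / per_device_batch_size)
    batches_per_device = math.ceil(batches / num_devices)
    # One optimizer update every `accumulation_steps` batches.
    steps_per_epoch = math.ceil(batches_per_device / accumulation_steps)
    return steps_per_epoch * num_epochs


# Setup from the log: 8 GPUs, batch size 32 per GPU, no accumulation.
released = (
    total_batch_size(32, 8, 1),
    total_optimization_steps(5640, 32, 8, 1, 100),
)

# Single GPU with 8 accumulation steps to match the total batch size.
single_gpu = (
    total_batch_size(32, 1, 8),
    total_optimization_steps(5640, 32, 1, 8, 100),
)

print(released)    # (256, 2300)
print(single_gpu)  # (256, 2300)
```

If 32 samples do not fit on your GPU, a smaller per-device batch size with proportionally more accumulation steps (for example 16 and 16) gives the same total of 256.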