
Restarting from previous checkpoint #40

Open
jbrry opened this issue Oct 26, 2023 · 0 comments

Comments

jbrry commented Oct 26, 2023

Hi, do you know what the best way to resume training from a previous checkpoint would be? Let's assume I am training for 100k steps but I have a 24-hour time limit, and I just have the following checkpoints available:

ls checkpoints/pretrain
vanilla_11081_12.0%.pth  vanilla_11081_25.0%.pth  vanilla_11081_50.0%.pth

Given that the generator and discriminator are instantiated as separate models, do we point them both to the same .pth file? Also, I believe the .from_pretrained() method requires a single config.json, so how do we merge the two configs, if merging is necessary?
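For reference, this is the kind of key-prefix splitting I assume would be needed if both models' weights live in one .pth file. The prefixes ("generator.", "discriminator.") are a guess at the checkpoint layout, not necessarily what this repo actually saves:

```python
# Hypothetical sketch: split a combined checkpoint's state dict into
# per-model state dicts by key prefix, so each model can load its part.
# The key names below are placeholders, not the repo's real layout.
checkpoint = {
    "generator.embed.weight": "g-tensor",
    "generator.layer0.weight": "g-tensor",
    "discriminator.layer0.weight": "d-tensor",
    "step": 50000,  # optimizer/step metadata would be kept separately
}

def split_state_dict(ckpt, prefix):
    """Return the sub-dict for one model, with the prefix stripped."""
    return {k[len(prefix):]: v for k, v in ckpt.items() if k.startswith(prefix)}

gen_sd = split_state_dict(checkpoint, "generator.")
disc_sd = split_state_dict(checkpoint, "discriminator.")
# gen_sd keys:  "embed.weight", "layer0.weight"
# disc_sd keys: "layer0.weight"
```

In real use, the dict would come from torch.load(...) and each sub-dict would be passed to the corresponding model's load_state_dict(); I'm not sure whether this matches how the training script serializes things, hence the question.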

Thanks
