Skip to content

Training and PreTrained #4

Answered by htoyryla
metaphorz asked this question in Q&A
May 10, 2022 · 9 comments · 16 replies
Discussion options

You must be logged in to vote

The option steps means the diffusion steps inside the diffusion model, and best left at 1000.

Trainsteps is more like what you want, but it is not epochs but more like batches (I need to check how exactly it counts).

Roughly speaking we train accum batches, then update model. After saveEvery update rounds we store model and generate samples. That's what you should look at.

I have found that accum = 10, saveEvery = 100 and nsamples = 2 work well. I.e. after 1000 batches we store the model and generate two samples.

Trainsteps derives from the original code, perhaps we should count the number of such rounds (1000 batches in the example).

I have usually trained for from half a day to two days…

Replies: 9 comments 16 replies

Comment options

You must be logged in to vote
1 reply
@metaphorz
Comment options

Answer selected by htoyryla
Comment options

You must be logged in to vote
12 replies
@htoyryla
Comment options

@metaphorz
Comment options

@metaphorz
Comment options

@htoyryla
Comment options

@metaphorz
Comment options

Comment options

You must be logged in to vote
1 reply
@metaphorz
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@htoyryla
Comment options

Comment options

You must be logged in to vote
1 reply
@htoyryla
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants