Hello, this blog post describes a supposedly universal gradient accumulation bug, and I was wondering if someone knowledgeable could say whether the kohya training scripts (including this repo) suffer from it or whether it doesn't apply: https://unsloth.ai/blog/gradient
Answered by kohya-ss on Oct 21, 2024
From my understanding, this issue only occurs with cross-entropy loss, where each micro-batch's loss is normalized by its own (variable) token count. Currently, this repository uses MSE loss over fixed-shape tensors, so this issue should not occur.
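
To illustrate the reasoning above, here is a minimal PyTorch sketch (not from the thread; tensor shapes and seeds are made up). With MSE over fixed-shape micro-batches, the average of per-micro-batch means equals the full-batch mean, so naive accumulation is exact. With cross-entropy over variable-length micro-batches, averaging per-micro-batch means does not match the mean over all tokens:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# --- MSE case: fixed-shape micro-batches -> accumulation matches full batch ---
pred = torch.randn(4, 8)    # 4 samples, 8 values each (hypothetical shapes)
target = torch.randn(4, 8)

full_mse = F.mse_loss(pred, target)  # mean over all 32 elements
accum_mse = torch.stack([
    F.mse_loss(pred[i:i + 2], target[i:i + 2])  # two micro-batches of 2 samples
    for i in (0, 2)
]).mean()
print(torch.allclose(full_mse, accum_mse))  # True: equal-sized denominators

# --- Cross-entropy case: variable token counts -> naive accumulation drifts ---
logits_a = torch.randn(3, 10)   # micro-batch A: 3 tokens, 10-class vocab
logits_b = torch.randn(7, 10)   # micro-batch B: 7 tokens
labels_a = torch.randint(0, 10, (3,))
labels_b = torch.randint(0, 10, (7,))

full_ce = F.cross_entropy(torch.cat([logits_a, logits_b]),
                          torch.cat([labels_a, labels_b]))  # mean over all 10 tokens
naive_ce = (F.cross_entropy(logits_a, labels_a) +
            F.cross_entropy(logits_b, labels_b)) / 2        # mean of per-batch means
print(torch.allclose(full_ce, naive_ce))  # False in general: (s_a/3 + s_b/7)/2 != (s_a + s_b)/10
```

The takeaway: when every micro-batch's mean is taken over the same number of elements, summing the losses and dividing by the number of accumulation steps reproduces the full-batch gradient; the bug arises only when the per-micro-batch denominators differ, as with token counts under cross-entropy.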
Answer selected by ilazarte