INFO: Found overflow. Skip step #5136
Unanswered
stephencurry-web asked this question in Community | Q&A
I trained Llama2-7B-chat on the Alpaca dataset. When I set the batch size to 2 or 4, the message `INFO: Found overflow. Skip step` appears at every step of the entire training run, and the gradients are NaN. With a batch size of 1, training runs normally. What could be the reason?
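For context, a message like this is typical of dynamic loss scaling in mixed-precision training: when any gradient is inf or NaN, the optimizer step is skipped and the loss scale is reduced. The following is only a minimal sketch of that mechanism in plain Python. The class name `DynamicLossScaler`, its parameters, and its defaults are hypothetical and are not taken from the framework used in the question.

```python
import math


class DynamicLossScaler:
    """Toy dynamic loss scaler (hypothetical; not any framework's actual class)."""

    def __init__(self, init_scale=2.0**16, backoff=0.5, growth=2.0, growth_interval=1000):
        self.scale = init_scale                 # current loss scale
        self.backoff = backoff                  # factor applied after an overflow
        self.growth = growth                    # factor applied after a clean streak
        self.growth_interval = growth_interval  # clean steps needed before growing
        self.good_steps = 0

    def update(self, grads):
        """Return True if the optimizer step should run, False if it is skipped."""
        overflow = any(not math.isfinite(g) for g in grads)
        if overflow:
            # Equivalent of "Found overflow. Skip step": shrink the scale and skip.
            self.scale *= self.backoff
            self.good_steps = 0
            return False
        self.good_steps += 1
        if self.good_steps % self.growth_interval == 0:
            self.scale *= self.growth
        return True


scaler = DynamicLossScaler()
print(scaler.update([0.1, float("inf")]), scaler.scale)  # step skipped, scale halved
print(scaler.update([float("nan")]), scaler.scale)       # skipped again
```

In this sketch, an occasional inf is absorbed by lowering the scale, but a NaN gradient stays NaN at any scale, so every step keeps being skipped. That would match the symptom described above, where the overflow message appears at every step.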