We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
因为现在手头只有v100的机器,所以训练的时候尝试用了fp16(bf16有点慢)。
但是发现用fp16实质上似乎没有训练,
Megatron-LLaMA/megatron/optimizer/optimizer.py
Line 433 in 25306de
同样的数据集bf16的情况我跑过,没有这个问题。我也修改--initial-loss-scale到一个比较小的值也不行。
The text was updated successfully, but these errors were encountered:
抱歉,fp16 验证的比较少,我们近期会看看。
Sorry, something went wrong.
No branches or pull requests
因为现在手头只有v100的机器,所以训练的时候尝试用了fp16(bf16有点慢)。
但是发现用fp16实质上似乎没有训练,
Megatron-LLaMA/megatron/optimizer/optimizer.py
Line 433 in 25306de
同样的数据集bf16的情况我跑过,没有这个问题。我也修改--initial-loss-scale到一个比较小的值也不行。
The text was updated successfully, but these errors were encountered: