-
Notifications
You must be signed in to change notification settings - Fork 253
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
第一个step后第三阶段loss变为nan #157
Comments
此外如果我把fp16设置为True会得到
|
可以试试把语言模型设置成bf16,混合精度数据类型也改成bf16,fp16在某些情况容易nan |
我也有一样的问题,你解决了吗 |
According to the author's message, after changing to bf16 the loss will no longer be nan. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
这个我stage3配置信息
然后我得到错误,loss为nan
The text was updated successfully, but these errors were encountered: