Why do I think the answer to this exercise is to divide by the batch size? I asked GPT as well, and it also says to divide.
Gradients are accumulated, so shouldn't the learning rate be scaled up by the batch size as well?
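Hmm, I realized this exercise is really a question of logic: it asks how the learning rate should change when the total loss is replaced by the average loss, and in that case I think the learning rate should be divided by the batch size.
As for gradients being accumulated, I assume that refers to gradient accumulation in PyTorch, which probably has little to do with this exercise.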
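Since the thread goes back and forth on the direction of the scaling, here is a small numerical check. It is only a sketch under stated assumptions: plain SGD, a one-parameter linear model, and a squared-error loss that is either summed or averaged over the minibatch (the distinction PyTorch exposes as `reduction='sum'` versus `reduction='mean'` in `nn.MSELoss`). The data, the starting weight `w0`, and the helper names `grad_sum`, `grad_mean`, and `sgd_step` are made up for illustration and do not come from the exercise.

```python
import math

def grad_sum(w, xs, ys):
    """Gradient w.r.t. w of the summed squared error sum((w*x - y)**2)."""
    return sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys))

def grad_mean(w, xs, ys):
    """Gradient w.r.t. w of the averaged squared error: the summed gradient / B."""
    return grad_sum(w, xs, ys) / len(xs)

def sgd_step(w, grad, lr):
    """One plain SGD update (no momentum, no weight decay)."""
    return w - lr * grad

# Hypothetical minibatch of size B = 4 and an arbitrary starting weight.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]
w0 = 0.5
lr_mean = 0.03
batch_size = len(xs)

# Averaged loss with learning rate lr_mean.
w_mean = sgd_step(w0, grad_mean(w0, xs, ys), lr_mean)

# Summed loss with the learning rate divided by the batch size: same update.
w_sum = sgd_step(w0, grad_sum(w0, xs, ys), lr_mean / batch_size)

# Summed loss with the unchanged learning rate: a step B times larger.
w_sum_unscaled = sgd_step(w0, grad_sum(w0, xs, ys), lr_mean)

print(w_mean, w_sum, w_sum_unscaled)
```

Under these assumptions the summed-loss gradient is B times the averaged-loss gradient, so the two setups take the same step when the learning rate used with the summed loss is the averaged-loss learning rate divided by B. Read the other way round, switching from a summed loss to an averaged loss calls for multiplying the learning rate by B. So "divide" or "multiply" depends on which direction the exercise is read in. This equivalence is specific to plain SGD; optimizers that normalize the gradient scale, such as Adam, would not behave the same way.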