Shortcut accelerate (training acceleration) #72

Open
Esther-qian opened this issue Oct 14, 2024 · 0 comments

Comments

@Esther-qian

Why does adding a denoising task speed up training? In the transformer, the noised queries and the original queries do not attend to each other, and their losses are computed separately. At which point, then, do the noised queries influence the original queries and accelerate their matching?
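To make the setup I am asking about concrete, here is a minimal sketch of how I understand it. It is not taken from the repository: the function names `build_attention_mask` and `total_loss`, the query ordering (noised queries first), and the symmetric blocking in both directions are all my assumptions, and the actual implementation may mask differently.

```python
def build_attention_mask(num_noised, num_matching):
    """Build a square self-attention mask over all decoder queries.

    Assumed layout (hypothetical): the first `num_noised` queries are the
    noised ones, the remaining `num_matching` are the original queries.
    mask[i][j] is True when query i is NOT allowed to attend to query j.
    """
    total = num_noised + num_matching
    mask = [[False] * total for _ in range(total)]
    for i in range(total):
        for j in range(total):
            i_is_noised = i < num_noised
            j_is_noised = j < num_noised
            # Block attention whenever the two queries belong to different groups.
            mask[i][j] = i_is_noised != j_is_noised
    return mask


def total_loss(matching_loss, denoising_loss):
    """The two losses are computed separately and only summed at the end."""
    return matching_loss + denoising_loss


if __name__ == "__main__":
    # 2 noised queries followed by 3 original queries; "x" marks blocked pairs.
    for row in build_attention_mask(num_noised=2, num_matching=3):
        print("".join("x" if blocked else "." for blocked in row))
```

With this mask the two groups never exchange information inside self-attention, and the losses only meet in the final sum, which is exactly why I do not see where the speed-up comes from.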
