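When Hypformer applies its attention mechanism, it appears to compute only (K^T)V. Is this approach valid in hyperbolic space? It seems more reasonable to measure the similarity between two points in hyperbolic space with the hyperbolic distance or a tangent-space inner product (for example, Equation (5) in the paper). Do the authors have a good explanation for this? Many thanks!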
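For readers following the thread, the (K^T)V ordering mentioned here is the usual linear-attention reordering: with a positive feature map phi, (phi(Q) phi(K)^T) V equals phi(Q) (phi(K)^T V), so pairwise similarities are still used implicitly even though the N x N matrix is never built. The sketch below is plain Euclidean linear attention in pure Python, not Hypformer's code; the feature map `elu(x) + 1` and all function names are assumptions chosen for illustration. It does not settle whether the same reordering is geometrically justified in hyperbolic space, which is the actual question.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(a):
    """Transpose a matrix given as a list of rows."""
    return [list(col) for col in zip(*a)]

def feature_map(m):
    """Assumed positive feature map phi(x) = elu(x) + 1 (not taken from Hypformer)."""
    return [[x + 1.0 if x > 0 else math.exp(x) for x in row] for row in m]

def quadratic_attention(q, k, v):
    """Build the full N x N similarity matrix, normalize each row, apply to V."""
    fq, fk = feature_map(q), feature_map(k)
    sim = matmul(fq, transpose(fk))              # N x N pairwise similarities
    out = matmul(sim, v)
    return [[x / sum(s) for x in o] for o, s in zip(out, sim)]

def linear_attention(q, k, v):
    """Compute phi(K)^T V first, so no N x N matrix is ever formed."""
    fq, fk = feature_map(q), feature_map(k)
    kv = matmul(transpose(fk), v)                # d x d_v summary of keys and values
    k_sum = [sum(col) for col in zip(*fk)]       # sum_j phi(k_j), length d
    num = matmul(fq, kv)
    den = [sum(a * b for a, b in zip(row, k_sum)) for row in fq]
    return [[x / d for x in o] for o, d in zip(num, den)]

# Toy inputs (made up for illustration): 3 tokens, dimension 2.
Q = [[0.1, -0.2], [0.5, 0.3], [-0.4, 0.9]]
K = [[0.2, 0.1], [-0.3, 0.8], [0.7, -0.6]]
V = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]

full = quadratic_attention(Q, K, V)
fast = linear_attention(Q, K, V)
```

Both functions return the same matrix up to floating-point error; the second only changes the order of multiplication, reducing the cost from quadratic to linear in the number of tokens.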
Thanks for your questions. Will make a response shortly!