Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reduce alpha,beta unnecessary d2h #17

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

sleepcoo
Copy link

@sleepcoo sleepcoo commented Sep 20, 2023

When I use this library, I find that there are many D2H copies of 4bytes, which occurs in self.a.item (), since a needs to be worn back to the cpu in the gpu memory to fetch the value, I fixed alpha and beta in the cpu after initialization. Fixed this bug, which improved by 6% in my test scence(It is not strictly compared, but the whole under the llm model.).
@Guangxuan-Xiao

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant