Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Naming: n_local_heads -> n_kv_heads #162

Open
ad8e opened this issue Apr 23, 2024 · 0 comments
Open

Naming: n_local_heads -> n_kv_heads #162

ad8e opened this issue Apr 23, 2024 · 0 comments

Comments

@ad8e
Copy link

ad8e commented Apr 23, 2024

n_local_heads refers to TP sharding, rather than GQA.

@ad8e ad8e changed the title Naming: n_local_heads -> n_kv_heads Naming: n_local_heads -> n_kv_head Apr 23, 2024
@ad8e ad8e changed the title Naming: n_local_heads -> n_kv_head Naming: n_local_heads -> n_kv_heads Apr 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant