Commit

fix num_kv_heads sharding in autoTP for the new in-repo Falcon-40B (#…
3 people authored Nov 10, 2023
1 parent a361bac commit 6ea44d0
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion deepspeed/module_inject/auto_tp.py
@@ -385,7 +385,7 @@ def update_mp_params(self, child):
             return
         for param in [
                 "n_heads", "inner_dim", "num_heads", "num_kv", "num_attention_heads", "num_attn_heads",
-                "all_head_size", "embed_dim", "hidden_size", "num_key_value_heads"
+                "all_head_size", "embed_dim", "hidden_size", "num_key_value_heads", "num_kv_heads"
         ]:
             if hasattr(child, param):
                 param_val = getattr(child, param)
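For context, here is a minimal sketch of why the extra list entry matters. The rest of the loop is not shown in the hunk, so this assumes each matched attribute is simply divided by the tensor-parallel degree. The function name `shard_head_params`, the constant `SHARDED_PARAMS`, the `mp_size` argument, and the attribute values are hypothetical and chosen for illustration only; this is not the DeepSpeed implementation.

```python
from types import SimpleNamespace

# Parameter names as they appear in the list after this commit.
SHARDED_PARAMS = [
    "n_heads", "inner_dim", "num_heads", "num_kv", "num_attention_heads", "num_attn_heads",
    "all_head_size", "embed_dim", "hidden_size", "num_key_value_heads", "num_kv_heads",
]


def shard_head_params(child, mp_size):
    """Divide every listed attribute found on `child` by `mp_size`.

    Assumption: an even integer split across ranks. The real update logic
    is outside the diff hunk and may differ.
    """
    for param in SHARDED_PARAMS:
        if hasattr(child, param):
            param_val = getattr(child, param)
            setattr(child, param, param_val // mp_size)
    return child


# Illustrative attention module that exposes `num_kv_heads`.
attn = SimpleNamespace(num_heads=128, num_kv_heads=8, hidden_size=8192)
shard_head_params(attn, mp_size=4)
print(attn.num_heads, attn.num_kv_heads, attn.hidden_size)  # 32 2 2048
```

Without `"num_kv_heads"` in the list, a module using that attribute name would keep its unsharded KV head count while the query head count was divided, leaving the two out of step on each rank.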