Skip to content

Commit

Permalink
fix: Llama AWS with GQA (#114)
Browse files Browse the repository at this point in the history
  • Loading branch information
tgaddair authored Dec 8, 2023
1 parent 41c657e commit 6b65aac
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -184,7 +184,7 @@ def _load_gqa(config, prefix: str, weights):
dim=0,
)

if config.quantize not in ["gptq"]:
if config.quantize not in ["gptq", "awq"]:
weight = weight.to(dtype=weights.dtype).to(device=weights.device)

head_size = config.hidden_size // config.num_attention_heads
Expand Down

0 comments on commit 6b65aac

Please sign in to comment.