Commit

Merge pull request #877 from wenxindongwork:gemma-models-allow-flash-attention

PiperOrigin-RevId: 673455408
maxtext authors committed Sep 11, 2024
2 parents 7631466 + 170e1e4 commit d2c7a2e
Showing 3 changed files with 0 additions and 3 deletions.
1 change: 0 additions & 1 deletion MaxText/configs/models/gemma2-27b.yml
@@ -25,7 +25,6 @@ vocab_size: 256128
decoder_block: "gemma2"
normalization_layer_epsilon: 1.e-06
logits_via_embedding: True
-attention: "dot_product"
final_logits_soft_cap: 30.0
attn_logits_soft_cap: 50.0
sliding_window_size: 4096
1 change: 0 additions & 1 deletion MaxText/configs/models/gemma2-2b.yml
@@ -25,7 +25,6 @@ vocab_size: 256128
decoder_block: "gemma2"
normalization_layer_epsilon: 1.e-06
logits_via_embedding: True
-attention: "dot_product"
final_logits_soft_cap: 30.0
attn_logits_soft_cap: 50.0
sliding_window_size: 4096
1 change: 0 additions & 1 deletion MaxText/configs/models/gemma2-9b.yml
@@ -25,7 +25,6 @@ vocab_size: 256128
decoder_block: "gemma2"
normalization_layer_epsilon: 1.e-06
logits_via_embedding: True
-attention: "dot_product"
final_logits_soft_cap: 30.0
attn_logits_soft_cap: 50.0
sliding_window_size: 4096
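
The effect of dropping the `attention` key from the three model files can be sketched as a layered config lookup. This is only an illustration under stated assumptions: the `BASE_DEFAULTS` values (including `"autoselected"`), the `resolve` helper, and the merge order are hypothetical stand-ins, not MaxText's actual loader or defaults.

```python
# Hypothetical base defaults; the "autoselected" value is an assumption
# made for this sketch, not something taken from the diff above.
BASE_DEFAULTS = {"attention": "autoselected", "sliding_window_size": 0}

# Model-file contents before the commit (attention pinned) ...
GEMMA2_BEFORE = {
    "decoder_block": "gemma2",
    "attention": "dot_product",
    "sliding_window_size": 4096,
}

# ... and after the commit (the attention key is simply absent).
GEMMA2_AFTER = {k: v for k, v in GEMMA2_BEFORE.items() if k != "attention"}


def resolve(base: dict, model: dict) -> dict:
    """Return base settings overridden by whatever the model file sets."""
    merged = dict(base)   # copy so the base defaults are not mutated
    merged.update(model)  # model-file keys win over base keys
    return merged


before = resolve(BASE_DEFAULTS, GEMMA2_BEFORE)
after = resolve(BASE_DEFAULTS, GEMMA2_AFTER)
print(before["attention"], after["attention"])
```

Under these assumptions, the model file no longer forces `dot_product`, so the attention kernel is decided by whichever other layer sets it.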
