Enhanced gemma prediction with new flawless logit #51
base: main
Conversation
PiperOrigin-RevId: 663277444 Change-Id: I8d7030ce586577a433c48f32df7efa7c141b171a
…ormer_lib.make_causal_attn_mask(input_mask)` PiperOrigin-RevId: 663692225 Change-Id: Ie2cb6229302087ea1ce5b5c7f442a088207ead07
PiperOrigin-RevId: 665414923 Change-Id: I42bc41074518e3065f85c7f1a3014fdd09cffe4c
Currently all weights in FeedForward layers are initialized to zero. This doesn't cause any issues when loading the module with pretrained weights, but when training from scratch it results in all gradients being zero throughout training, so no learning can occur. Changing w_gating to be initialized from a normal distribution fixes this. PiperOrigin-RevId: 674306730 Change-Id: I90800dbe605cdf88f341d103f102357ff278a393
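For illustration, here is a minimal Flax sketch of the fix that commit describes, assuming a gated FeedForward block with a `w_gating` parameter; the parameter names, shapes, and the 0.02 standard deviation are assumptions for the example, not taken from the actual diff:

```python
import flax.linen as nn
import jax.numpy as jnp


class FeedForward(nn.Module):
  """Gated feed-forward block (illustrative sketch, not the repo's exact code)."""
  features: int
  hidden_dim: int

  @nn.compact
  def __call__(self, x):
    # Previously zero-initialized; with all-zero gating weights the activations
    # are zero everywhere and no gradient can flow when training from scratch.
    w_gating = self.param(
        'w_gating',
        nn.initializers.normal(stddev=0.02),  # assumed stddev, not from the PR
        (2, self.features, self.hidden_dim),
    )
    gate = nn.gelu(jnp.dot(x, w_gating[0]))
    activations = gate * jnp.dot(x, w_gating[1])
    w_linear = self.param(
        'w_linear',
        nn.initializers.zeros,  # zero is fine here once w_gating is non-zero
        (self.hidden_dim, self.features),
    )
    return jnp.dot(activations, w_linear)
```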
PiperOrigin-RevId: 674394389 Change-Id: I25ba5ad4769c3101c2bf572e33723d4a241e3895
…se errors for implicit rank promotion. PiperOrigin-RevId: 675179053 Change-Id: I55459c1aa99c7d33ae3f03712eaed01ccc5fc9f2
Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). View this failed invocation of the CLA check for more information. For the most up to date status, view the checks section at the bottom of the pull request.
The GitHub CLA check doesn't recognize the noreply user @a-googler <no****ly@google.com>. How shall I proceed? Should I use an interactive rebase to edit the author of the related commits?
Integration of a `flawless_logit`.
By normalizing the logits for each token, the model produces predictions that are more balanced and less likely to be dominated by any single token.
Subtracting the normalized sum helps reduce bias and makes the logits more representative of the actual data distribution.
Initial tests on Gemma2 7B suggest improved performance at inference time.
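As a rough sketch of the adjustment this description refers to, based only on the prose above (the actual `flawless_logit` implementation is in the diff; the choice of L2 normalization and the exact form of the subtracted term are assumptions here):

```python
import jax.numpy as jnp


def flawless_logit(logits: jnp.ndarray) -> jnp.ndarray:
  """Illustrative sketch: per-token logit normalization plus bias subtraction.

  `logits` has shape [..., vocab_size]; the last axis is the vocabulary.
  """
  # Normalize each token's logit vector so its overall scale does not dominate.
  norm = jnp.linalg.norm(logits, axis=-1, keepdims=True)
  normalized = logits / (norm + 1e-6)
  # Subtract the normalized sum (taken here as the mean of the normalized
  # logits) so a constant per-token offset does not bias the prediction.
  return normalized - jnp.mean(normalized, axis=-1, keepdims=True)
```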