[Bug Report] Padding side inconsistency with Huggingface Transformers #801
Labels
bug
Something isn't working
complexity-moderate
Moderately complicated issues for people who have intermediate experience with the code
needs-investigation
Issues that need to be recreated, or investigated before work can be done
Describe the bug
The HookedTransformer tokenizer has the padding side set to "right" for Gemma 2 2b. However, the huggingface autotokenizer has the padding side set to "left." I'm not sure why these are inconsistent.
Code example
Output:
System Info
Linux system: installed using pip in a Python 3.10.12 virtualenv. Package versions are:
Checklist
The text was updated successfully, but these errors were encountered: