huggingface / transformers Public

Notifications You must be signed in to change notification settings
Fork 28k
Star 140k

Code
Issues 996
Pull requests 535
Actions
Projects 1
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: huggingface/transformers

Labels 131 Milestones 0

New pull request New

535 Open 18,510 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Add missing atol to torch.testing.assert_close where rtol is specified

#36234 opened Feb 17, 2025 by ivarflakstad

Loading…

[tests] fix more device bugs

#36233 opened Feb 17, 2025 by faaany • Draft

Add evolla rebase main

#36232 opened Feb 17, 2025 by zhoubay

Loading…

5 tasks done

Prevent Reinitialization of Resized LM Head When tie_word_embeddings is False #35141

#36221 opened Feb 16, 2025 by sambhavnoobcoder

Loading…

fix: prevent second save in the end of training if last step was saved already

#36219 opened Feb 16, 2025 by NosimusAI

Loading…

2 of 5 tasks

Improvements in attention_forward functions

#36218 opened Feb 15, 2025 by mseeger

Loading…

3 of 5 tasks

[WIP] Add a dedicated tokenizer for byte level transformers

#36216 opened Feb 15, 2025 by apehex

Loading…

🔴 [generate] default max_new_tokens

#36215 opened Feb 15, 2025 by gante

Loading…

Fix the eval_use_gather_object flag usage

#36214 opened Feb 15, 2025 by ducha-aiki

Loading…

[generate] remove cache v4.47 deprecations

#36212 opened Feb 15, 2025 by gante

Loading…

fix: condition bos_token_id and space as token

#36211 opened Feb 15, 2025 by desaxce

Loading…

Change Qwen2_VL image processors to have init and call accept the same kwargs

#36207 opened Feb 14, 2025 by yonigozlan

Loading…

Fix TorchAoConfig not JSON serializable

#36206 opened Feb 14, 2025 by andrewor14

Loading…

Append best model checkpoint with active adapter when not default

#36201 opened Feb 14, 2025 by Thomas26948

Loading…

1 of 5 tasks

Fix AutoProcessor loading error

#36199 opened Feb 14, 2025 by JJJYmmm

Loading…

1 of 5 tasks

Fixed dynamic module import when there is more than one dit in class …

#36198 opened Feb 14, 2025 by ExtReMLapin

Loading…

(ugly) Use parallelism=4 for check_repository_consistency

#36197 opened Feb 14, 2025 by ydshieh

Loading…

Flash Attention v3

#36190 opened Feb 14, 2025 by hlky • Draft

Qwen2VL fix cos,sin dtypes to float when used with deepspeed

#36188 opened Feb 14, 2025 by ArdalanM

Loading…

5 tasks

Remove differences between init and preprocess kwargs for fast image processors

#36186 opened Feb 13, 2025 by yonigozlan

Loading…

Add Got-OCR 2 Fast image processor and refactor slow one

#36185 opened Feb 13, 2025 by yonigozlan

Loading…

Try working around the processor registration bugs

#36184 opened Feb 13, 2025 by Rocketknight1 • Draft

Add MLCD model New model Vision

#36182 opened Feb 13, 2025 by tanhuajie

Loading…

5 tasks done

[CI] Check test if the GenerationTesterMixin inheritance is correct 🐛 🔫

#36180 opened Feb 13, 2025 by gante

Loading…

Just import torch AdamW instead

#36177 opened Feb 13, 2025 by Rocketknight1

Loading…

Previous 1 2 3 4 5 … 21 22 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly