-
Notifications
You must be signed in to change notification settings - Fork 30.7k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add max_eval_batches argument to TrainingArguments
#41524
opened Oct 11, 2025 by
KaparthyReddy
Loading…
Add test coverage for ConvNextImageProcessorFast
#41523
opened Oct 11, 2025 by
KaparthyReddy
Loading…
Fix _init_weights to safely skip int8 quantized weights
#41522
opened Oct 11, 2025 by
KaparthyReddy
Loading…
Fix forced_bos_token_id not set in generation_config
#41521
opened Oct 11, 2025 by
Addyk-24
Loading…
2 of 5 tasks
[ci] Disable workflows with secrets and custom runners to run on fork
#41515
opened Oct 10, 2025 by
HollowMan6
Loading…
1 of 5 tasks
[don't merge yet] Remove some custom datasets defined in codebase
#41511
opened Oct 10, 2025 by
ydshieh
Loading…
🚨 [v5]
generate
delegates default cache initialization to the model
#41505
opened Oct 10, 2025 by
gante
Loading…
🌐 [i18n-KO] Translated
ko-LFM2.md
to Korean
#41502
opened Oct 10, 2025 by
ssum21
Loading…
10 tasks done
Add skip_unnecessary_grad_clip to TrainingArguments for optimized gradient clipping
#41491
opened Oct 9, 2025 by
vaibhavgarg230
Loading…
3 tasks done
Fix _init_weights to safely skip int8 tensors in Qwen2_5_VL model
#41490
opened Oct 9, 2025 by
KaparthyReddy
Loading…
🚨 [v5] Toggle the serialization format in processors
#41474
opened Oct 9, 2025 by
zucchini-nlp
Loading…
Adding ScatterMoE kernel support for Granite models.
#41458
opened Oct 8, 2025 by
shawntan
Loading…
1 of 5 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-09-11.