Pull requests: huggingface/trl
Add generation_kwargs as a property of GRPOConfig to support additional generation arguments.
#3617 opened Jun 18, 2025 by pramodith • 4 of 5 tasks
ClearML logging of visualization in RewardTrainer evaluation
#3602 opened Jun 16, 2025 by ioverho • 2 of 5 tasks
[GRPO] Fix prompt truncation (max_prompt_length) with vLLM.
#3601 opened Jun 16, 2025 by LeonEricsson • 2 of 5 tasks
📜 Add chat_template_source parameter to SFTConfig
#3599 opened Jun 16, 2025 by qgallouedec • 5 tasks
Add entropy based filtering inside the GRPOTrainer.
#3563 opened Jun 10, 2025 by pramodith • 4 of 5 tasks
🎀 New defaults: gradient_checkpointing=True
#3510 opened May 29, 2025 by qgallouedec • 5 tasks
Add Bidirectional Knowledge Distillation Option to GKDTrainer
#3508 opened May 29, 2025 by shaischaudhry • 3 of 5 tasks
[GRPO] Pad per minibatch instead of per generation batch
#3495 opened May 26, 2025 by edbeeching • Draft • 3 tasks
feat(grpo): validate gradient_accumulation_steps vs steps_per_generation for on-policy GRPO
#3493 opened May 25, 2025 by HarryHsing
Allow a user to train from a local dataset
#3470 opened May 19, 2025 by gogo2464 • 1 of 5 tasks
Add async tool-enabled vLLM server for GRPO training via OpenAI-compatible interface
#3469 opened May 19, 2025 by BjarniHaukur • 5 tasks