Labels
32 labels
- Additional information or clarification is required to proceed
- Related to PEFT
- Related to accelerate
- New feature or request
- Seeking clarification or more information
- Related to DPO
- Related to DDPO
- Related to GKD
- Related to GRPO
- Related to Iterative SFT
- Related to KTO
- Related to Online DPO
- Related to ORPO
- Related to PPO
- Related to PRM
- Related to Reward modelling
- Related to RLOO
- Related to SFT
- Related to XPO
- Something isn't working
- Related to Visual Language Models
- This issue or pull request already exists
- Related to judges
- Good for newcomers
- Improvements or additions to documentation
- Related to the command-line interface
- Related to data
- No update from the author, will be closed soon
- Open invitation for community members to contribute
- Related to DeepSpeed