Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Epic] Data Pipeline for RLHF Tuning #392

Open
4 tasks
RobotSail opened this issue Dec 7, 2024 · 0 comments
Open
4 tasks

[Epic] Data Pipeline for RLHF Tuning #392

RobotSail opened this issue Dec 7, 2024 · 0 comments
Labels
enhancement New feature or request epic Features will contain multiple smaller enhancements, bugs, documentation, or PoCs
Milestone

Comments

@RobotSail
Copy link
Member

RobotSail commented Dec 7, 2024

In order to perform RLHF, we would like to collect feedback on human preference when tuning models. In order to accomplish this, there are a number of steps which must first be completed to allow the UI to support this.

We define the epic as follows:

The implementations are left as exercises for the reader

@vishnoianil vishnoianil added enhancement New feature or request epic Features will contain multiple smaller enhancements, bugs, documentation, or PoCs labels Dec 17, 2024
@vishnoianil vishnoianil added this to UI Dec 17, 2024
@vishnoianil vishnoianil moved this to Backlog in UI Dec 17, 2024
@vishnoianil vishnoianil added this to the release-1.2 milestone Dec 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request epic Features will contain multiple smaller enhancements, bugs, documentation, or PoCs
Projects
Status: Backlog
Development

No branches or pull requests

2 participants