[Epic] Data Pipeline for RLHF Tuning #392
Labels: enhancement, epic
To perform RLHF, we need to collect human preference feedback when tuning models. Several steps must be completed first so that the UI can support this.
We define the epic as follows:
The implementations are left as exercises for the reader
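As a possible starting point, here is a minimal sketch of what a single collected preference could look like once the UI captures it. Everything in it is an assumption for illustration, not part of this epic's definition: the `PreferenceRecord` name, its fields, the pairwise "a" vs "b" format, and the JSON Lines storage are all hypothetical.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class PreferenceRecord:
    """One pairwise human preference judgment (hypothetical schema)."""
    prompt: str
    response_a: str
    response_b: str
    preferred: str  # assumed convention: "a" or "b"

    def __post_init__(self):
        # Reject anything other than the two assumed labels.
        if self.preferred not in ("a", "b"):
            raise ValueError("preferred must be 'a' or 'b'")


def to_jsonl(records):
    """Serialize records as JSON Lines, one judgment per line."""
    return "\n".join(json.dumps(asdict(r)) for r in records)


def to_chosen_rejected(record):
    """Return (prompt, chosen, rejected), a shape often used for reward modeling."""
    if record.preferred == "a":
        return record.prompt, record.response_a, record.response_b
    return record.prompt, record.response_b, record.response_a


if __name__ == "__main__":
    rec = PreferenceRecord(
        prompt="Summarize the report.",
        response_a="Short summary.",
        response_b="Long rambling summary.",
        preferred="a",
    )
    print(to_jsonl([rec]))
    print(to_chosen_rejected(rec))
```

A pairwise format is only one option; if the UI ends up collecting ratings or rankings instead, the record would need different fields.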