efrick2002/Starling

Starling-7B: Improving Helpfulness and Harmlessness with RLAIF

This is the code base for the Starling project from UC Berkeley. It covers the full pipeline, from dataset curation to reward model training to PPO finetuning.

The code base is split into three parts:

  1. Nectar: All code pertaining to dataset curation, including prompt sourcing, response distillation, and judgment curation.
  2. Reward Model Training: All code pertaining to reward model training using the Nectar dataset.
  3. trlx: All code pertaining to PPO finetuning, in a customized fork of the original trlx codebase.*

Each part has its own documentation.

*Note: the original trlx codebase appears to be no longer maintained, so parts of this code may be outdated or incompatible with newer systems.
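
To make the reward model training step (part 2) more concrete, here is a minimal sketch of one common way to turn ranked responses into a training signal: a pairwise Bradley-Terry ranking loss. This is an illustration under stated assumptions, not the code in this repository. The function name `pairwise_ranking_loss` and the input format (one list of scalar reward scores per prompt, ordered best to worst) are hypothetical, and the loss actually used by the project may differ, so consult the Reward Model Training documentation for the real implementation.

```python
import math


def pairwise_ranking_loss(rewards_by_rank):
    """Average Bradley-Terry loss over all (better, worse) response pairs.

    rewards_by_rank: scalar reward scores for one prompt's responses,
    ordered from best-ranked to worst-ranked (hypothetical input format,
    assumed for illustration only).
    """
    if len(rewards_by_rank) < 2:
        raise ValueError("need at least two ranked responses")

    losses = []
    for i in range(len(rewards_by_rank)):
        for j in range(i + 1, len(rewards_by_rank)):
            # Response i is ranked above response j, so we want a positive margin.
            margin = rewards_by_rank[i] - rewards_by_rank[j]
            # -log(sigmoid(margin)) == softplus(-margin), computed stably.
            x = -margin
            losses.append(max(x, 0.0) + math.log1p(math.exp(-abs(x))))
    return sum(losses) / len(losses)


# Scores that agree with the ranking give a lower loss than scores that reverse it.
agree = pairwise_ranking_loss([2.0, 1.0, 0.0])
reverse = pairwise_ranking_loss([0.0, 1.0, 2.0])
print(agree < reverse)
```

In a real training loop the scores would come from a reward model and the loss would be minimized with an autodiff framework; plain Python is used here only to keep the sketch self-contained.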

Citation

@misc{starling2024,
  title = {Starling-7B: Improving Helpfulness and Harmlessness with RLAIF},
  author = {Zhu, Banghua and Frick, Evan and Wu, Tianhao and Zhu, Hanlin and Ganesan, Karthik and Chiang, Wei-lin and Zhang, Jian and Jiao, Jiantao},
  booktitle = {First Conference on Language Modeling},
  year = {2024},
}
