Skip to content

Commit

Permalink
Merge pull request #12 from allenai/beaver
Browse files Browse the repository at this point in the history
Add Beaver model from PKU-Alignment
  • Loading branch information
natolambert authored Feb 8, 2024
2 parents 5a8f218 + 1bf0493 commit 17545a7
Show file tree
Hide file tree
Showing 3 changed files with 501 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -45,6 +45,8 @@ python scripts/run_rm.py --model=weqweasdas/hh_rlhf_rm_open_llama_3b --chat_temp
python scripts/run_rm.py --model=llm-blender/PairRM-hf --direct_load
python scripts/run_rm.py --model=berkeley-nest/Starling-RM-7B-alpha --tokenizer=meta-llama/Llama-2-7b-chat-hf --chat_template=llama-2 --direct_load --batch_size=16
python scripts/run_rm.py --model=stanfordnlp/SteamSHP-flan-t5-xl --direct_load --batch_size=32
python scripts/run_rm.py --model=PKU-Alignment/beaver-7b-v1.0-reward --chat_template=pku-align --direct_load --batch_size=16
python scripts/run_rm.py --model=PKU-Alignment/beaver-7b-v1.0-cost --chat_template=pku-align --direct_load --batch_size=16
```

And for DPO:
Expand Down
Loading

0 comments on commit 17545a7

Please sign in to comment.