Commit

llm
PhilipMay committed Dec 29, 2023
1 parent b8f53ae commit be22e35
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions source/machine-learning/llm.md
@@ -2,6 +2,10 @@

## Base Knowledge

- [HF: The Alignment Handbook](https://github.com/huggingface/alignment-handbook)

## Specific Techniques

- Direct Preference Optimization (DPO)
  - Paper: <https://arxiv.org/abs/2305.18290>
  - <https://plainenglish.io/community/direct-preference-optimization-dpo-a-simplified-approach-to-fine-tuning-large-language-models>