Skip to content

Commit

Permalink
llm
Browse files Browse the repository at this point in the history
  • Loading branch information
PhilipMay committed Dec 29, 2023
1 parent 6114607 commit b8f53ae
Showing 1 changed file with 9 additions and 0 deletions.
9 changes: 9 additions & 0 deletions source/machine-learning/llm.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,16 @@
# LLM

## Base Knowledge

- Direct Preference Optimization (DPO)
- Paper: <https://arxiv.org/abs/2305.18290>
- <https://plainenglish.io/community/direct-preference-optimization-dpo-a-simplified-approach-to-fine-tuning-large-language-models>
- <https://huggingface.co/blog/dpo-trl>

## Specific Models

- Argilla Notux
- based on Mixtral
- HF Model: <https://huggingface.co/argilla/notux-8x7b-v1>
- Dataset: <https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences-cleaned>
- Code: <https://github.com/argilla-io/notus>

0 comments on commit b8f53ae

Please sign in to comment.