From be22e35ab306d47b394d8dd47779d8460dc5b150 Mon Sep 17 00:00:00 2001
From: PhilipMay
Date: Fri, 29 Dec 2023 15:12:18 +0100
Subject: [PATCH] llm

---
 source/machine-learning/llm.md | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/source/machine-learning/llm.md b/source/machine-learning/llm.md
index 6107792..df726a6 100644
--- a/source/machine-learning/llm.md
+++ b/source/machine-learning/llm.md
@@ -2,6 +2,10 @@
 
 ## Base Knowledge
 
+- [HF: The Alignment Handbook](https://github.com/huggingface/alignment-handbook)
+
+## Specific Techniques
+
 - Direct Preference Optimization (DPO)
   - Paper:
   -