From 60912658f1d25b08e792792629f12db2b2bac834 Mon Sep 17 00:00:00 2001
From: Nathan Lambert
Date: Wed, 7 Feb 2024 03:34:57 +0000
Subject: [PATCH] nit

---
 analysis/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/analysis/README.md b/analysis/README.md
index 5388ea8e..61777a33 100644
--- a/analysis/README.md
+++ b/analysis/README.md
@@ -6,7 +6,7 @@ So far, we have the following tools:
 ### Per token uterrance reward
 This returns the reward per-token to show how the reward evolves over a piece of text.
 ```
-python analysis/per_token_reward.py --model=OpenAssistant/reward-model-deberta-v3-large-v2 --chat_template=raw --text="I love to walk the dog, what do you like?"
+python analysis/per_token_reward.py --model=OpenAssistant/reward-model-deberta-v3-large-v2 --text="I love to walk the dog, what do you like?"
 ```
 E.g. with OpenAssistant/reward-model-deberta-v3-large-v2
 Reward: -0.544 | Substring: I