Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
natolambert authored Feb 7, 2024
1 parent ed1bffa commit 0979835
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions analysis/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -9,6 +9,7 @@ This returns the reward per-token to show how the reward evolves over a piece of
python analysis/per_token_reward.py --model=OpenAssistant/reward-model-deberta-v3-large-v2 --text="I love to walk the dog, what do you like?"
```
E.g. with OpenAssistant/reward-model-deberta-v3-large-v2
```
Reward: -0.544 | Substring: I
Reward: -0.556 | Substring: I love
Reward: -0.566 | Substring: I love to
Expand All @@ -21,7 +22,7 @@ Reward: 0.085 | Substring: I love to walk the dog, what do
Reward: 0.089 | Substring: I love to walk the dog, what do you
Reward: 0.09 | Substring: I love to walk the dog, what do you like
Reward: 0.093 | Substring: I love to walk the dog, what do you like?

```
### Model usage within eval. dataset
To run this, execute:
```
Expand Down Expand Up @@ -80,4 +81,4 @@ This will also return the following table by default:
| tulu-30b | 2 | 2 | 0 |
| vicuna-33b-v1.3 | 1 | 1 | 0 |

Total number of models involved: 44
Total number of models involved: 44

0 comments on commit 0979835

Please sign in to comment.