Skip to content

Actions: allenai/reward-bench

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
559 workflow runs
559 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Update per token reward
Tests #59: Pull request #25 synchronize by ljvmiranda921
February 19, 2024 18:40 4m 23s update/per-token-reward
February 19, 2024 18:40 4m 23s
Update per token reward
Tests #58: Pull request #25 synchronize by ljvmiranda921
February 19, 2024 18:40 4m 7s update/per-token-reward
February 19, 2024 18:40 4m 7s
Update per token reward
Tests #57: Pull request #25 synchronize by ljvmiranda921
February 19, 2024 18:37 4m 3s update/per-token-reward
February 19, 2024 18:37 4m 3s
Update per token reward
Tests #56: Pull request #25 synchronize by ljvmiranda921
February 19, 2024 18:36 3m 42s update/per-token-reward
February 19, 2024 18:36 3m 42s
Update per token reward
Tests #55: Pull request #25 synchronize by ljvmiranda921
February 19, 2024 18:35 3m 46s update/per-token-reward
February 19, 2024 18:35 3m 46s
Update per token reward
Tests #54: Pull request #25 synchronize by ljvmiranda921
February 19, 2024 17:47 3m 48s update/per-token-reward
February 19, 2024 17:47 3m 48s
Best of N pipeline + tests
Tests #53: Pull request #30 synchronize by natolambert
February 16, 2024 00:47 3m 51s b_o_n
February 16, 2024 00:47 3m 51s
Add model type to results (#26)
Tests #52: Commit 060d9c2 pushed by natolambert
February 15, 2024 22:44 4m 8s main
February 15, 2024 22:44 4m 8s
Best of N pipeline + tests
Tests #51: Pull request #30 synchronize by natolambert
February 15, 2024 22:41 3m 39s b_o_n
February 15, 2024 22:41 3m 39s
Best of N pipeline + tests
Tests #50: Pull request #30 synchronize by natolambert
February 15, 2024 22:36 4m 59s b_o_n
February 15, 2024 22:36 4m 59s
Best of N pipeline + tests
Tests #49: Pull request #30 synchronize by natolambert
February 15, 2024 22:00 3m 49s b_o_n
February 15, 2024 22:00 3m 49s
Best of N pipeline + tests
Tests #48: Pull request #30 opened by natolambert
February 15, 2024 21:41 4m 0s b_o_n
February 15, 2024 21:41 4m 0s
Per token multiple rms
Tests #47: Pull request #29 opened by khyathiraghavi
February 15, 2024 21:31 3m 43s per-token-multiple-rms
February 15, 2024 21:31 3m 43s
Per token multiple rms
Tests #46: Pull request #28 opened by khyathiraghavi
February 15, 2024 21:25 3m 45s per-token-multiple-rms
February 15, 2024 21:25 3m 45s
visualizing multiple rewards
Tests #45: Pull request #27 opened by khyathiraghavi
February 15, 2024 21:19 4m 23s per-token-multiple-rms
February 15, 2024 21:19 4m 23s
Add model type to results
Tests #44: Pull request #26 synchronize by natolambert
February 15, 2024 17:45 4m 6s model_type
February 15, 2024 17:45 4m 6s
Add model type to results
Tests #43: Pull request #26 opened by natolambert
February 15, 2024 17:44 4m 36s model_type
February 15, 2024 17:44 4m 36s
Update per token reward
Tests #42: Pull request #25 opened by ljvmiranda921
February 14, 2024 20:32 4m 5s update/per-token-reward
February 14, 2024 20:32 4m 5s
Clean repo (#23)
Tests #41: Commit 84c0a9b pushed by natolambert
February 14, 2024 16:49 4m 27s main
February 14, 2024 16:49 4m 27s
Clean repo
Tests #40: Pull request #23 opened by natolambert
February 13, 2024 22:22 4m 57s clean
February 13, 2024 22:22 4m 57s
Merge pull request #21 from allenai/save_scores
Tests #39: Commit f441f9c pushed by natolambert
February 13, 2024 01:01 3m 51s main
February 13, 2024 01:01 3m 51s
Save scores per prompt
Tests #38: Pull request #21 synchronize by natolambert
February 12, 2024 23:46 4m 59s save_scores
February 12, 2024 23:46 4m 59s
Save scores per prompt
Tests #37: Pull request #21 synchronize by natolambert
February 12, 2024 23:27 4m 4s save_scores
February 12, 2024 23:27 4m 4s
Save scores per prompt
Tests #36: Pull request #21 opened by natolambert
February 12, 2024 23:08 3m 54s save_scores
February 12, 2024 23:08 3m 54s
Merge pull request #20 from allenai/data_formatting
Tests #35: Commit a4eec4a pushed by natolambert
February 12, 2024 22:18 4m 2s main
February 12, 2024 22:18 4m 2s
ProTip! You can narrow down the results and go further in time using created:<2024-02-12 or the other filters available.