Actions: allenai/reward-bench

Showing runs from all workflows
1,147 workflow runs

Best of N pipeline + tests
Quality #49: Pull request #30 synchronize by natolambert
February 15, 2024 22:00 3m 30s b_o_n
Best of N pipeline + tests
Tests #48: Pull request #30 opened by natolambert
February 15, 2024 21:41 4m 0s b_o_n
Best of N pipeline + tests
Quality #48: Pull request #30 opened by natolambert
February 15, 2024 21:41 3m 35s b_o_n
Per token multiple rms
Tests #47: Pull request #29 opened by khyathiraghavi
February 15, 2024 21:31 3m 43s per-token-multiple-rms
Per token multiple rms
Quality #47: Pull request #29 opened by khyathiraghavi
February 15, 2024 21:31 3m 25s per-token-multiple-rms
Per token multiple rms
Quality #46: Pull request #28 opened by khyathiraghavi
February 15, 2024 21:25 3m 36s per-token-multiple-rms
Per token multiple rms
Tests #46: Pull request #28 opened by khyathiraghavi
February 15, 2024 21:25 3m 45s per-token-multiple-rms
visualizing multiple rewards
Quality #45: Pull request #27 opened by khyathiraghavi
February 15, 2024 21:19 3m 40s per-token-multiple-rms
visualizing multiple rewards
Tests #45: Pull request #27 opened by khyathiraghavi
February 15, 2024 21:19 4m 23s per-token-multiple-rms
Add model type to results
Tests #44: Pull request #26 synchronize by natolambert
February 15, 2024 17:45 4m 6s model_type
Add model type to results
Quality #44: Pull request #26 synchronize by natolambert
February 15, 2024 17:45 3m 28s model_type
Add model type to results
Tests #43: Pull request #26 opened by natolambert
February 15, 2024 17:44 4m 36s model_type
Add model type to results
Quality #43: Pull request #26 opened by natolambert
February 15, 2024 17:44 3m 33s model_type
Update per token reward
Tests #42: Pull request #25 opened by ljvmiranda921
February 14, 2024 20:32 4m 5s update/per-token-reward
Update per token reward
Quality #42: Pull request #25 opened by ljvmiranda921
February 14, 2024 20:32 3m 41s update/per-token-reward
Clean repo (#23)
Quality #41: Commit 84c0a9b pushed by natolambert
February 14, 2024 16:49 4m 0s main
Clean repo (#23)
Tests #41: Commit 84c0a9b pushed by natolambert
February 14, 2024 16:49 4m 27s main
Clean repo
Tests #40: Pull request #23 opened by natolambert
February 13, 2024 22:22 4m 57s clean
Clean repo
Quality #40: Pull request #23 opened by natolambert
February 13, 2024 22:22 3m 39s clean
Merge pull request #21 from allenai/save_scores
Tests #39: Commit f441f9c pushed by natolambert
February 13, 2024 01:01 3m 51s main
Merge pull request #21 from allenai/save_scores
Quality #39: Commit f441f9c pushed by natolambert
February 13, 2024 01:01 4m 17s main
Save scores per prompt
Quality #38: Pull request #21 synchronize by natolambert
February 12, 2024 23:46 3m 37s save_scores
Save scores per prompt
Tests #38: Pull request #21 synchronize by natolambert
February 12, 2024 23:46 4m 59s save_scores
Save scores per prompt
Quality #37: Pull request #21 synchronize by natolambert
February 12, 2024 23:27 3m 44s save_scores
Save scores per prompt
Tests #37: Pull request #21 synchronize by natolambert
February 12, 2024 23:27 4m 4s save_scores