Skip to content

Actions: allenai/reward-bench

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
559 workflow runs
559 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Clean up model loading system (#36)
Tests #109: Commit 7ae5c10 pushed by natolambert
February 23, 2024 01:49 5m 2s main
February 23, 2024 01:49 5m 2s
Clean up model loading system
Tests #108: Pull request #36 synchronize by natolambert
February 23, 2024 00:04 3m 58s model_handling
February 23, 2024 00:04 3m 58s
Clean up model loading system
Tests #107: Pull request #36 synchronize by natolambert
February 22, 2024 23:59 4m 34s model_handling
February 22, 2024 23:59 4m 34s
DPO
Tests #106: Pull request #31 synchronize by ValentinaPy
February 22, 2024 23:09 4m 29s valpy-dpo
February 22, 2024 23:09 4m 29s
DPO
Tests #105: Pull request #31 synchronize by ValentinaPy
February 22, 2024 22:51 3m 39s valpy-dpo
February 22, 2024 22:51 3m 39s
DPO
Tests #104: Pull request #31 synchronize by ValentinaPy
February 22, 2024 22:51 4m 34s valpy-dpo
February 22, 2024 22:51 4m 34s
DPO
Tests #103: Pull request #31 synchronize by ValentinaPy
February 22, 2024 22:46 3m 49s valpy-dpo
February 22, 2024 22:46 3m 49s
Clean up model loading system
Tests #102: Pull request #36 synchronize by natolambert
February 22, 2024 22:35 3m 48s model_handling
February 22, 2024 22:35 3m 48s
Clean up model loading system
Tests #101: Pull request #36 opened by natolambert
February 22, 2024 22:33 4m 10s model_handling
February 22, 2024 22:33 4m 10s
Best of N pipeline + tests (#30)
Tests #100: Commit 3f34a36 pushed by natolambert
February 22, 2024 20:11 4m 30s main
February 22, 2024 20:11 4m 30s
Best of N pipeline + tests
Tests #99: Pull request #30 synchronize by natolambert
February 22, 2024 20:07 3m 58s b_o_n
February 22, 2024 20:07 3m 58s
Best of N pipeline + tests
Tests #98: Pull request #30 synchronize by natolambert
February 22, 2024 19:56 3m 51s b_o_n
February 22, 2024 19:56 3m 51s
Add linechart capability for per-token rewards (#33)
Tests #97: Commit cf82f2a pushed by ljvmiranda921
February 22, 2024 07:11 3m 46s main
February 22, 2024 07:11 3m 46s
Best of N pipeline + tests
Tests #96: Pull request #30 synchronize by natolambert
February 21, 2024 23:16 3m 58s b_o_n
February 21, 2024 23:16 3m 58s
Best of N pipeline + tests
Tests #95: Pull request #30 synchronize by natolambert
February 21, 2024 22:22 4m 54s b_o_n
February 21, 2024 22:22 4m 54s
Best of N pipeline + tests
Tests #94: Pull request #30 synchronize by natolambert
February 21, 2024 21:55 3m 46s b_o_n
February 21, 2024 21:55 3m 46s
Add linechart capability for per-token rewards
Tests #93: Pull request #33 synchronize by ljvmiranda921
February 21, 2024 17:08 4m 42s per-token-line
February 21, 2024 17:08 4m 42s
Add linechart capability for per-token rewards
Tests #92: Pull request #33 synchronize by ljvmiranda921
February 21, 2024 00:22 4m 14s per-token-line
February 21, 2024 00:22 4m 14s
Plot subset distribution across all models (#32)
Tests #91: Commit 2dbe89c pushed by natolambert
February 20, 2024 23:50 3m 44s main
February 20, 2024 23:50 3m 44s
Add linechart capability for per-token rewards
Tests #90: Pull request #33 opened by ljvmiranda921
February 20, 2024 23:32 4m 29s per-token-line
February 20, 2024 23:32 4m 29s
Best of N pipeline + tests
Tests #89: Pull request #30 synchronize by natolambert
February 20, 2024 22:31 4m 27s b_o_n
February 20, 2024 22:31 4m 27s
Plot subset distribution across all models
Tests #88: Pull request #32 synchronize by natolambert
February 20, 2024 22:07 3m 57s plot_subsets
February 20, 2024 22:07 3m 57s
Plot subset distribution across all models
Tests #87: Pull request #32 opened by natolambert
February 20, 2024 22:00 5m 1s plot_subsets
February 20, 2024 22:00 5m 1s
DPO
Tests #86: Pull request #31 synchronize by ValentinaPy
February 20, 2024 19:56 4m 49s valpy-dpo
February 20, 2024 19:56 4m 49s
DPO
Tests #85: Pull request #31 synchronize by ValentinaPy
February 20, 2024 19:52 3m 47s valpy-dpo
February 20, 2024 19:52 3m 47s
ProTip! You can narrow down the results and go further in time using created:<2024-02-20 or the other filters available.