Actions: allenai/reward-bench

Tests

559 workflow runs
Tests #34: Pull request #20 opened by natolambert
February 12, 2024 21:58 4m 22s data_formatting
Merge pull request #18 from allenai/docker-eval
Tests #33: Commit 2f7a287 pushed by jacob-morrison
February 12, 2024 19:57 4m 37s main
Add docker image and script for submitting eval jobs
Tests #32: Pull request #18 synchronize by jacob-morrison
February 12, 2024 19:18 3m 47s docker-eval
Add docker image and script for submitting eval jobs
Tests #31: Pull request #18 synchronize by jacob-morrison
February 12, 2024 19:15 4m 5s docker-eval
Add docker image and script for submitting eval jobs
Tests #30: Pull request #18 synchronize by jacob-morrison
February 12, 2024 18:58 4m 11s docker-eval
Add function to get subtoken statistics (#17)
Tests #29: Commit 85b6a4a pushed by ljvmiranda921
February 9, 2024 22:15 4m 8s main
Merge pull request #13 from allenai/beaver_fix
Tests #28: Commit 0299429 pushed by natolambert
February 9, 2024 19:21 3m 59s main
Beaver fix; working towards another model
Tests #27: Pull request #13 synchronize by natolambert
February 9, 2024 19:15 3m 43s beaver_fix
Add function to get subtoken statistics
Tests #26: Pull request #17 opened by ljvmiranda921
February 9, 2024 18:38 3m 52s add/subtoken-counter
Fix code formatting (#15)
Tests #25: Commit a4a5f38 pushed by ljvmiranda921
February 9, 2024 04:47 4m 41s main
Fix table loading from viewer updates (#14)
Tests #24: Commit 3320617 pushed by ljvmiranda921
February 9, 2024 00:52 4m 5s main
Beaver fix; working towards another model
Tests #23: Pull request #13 synchronize by natolambert
February 8, 2024 18:43 4m 10s beaver_fix
Beaver fix; working towards another model
Tests #22: Pull request #13 opened by natolambert
February 8, 2024 18:04 4m 0s beaver_fix
Merge pull request #12 from allenai/beaver
Tests #21: Commit 17545a7 pushed by natolambert
February 8, 2024 17:53 4m 2s main
Add Beaver model from PKU-Alignment
Tests #20: Pull request #12 synchronize by natolambert
February 8, 2024 04:47 3m 57s beaver
Add Beaver model from PKU-Alignment
Tests #19: Pull request #12 synchronize by natolambert
February 8, 2024 04:27 3m 50s beaver
Add Beaver model from PKU-Alignment
Tests #18: Pull request #12 opened by natolambert
February 8, 2024 04:08 3m 30s beaver
Add functionality to report results on Markdown tables (#10)
Tests #17: Commit 5a8f218 pushed by ljvmiranda921
February 8, 2024 02:01 4m 14s main
Update README.md
Tests #16: Commit 0979835 pushed by natolambert
February 7, 2024 21:21 3m 45s main
Merge pull request #9 from allenai/per_token
Tests #15: Commit ed1bffa pushed by natolambert
February 7, 2024 21:09 3m 25s main
Print per-token reward over an RM
Tests #14: Pull request #9 synchronize by natolambert
February 7, 2024 03:40 4m 21s per_token
Print per-token reward over an RM
Tests #13: Pull request #9 synchronize by natolambert
February 7, 2024 03:35 4m 18s per_token
Print per-token reward over an RM
Tests #12: Pull request #9 opened by natolambert
February 7, 2024 03:31 4m 17s per_token
Merge pull request #8 from allenai/fix_tests
Tests #11: Commit 43ab903 pushed by natolambert
February 6, 2024 19:25 4m 1s main
Fix test failing on main
Tests #10: Pull request #8 opened by natolambert
February 6, 2024 19:21 4m 3s fix_tests