Actions: huggingface/lighteval

Quality

176 workflow run results

Small fix to be able to use extensions of nanotron configs
Quality #188: Pull request #58 opened by thomwolf
February 26, 2024 23:06 16s fixxx-brrr
Tweak installation / usage sections of README (#55)
Quality #187: Commit 480d85e pushed by lewtun
February 26, 2024 14:55 22s main
Tweak installation / usage sections of README
Quality #186: Pull request #55 synchronize by lewtun
February 26, 2024 14:30 17s tweak-readme
February 26, 2024 14:19 16s
Tweak installation / usage sections of README
Quality #184: Pull request #55 opened by lewtun
February 26, 2024 14:14 16s tweak-readme
Adding support for Arabic benchmarks : AceGPT benchmarking suite
Quality #181: Pull request #44 synchronize by alielfilali01
February 26, 2024 12:40 20s main
Adding custom metric system + IFEval as an example
Quality #180: Pull request #48 synchronize by clefourrier
February 26, 2024 11:22 16s clem_customizable_metrics
Adding custom metric system + IFEval as an example
Quality #179: Pull request #48 synchronize by NathanHB
February 26, 2024 11:13 19s clem_customizable_metrics
Adding custom metric system + IFEval as an example
Quality #178: Pull request #48 synchronize by clefourrier
February 26, 2024 07:52 18s clem_customizable_metrics
Adding support for Arabic benchmarks : AceGPT benchmarking suite
Quality #177: Pull request #44 synchronize by alielfilali01
February 24, 2024 12:01 17s main
Adding support for Arabic benchmarks : AceGPT benchmarking suite
Quality #176: Pull request #44 synchronize by NathanHB
February 24, 2024 10:04 18s main
New mechanism for evaluation contributions (#47)
Quality #175: Commit 92e9b50 pushed by NathanHB
February 24, 2024 10:03 15s main
Adding custom metric system + IFEval as an example
Quality #174: Pull request #48 synchronize by clefourrier
February 23, 2024 19:23 16s clem_customizable_metrics
Adding custom metric system + IFEval as an example
Quality #173: Pull request #48 synchronize by clefourrier
February 23, 2024 15:53 24s clem_customizable_metrics
Adding custom metric system + IFEval as an example
Quality #172: Pull request #48 synchronize by clefourrier
February 23, 2024 09:54 22s clem_customizable_metrics
Adding support for Arabic benchmarks : AceGPT benchmarking suite
Quality #170: Pull request #44 synchronize by alielfilali01
February 22, 2024 17:40 6m 53s main
New mechanism for evaluation contributions
Quality #169: Pull request #47 synchronize by clefourrier
February 22, 2024 16:20 18s clem_custom_tasks_examples
Add GPQA (#42)
Quality #168: Commit 831ad47 pushed by clefourrier
February 22, 2024 16:09 23s main
Add GPQA
Quality #167: Pull request #42 synchronize by clefourrier
February 22, 2024 15:19 19s clem_add_gpqa
Adding support for Arabic benchmarks : AceGPT benchmarking suite
Quality #166: Pull request #44 synchronize by alielfilali01
February 22, 2024 14:44 22s main
Improve the current chat template system (#38)
Quality #165: Commit 81fc8fd pushed by clefourrier
February 22, 2024 14:38 18s main
New mechanism for evaluation contributions
Quality #164: Pull request #47 synchronize by clefourrier
February 22, 2024 14:34 15s clem_custom_tasks_examples