Skip to content

Actions: huggingface/lighteval

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
352 workflow run results
352 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Adding support for Arabic benchmarks : AceGPT benchmarking suite
Quality #176: Pull request #44 synchronize by NathanHB
February 24, 2024 10:04 18s main
February 24, 2024 10:04 18s
New mechanism for evaluation contributions (#47)
Quality #175: Commit 92e9b50 pushed by NathanHB
February 24, 2024 10:03 15s main
February 24, 2024 10:03 15s
New mechanism for evaluation contributions (#47)
Tests #175: Commit 92e9b50 pushed by NathanHB
February 24, 2024 10:03 13m 39s main
February 24, 2024 10:03 13m 39s
Adding custom metric system + IFEval as an example
Tests #174: Pull request #48 synchronize by clefourrier
February 23, 2024 19:23 13m 59s clem_customizable_metrics
February 23, 2024 19:23 13m 59s
Adding custom metric system + IFEval as an example
Quality #174: Pull request #48 synchronize by clefourrier
February 23, 2024 19:23 16s clem_customizable_metrics
February 23, 2024 19:23 16s
Adding custom metric system + IFEval as an example
Tests #173: Pull request #48 synchronize by clefourrier
February 23, 2024 15:53 16m 54s clem_customizable_metrics
February 23, 2024 15:53 16m 54s
Adding custom metric system + IFEval as an example
Quality #173: Pull request #48 synchronize by clefourrier
February 23, 2024 15:53 24s clem_customizable_metrics
February 23, 2024 15:53 24s
Adding custom metric system + IFEval as an example
Tests #172: Pull request #48 synchronize by clefourrier
February 23, 2024 09:54 13m 6s clem_customizable_metrics
February 23, 2024 09:54 13m 6s
Adding custom metric system + IFEval as an example
Quality #172: Pull request #48 synchronize by clefourrier
February 23, 2024 09:54 22s clem_customizable_metrics
February 23, 2024 09:54 22s
Adding custom metric system + IFEval as an example
Tests #171: Pull request #48 opened by clefourrier
February 23, 2024 09:53 16m 7s clem_customizable_metrics
February 23, 2024 09:53 16m 7s
Adding support for Arabic benchmarks : AceGPT benchmarking suite
Tests #170: Pull request #44 synchronize by alielfilali01
February 22, 2024 17:40 12m 57s main
February 22, 2024 17:40 12m 57s
Adding support for Arabic benchmarks : AceGPT benchmarking suite
Quality #170: Pull request #44 synchronize by alielfilali01
February 22, 2024 17:40 6m 53s main
February 22, 2024 17:40 6m 53s
New mechanism for evaluation contributions
Tests #169: Pull request #47 synchronize by clefourrier
February 22, 2024 16:20 12m 17s clem_custom_tasks_examples
February 22, 2024 16:20 12m 17s
New mechanism for evaluation contributions
Quality #169: Pull request #47 synchronize by clefourrier
February 22, 2024 16:20 18s clem_custom_tasks_examples
February 22, 2024 16:20 18s
Add GPQA (#42)
Tests #168: Commit 831ad47 pushed by clefourrier
February 22, 2024 16:09 12m 52s main
February 22, 2024 16:09 12m 52s
Add GPQA (#42)
Quality #168: Commit 831ad47 pushed by clefourrier
February 22, 2024 16:09 23s main
February 22, 2024 16:09 23s
Add GPQA
Tests #167: Pull request #42 synchronize by clefourrier
February 22, 2024 15:19 14m 11s clem_add_gpqa
February 22, 2024 15:19 14m 11s
Add GPQA
Quality #167: Pull request #42 synchronize by clefourrier
February 22, 2024 15:19 19s clem_add_gpqa
February 22, 2024 15:19 19s
Adding support for Arabic benchmarks : AceGPT benchmarking suite
Quality #166: Pull request #44 synchronize by alielfilali01
February 22, 2024 14:44 22s main
February 22, 2024 14:44 22s
Adding support for Arabic benchmarks : AceGPT benchmarking suite
Tests #166: Pull request #44 synchronize by alielfilali01
February 22, 2024 14:44 1h 16m 12s main
February 22, 2024 14:44 1h 16m 12s
Improve the current chat template system (#38)
Tests #165: Commit 81fc8fd pushed by clefourrier
February 22, 2024 14:38 13m 27s main
February 22, 2024 14:38 13m 27s
Improve the current chat template system (#38)
Quality #165: Commit 81fc8fd pushed by clefourrier
February 22, 2024 14:38 18s main
February 22, 2024 14:38 18s
New mechanism for evaluation contributions
Quality #164: Pull request #47 synchronize by clefourrier
February 22, 2024 14:34 15s clem_custom_tasks_examples
February 22, 2024 14:34 15s
New mechanism for evaluation contributions
Tests #164: Pull request #47 synchronize by clefourrier
February 22, 2024 14:34 1h 14m 55s clem_custom_tasks_examples
February 22, 2024 14:34 1h 14m 55s