Skip to content

Actions: huggingface/lighteval

Quality

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
321 workflow run results
321 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Supports extended tasks (#101)
Quality #334: Commit df21407 pushed by clefourrier
March 9, 2024 18:25 2m 26s main
March 9, 2024 18:25 2m 26s
Supports extended tasks
Quality #333: Pull request #101 synchronize by NathanHB
March 9, 2024 15:58 2m 20s clem_support_extended_tasks
March 9, 2024 15:58 2m 20s
Fixes input length management for generative evals (#103)
Quality #332: Commit e8e4df7 pushed by NathanHB
March 9, 2024 15:57 2m 46s main
March 9, 2024 15:57 2m 46s
Fixes input length management for generative evals
Quality #331: Pull request #103 synchronize by NathanHB
March 9, 2024 15:10 2m 30s clem_fix_max_len_gen
March 9, 2024 15:10 2m 30s
Fixes input length management for generative evals
Quality #330: Pull request #103 synchronize by NathanHB
March 9, 2024 14:29 2m 17s clem_fix_max_len_gen
March 9, 2024 14:29 2m 17s
Supports extended tasks
Quality #329: Pull request #101 synchronize by NathanHB
March 9, 2024 14:26 2m 21s clem_support_extended_tasks
March 9, 2024 14:26 2m 21s
Supports extended tasks
Quality #328: Pull request #101 synchronize by NathanHB
March 9, 2024 10:47 2m 21s clem_support_extended_tasks
March 9, 2024 10:47 2m 21s
Add mt-bench
Quality #327: Pull request #75 synchronize by NathanHB
March 9, 2024 10:27 2m 27s nathan-add-mt-bench
March 9, 2024 10:27 2m 27s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #326: Pull request #95 synchronize by alielfilali01
March 8, 2024 18:08 2m 26s GMAL-Org:main
March 8, 2024 18:08 2m 26s
Add BBH (#7)
Quality #325: Commit e324a83 pushed by clefourrier
March 8, 2024 17:58 2m 21s main
March 8, 2024 17:58 2m 21s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #324: Pull request #95 synchronize by alielfilali01
March 8, 2024 17:44 4m 55s GMAL-Org:main
March 8, 2024 17:44 4m 55s
Add BBH
Quality #323: Pull request #7 synchronize by clefourrier
March 8, 2024 17:43 6m 49s add_bbh
March 8, 2024 17:43 6m 49s
Adding TinyBench
Quality #322: Pull request #104 opened by clefourrier
March 8, 2024 17:33 2m 21s add_tinybenchs
March 8, 2024 17:33 2m 21s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #321: Pull request #95 synchronize by alielfilali01
March 8, 2024 16:37 3m 21s GMAL-Org:main
March 8, 2024 16:37 3m 21s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #320: Pull request #95 synchronize by alielfilali01
March 8, 2024 16:14 7m 14s GMAL-Org:main
March 8, 2024 16:14 7m 14s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #319: Pull request #95 synchronize by alielfilali01
March 8, 2024 16:13 2m 24s GMAL-Org:main
March 8, 2024 16:13 2m 24s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #318: Pull request #95 synchronize by alielfilali01
March 8, 2024 16:06 2m 22s GMAL-Org:main
March 8, 2024 16:06 2m 22s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #317: Pull request #95 synchronize by alielfilali01
March 8, 2024 15:57 2m 16s GMAL-Org:main
March 8, 2024 15:57 2m 16s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #316: Pull request #95 synchronize by alielfilali01
March 8, 2024 15:20 2m 21s GMAL-Org:main
March 8, 2024 15:20 2m 21s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #315: Pull request #95 synchronize by alielfilali01
March 8, 2024 15:16 2m 28s GMAL-Org:main
March 8, 2024 15:16 2m 28s
Rolling management (#78)
Quality #314: Commit bca2b1d pushed by clefourrier
March 8, 2024 13:19 2m 27s main
March 8, 2024 13:19 2m 27s
Fixes input length management for generative evals
Quality #313: Pull request #103 opened by clefourrier
March 8, 2024 13:17 2m 21s clem_fix_max_len_gen
March 8, 2024 13:17 2m 21s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #312: Pull request #95 synchronize by alielfilali01
March 7, 2024 22:33 2m 16s GMAL-Org:main
March 7, 2024 22:33 2m 16s
Adding support for Arabic benchmarks : AlGhafa benchmarking suite
Quality #311: Pull request #95 synchronize by alielfilali01
March 7, 2024 22:31 5m 5s GMAL-Org:main
March 7, 2024 22:31 5m 5s
Supports extended tasks
Quality #309: Pull request #101 synchronize by clefourrier
March 7, 2024 11:17 2m 16s clem_support_extended_tasks
March 7, 2024 11:17 2m 16s