Actions: huggingface/lighteval

Tests

1,770 workflow runs

Extractive Match metric
Tests #2014: Pull request #495 synchronize by hynky1999
January 13, 2025 14:09 In progress math_extraction
Extractive Match metric
Tests #2013: Pull request #495 synchronize by hynky1999
January 13, 2025 14:08 In progress math_extraction
Extractive Match metric
Tests #2012: Pull request #495 synchronize by hynky1999
January 13, 2025 13:14 38m 7s math_extraction
llm_as_a_judge_for_oallv2_arabic
Tests #2011: Pull request #498 opened by Manel-Hik
January 13, 2025 11:30 Action required Manel-Hik:main
Add swiss legal evals as new community tasks
Tests #2010: Pull request #389 synchronize by JoelNiklaus
January 13, 2025 05:35 Action required JoelNiklaus:add_swiss_legal_evals
Initial proposal for model lazy loading
Tests #2009: Pull request #497 opened by JoelNiklaus
January 11, 2025 21:15 Action required JoelNiklaus:lazy-load-model-init
Extractive Match metric
Tests #2008: Pull request #495 opened by hynky1999
January 11, 2025 19:03 41m 6s math_extraction
Added custom model inference.
Tests #2007: Pull request #437 synchronize by JoelNiklaus
January 11, 2025 18:31 Action required JoelNiklaus:add-custom-model
Add Doc Strings to Config Files
Tests #2005: Pull request #465 synchronize by ParagEkbote
January 11, 2025 14:41 Action required ParagEkbote:Document-Custom-Model-Files
Add swiss legal evals as new community tasks
Tests #2000: Pull request #389 synchronize by JoelNiklaus
January 10, 2025 18:13 Action required JoelNiklaus:add_swiss_legal_evals
Add swiss legal evals as new community tasks
Tests #1999: Pull request #389 synchronize by JoelNiklaus
January 10, 2025 16:55 Action required JoelNiklaus:add_swiss_legal_evals
Fixed issue with o1 in litellm.
Tests #1998: Pull request #493 opened by JoelNiklaus
January 10, 2025 02:10 40m 58s JoelNiklaus:fix-o1-litellm
Add swiss legal evals as new community tasks
Tests #1995: Pull request #389 synchronize by JoelNiklaus
January 7, 2025 18:14 Action required JoelNiklaus:add_swiss_legal_evals
Made judge response processing more robust.
Tests #1994: Pull request #491 opened by JoelNiklaus
January 7, 2025 16:48 Action required JoelNiklaus:fix-process-response
Tests
Tests #1993: by clefourrier
January 7, 2025 15:20 39m 24s main
Hotfix for litellm judge
Tests #1992: Pull request #490 synchronize by JoelNiklaus
January 7, 2025 15:17 37m 41s JoelNiklaus:fix-litellm-judge
Hotfix for litellm judge
Tests #1990: Pull request #490 synchronize by JoelNiklaus
January 7, 2025 14:57 42m 9s JoelNiklaus:fix-litellm-judge