
Model eval #1047

Open
iankur opened this issue Jun 4, 2024 · 2 comments · May be fixed by #1049
Labels: bug (Something isn't working), good first issue (Good for newcomers)

iankur commented Jun 4, 2024

Does the eleuther_eval recipe call model.eval() anywhere? I tried to find it but could not. It is there in the lm evaluation harness, but we have overridden the constructor in the eval wrapper. Also, my results change when I switch the model to eval mode.
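A minimal sketch (not torchtune's actual wrapper) of why this matters: any model containing dropout or batch-norm layers behaves differently in train mode, which is PyTorch's default, so forgetting model.eval() can change evaluation results. The toy model below is hypothetical; only torch is assumed.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical toy model with a dropout layer, standing in for an LLM.
model = nn.Sequential(nn.Linear(4, 4), nn.Dropout(p=0.5))
x = torch.ones(1, 4)

# Train mode (the default): dropout randomly zeroes activations,
# so repeated forward passes generally disagree.
model.train()
out_train_a = model(x)
out_train_b = model(x)

# Eval mode: dropout becomes a no-op and the forward pass is deterministic.
model.eval()
out_eval_a = model(x)
out_eval_b = model(x)

print(torch.equal(out_eval_a, out_eval_b))  # True
```

This deterministic eval-mode behavior is exactly what an evaluation harness should rely on.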

@joecummings joecummings self-assigned this Jun 4, 2024
@joecummings joecummings linked a pull request Jun 4, 2024 that will close this issue
joecummings (Contributor) commented:
Great catch @iankur! I've opened a PR to address this; feel free to follow along there.


iankur (Author) commented Jul 2, 2024

Thanks @joecummings. Do you mind sharing why the eleuther_eval recipe uses the evaluate method instead of simple_evaluate from lm-eval-harness? I am asking because evaluate does not allow few-shot evaluation, whereas simple_evaluate accepts a num_fewshot argument for it.
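To illustrate what num_fewshot controls conceptually: the harness prepends that many solved exemplars to each query before scoring the model. The helper and data below are hypothetical, not lm-eval-harness code; they are a minimal sketch of few-shot prompt construction.

```python
def build_fewshot_prompt(exemplars, query, num_fewshot):
    """Prepend up to num_fewshot (question, answer) pairs to the query."""
    shots = exemplars[:num_fewshot]
    lines = [f"Q: {q}\nA: {a}" for q, a in shots]
    # The final query is left unanswered for the model to complete.
    lines.append(f"Q: {query}\nA:")
    return "\n\n".join(lines)

# Hypothetical exemplars for illustration only.
exemplars = [("2+2?", "4"), ("3+3?", "6"), ("5+1?", "6")]
prompt = build_fewshot_prompt(exemplars, "7+2?", num_fewshot=2)
print(prompt)
```

With num_fewshot=0 this degenerates to a plain zero-shot query, which is effectively what a harness entry point without a few-shot parameter is limited to.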

felipemello1 added the bug (Something isn't working) and good first issue (Good for newcomers) labels Jul 2, 2024
3 participants