Issue encountered

I need to evaluate Google Translate and other systems that do not fit any of the currently supported evaluation modes.
Solution/Feature
I am not sure what the best way to do this is, but I see the following general solution:
We add a ModelConfig in which the user specifies a function that is called on each batch of examples. As long as that function returns correctly formatted output, the user can evaluate any system they like; it does not need to fit into the model_configs supported so far.
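A minimal sketch of what such a config could look like. All names here (`CustomFunctionModelConfig`, `predict_fn`, `predict_batch`) are hypothetical placeholders for illustration, not the library's existing API:

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class CustomFunctionModelConfig:
    """Hypothetical ModelConfig variant that delegates prediction to a
    user-supplied function invoked once per batch of examples."""

    # The function receives a batch of input examples (e.g. source
    # sentences) and must return exactly one output string per example.
    predict_fn: Callable[[List[str]], List[str]]

    def predict_batch(self, examples: List[str]) -> List[str]:
        outputs = self.predict_fn(examples)
        if len(outputs) != len(examples):
            raise ValueError(
                f"predict_fn returned {len(outputs)} outputs "
                f"for {len(examples)} examples"
            )
        return outputs


# Example: wrapping an external system such as Google Translate.
def google_translate_batch(sentences: List[str]) -> List[str]:
    # In practice this would call out to the translation API; here it
    # just echoes the input so the sketch runs end to end.
    return [f"<translated: {s}>" for s in sentences]


config = CustomFunctionModelConfig(predict_fn=google_translate_batch)
print(config.predict_batch(["Hello, world."]))
```

The evaluation loop would then only need to call `predict_batch` and score the returned strings, without caring how they were produced.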
Happy to propose a PR for this.