[BFCL Chore] Ensure Correct Input Format for Eval Checker #860

HuanzhiMao · 2024-12-30T08:34:52Z

In some cases, the model handler’s decode_ast method returns successfully but produces output in an unexpected format, causing issues in downstream evaluations that do not perform argument format validation. This problem is especially common when the model does not output any function calls, resulting in a human-readable string instead of the expected structure.

This PR refines the is_function_calling_format_output function to enforce that outputs must be a list of dictionaries in the following format before calling the checker function:

[
  {func1: {param1: val1, param2: val2, ...}},
  {func2: {param1: val1, param2: val2, ...}},
  ...
]

Note: This PR will not affect the leaderboard score.

berkeley-function-call-leaderboard/bfcl/utils.py

better check for is_function_calling_format_output

3f2d246

HuanzhiMao added the BFCL-General General BFCL Issue label Dec 30, 2024

Merge branch 'main' into main

e5e2744

CharlieJCJ approved these changes Jan 4, 2025

View reviewed changes

berkeley-function-call-leaderboard/bfcl/utils.py Show resolved Hide resolved

add comments

194f60e

HuanzhiMao merged commit 79b1c60 into ShishirPatil:main Jan 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BFCL Chore] Ensure Correct Input Format for Eval Checker #860

[BFCL Chore] Ensure Correct Input Format for Eval Checker #860

HuanzhiMao commented Dec 30, 2024

[BFCL Chore] Ensure Correct Input Format for Eval Checker #860

[BFCL Chore] Ensure Correct Input Format for Eval Checker #860

Conversation

HuanzhiMao commented Dec 30, 2024