feat: add indicator for tool failure to FunctionExecutionResult #5428

wistuba · 2025-02-07T13:54:12Z

Why are these changes needed?

Some LLMs receive an explicit signal about tool-use failures. This change will allow FunctionExecutionResult to carry that signal.
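To make the idea concrete, here is a minimal sketch of a result type with an optional error flag. `ToolResultSketch`, `run_tool`, and `failing_tool` are hypothetical names used for illustration only; this is not the actual `FunctionExecutionResult` definition in autogen-core, just an assumption about what such a flag could look like.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ToolResultSketch:
    """Hypothetical stand-in for a tool execution result (not the library class)."""

    content: str
    call_id: str
    # None means the caller did not report success or failure either way.
    is_error: Optional[bool] = None


def run_tool(call_id: str, tool: Callable[[], str]) -> ToolResultSketch:
    """Run a zero-argument tool and record whether it raised an exception."""
    try:
        return ToolResultSketch(content=tool(), call_id=call_id, is_error=False)
    except Exception as exc:
        # The error text still goes into content so the model can read it.
        return ToolResultSketch(content=f"Error: {exc}", call_id=call_id, is_error=True)


def failing_tool() -> str:
    raise ValueError("city not found")


ok = run_tool("call_1", lambda: "22 degrees")
failed = run_tool("call_2", failing_tool)
```

Keeping the flag optional with a `None` default means existing callers that only pass `content` and `call_id` would not need to change.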

Related issue number

Closes #5273

Checks

I've included any doc changes needed for https://microsoft.github.io/autogen/. See https://microsoft.github.io/autogen/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

We probably want to add some additional testing. I need some advice on where to add it.

wistuba · 2025-02-07T13:55:29Z

@microsoft-github-policy-service agree company="Amazon"

wistuba · 2025-02-07T13:56:16Z

@microsoft-github-policy-service agree company="Amazon"

python/packages/autogen-core/src/autogen_core/models/_types.py

ekzhu

You can run uv sync --all-extras in the 'python' directory to sync your local environment. There is no dependency change, so there shouldn't be any update to the uv.lock file.

python/uv.lock

wistuba · 2025-02-08T20:12:49Z

> You can run uv sync --all-extras in the 'python' directory to sync your local environment. There is no dependency change so there shouldn't be update to the uv.lock file

I did that, but it doesn't help. I manually reverted uv.lock, but uv sync keeps changing it back. With the uv.lock from main, I keep seeing issues with pyright when running poe check. It requires me to install pyright manually, which may be causing this problem:

RuntimeError: nodeenv failed; for more reliable node.js binaries try `pip install pyright[nodejs]`
Error: Sequence aborted after failed subtask 'pyright'

ekzhu · 2025-02-09T03:57:45Z

Could you try purging your local virtualenv completely and recreating it? Also make sure your uv installation is up to date.

git clone https://github.com/microsoft/autogen
cd autogen/python
uv venv --python=3.12
source .venv/bin/activate
uv sync --all-extras

I just ran the CI tests and they pass, so your local environment setup likely has an issue.

codecov · 2025-02-09T03:59:06Z

Codecov Report

Attention: Patch coverage is 70.00000% with 6 lines in your changes missing coverage. Please review.

Project coverage is 78.32%. Comparing base (b8c5e49) to head (a18e154).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
...togen_ext/agents/openai/_openai_assistant_agent.py	73.33%	4 Missing ⚠️
...t/src/autogen_agentchat/agents/_assistant_agent.py	50.00%	1 Missing ⚠️
...n-core/src/autogen_core/tool_agent/_caller_loop.py	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #5428      +/-   ##
==========================================
- Coverage   78.32%   78.32%   -0.01%     
==========================================
  Files         165      165              
  Lines        9800     9803       +3     
==========================================
+ Hits         7676     7678       +2     
- Misses       2124     2125       +1

Flag	Coverage Δ
unittests	`78.32% <70.00%> (-0.01%)`	⬇️


python/packages/autogen-ext/src/autogen_ext/agents/openai/_openai_assistant_agent.py

ekzhu · 2025-02-10T06:00:55Z

Thanks! For now we don't really have a sample showing this. I believe that with the Anthropic model client, we can utilize the error field to indicate an error, as described here: https://docs.anthropic.com/en/docs/build-with-claude/tool-use#troubleshooting-errors.

#5205
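As a rough sketch of what that could look like: the Anthropic Messages API accepts an `is_error` field on a `tool_result` content block. The helper below is hypothetical (`to_anthropic_tool_result` is not part of autogen or the Anthropic SDK) and only builds the plain dictionary for such a block from a call ID, the result content, and an optional error flag.

```python
from typing import Any, Dict, Optional


def to_anthropic_tool_result(
    call_id: str, content: str, is_error: Optional[bool] = None
) -> Dict[str, Any]:
    """Build a tool_result content block (hypothetical helper, for illustration)."""
    block: Dict[str, Any] = {
        "type": "tool_result",
        "tool_use_id": call_id,
        "content": content,
    }
    # Only attach the flag when the tool actually failed; None and False are
    # both treated as "no error reported" in this sketch.
    if is_error:
        block["is_error"] = True
    return block


success_block = to_anthropic_tool_result("toolu_1", "22 degrees")
error_block = to_anthropic_tool_result("toolu_2", "Error: city not found", is_error=True)
```

Treating `None` the same as `False` is a design assumption here; a real client might prefer to distinguish "unknown" from "succeeded".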

wistuba and others added 2 commits February 7, 2025 14:45

feat: add indicator for tool failure to FunctionExecutionResult

a3abe68

Merge branch 'main' into main

b5d262d

Merge branch 'main' into main

17a8cdf

ekzhu requested changes Feb 7, 2025

python/packages/autogen-core/src/autogen_core/models/_types.py (outdated)

wistuba added 2 commits February 8, 2025 12:03

set default value for is_error

b81c06e

Merge branch 'main' of https://github.com/wistuba/autogen

6993ad8

ekzhu reviewed Feb 8, 2025

python/packages/autogen-core/src/autogen_core/models/_types.py (outdated)

make is_error optional

1884ded

wistuba requested a review from ekzhu February 8, 2025 17:30

ekzhu reviewed Feb 8, 2025

python/uv.lock (outdated)

wistuba and others added 2 commits February 8, 2025 21:08

revert uv lock

c94bb37

Merge branch 'main' into main

1d88e8e

wistuba requested a review from ekzhu February 8, 2025 20:12

ekzhu requested changes Feb 9, 2025

python/packages/autogen-ext/src/autogen_ext/agents/openai/_openai_assistant_agent.py

Merge branch 'main' into main

f474d1a

ekzhu approved these changes Feb 10, 2025

ekzhu added 2 commits February 9, 2025 21:38

Merge branch 'main' into main

2cb3c51

Merge branch 'main' into main

a18e154

ekzhu merged commit 7a772a2 into microsoft:main Feb 10, 2025
65 of 66 checks passed

ekzhu mentioned this pull request Feb 10, 2025

Support for anthropic models in v0.4 #5205

Open
