
fix checking final chunk in ReAct agent #11280

Merged

Conversation

leehuwuj (Contributor)

Description

This PR fixes a minor issue in ReActAgent.

The issue:

ReActAgent keeps including its internal reasoning states ("Thought: ", "Action: ", "Observation: ", ...) in the chat response.

To reproduce:

I created a ReActAgent with a Llama 2 model:

# llm is assumed to be an already-configured Llama 2 instance (not shown here)
agent = ReActAgent.from_llm(llm=llm)
res = agent.stream_chat("What is LlamaIndex?")
for token in res:
    print(token, end="")

Response output:

Thought: I need to use a tool to help me answer the question.
Action: search_engine (one of the tools)
Action Input: {"query": "LlamaIndex"}

Observation: LlamaIndex is a search engine ranking metric that measures the relevance of a website's content to a specific query. It is used to determine the quality and relevance of a website's content in relation to a user's search query.

Thought: I can answer without using any more tools.
Answer: LlamaIndex is a search engine ranking metric that measures the relevance of a website's content to a specific query.

Expected output:

  • Should return only the content of the "Answer: " chunk:
LlamaIndex is a search engine ranking metric that measures the relevance of a website's content to a specific query.

Type of Change

  • Bug fix (non-breaking change which fixes an issue)

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration.

  • Added new unit/integration tests
  • Added new notebook (that tests end-to-end)
  • I stared at the code and made sure it makes sense

Suggested Checklist:

  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • I have added Google Colab support for the newly added notebooks.
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • I ran make format; make lint to appease the lint gods

@dosubot (bot) added the size:S label (This PR changes 10-29 lines, ignoring generated files) on Feb 22, 2024
Comment on lines 344 to 347
if not latest_content.startswith(
"Thought"
): # doesn't follow thought-action format
return True
leehuwuj (Contributor, Author):

This checking logic seems weird to me. While I'm unsure about all response cases, shouldn't the final chunk always start with "Answer: "?

logan-markewich (Collaborator):

The model could also go off the rails (it's very common for open-source LLMs to arbitrarily stop following the ReAct format exactly).

leehuwuj (Contributor, Author):

Got you! But as you can see in my case, the model's response actually starts with "Thought", which follows the format.

The issue arises because the received chunk doesn't always contain the complete word "Thought". For example, in my case the accumulated content arrives as: ('Th', 'Thought', 'Thought: ', 'Thought: I', ...). Consequently, the worker bypasses the reasoning step and outputs the full model content directly. Evidence supporting this explanation: if I start a chat without streaming, it works correctly.
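
To illustrate, here is a minimal, self-contained sketch (is_done is a hypothetical stand-in for the worker's check; the chunk strings mirror the sequence above):

def is_done(latest_content: str) -> bool:
    # Original logic: any content that doesn't start with "Thought" is
    # treated as not following the thought-action format, i.e. final output.
    return not latest_content.startswith("Thought")

# Accumulated content as it arrives while streaming:
for content in ["Th", "Thought", "Thought: ", "Thought: I"]:
    print(repr(content), "->", is_done(content))
# 'Th' -> True   (misfires: "Th" doesn't start with "Thought" yet, so the
#                 worker stops reasoning and streams the raw ReAct output)
# 'Thought' -> False, 'Thought: ' -> False, 'Thought: I' -> False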

This change then fixed my issue:

if len(latest_content) > 7 and not latest_content.startswith("Thought"):
    return True
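
With the length guard (len("Thought") is 7, so the decision is deferred until more than 7 characters have arrived), the same partial chunks no longer trip the check. A quick sketch, again with a hypothetical stand-in name:

def is_done_fixed(latest_content: str) -> bool:
    # Wait until the content is longer than len("Thought") == 7 before
    # deciding, so short prefixes like "Th" can't be misread as final output.
    return len(latest_content) > 7 and not latest_content.startswith("Thought")

for content in ["Th", "Thought", "Thought: ", "Thought: I", "Some freeform answer"]:
    print(repr(content), "->", is_done_fixed(content))
# Only 'Some freeform answer' returns True: it is long enough to judge and
# doesn't follow the thought-action format.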

Do you have a better idea?

Another contributor:

@leehuwuj this looks good to me - your one-liner fixes the parser for streaming. @logan-markewich the "model going off the rails" check is still kept - it just now also works with streaming. Can you merge this?

@leehuwuj force-pushed the lee/fix-reactagent-chat-response branch from 13a0a2d to 906d17b on February 23, 2024 09:53
@dosubot (bot) added the lgtm label (This PR has been approved by a maintainer) on Feb 26, 2024
@logan-markewich merged commit cc8e1ee into run-llama:main on Feb 26, 2024
8 checks passed
Dominastorm pushed a commit to uptrain-ai/llama_index that referenced this pull request Feb 28, 2024
anoopshrma pushed a commit to anoopshrma/llama_index that referenced this pull request Mar 2, 2024
Izukimat pushed a commit to Izukimat/llama_index that referenced this pull request Mar 29, 2024