
Add support for stream pipeline decorator #543

Open
wants to merge 3 commits into main

Conversation

rayrayraykk (Collaborator) commented Feb 28, 2025


name: add support for stream pipeline decorator
about: make the agentscope app a generator with a simple decorator

Description

This pull request adds support for a stream pipeline decorator, which improves compatibility with other API server pipelines. The main goal is to let the agentscope app return data in a streaming manner, so it can integrate more easily with API servers that require or benefit from streaming output.

Key Features

Pipeline Decorator: Implements a @pipeline decorator that transforms the application into a generator, enabling streaming data output.

Example Usage:

# -*- coding: utf-8 -*-
"""A simple example for conversation between user and assistant agent."""
import os
import agentscope
from agentscope.agents import DialogAgent
from agentscope.agents.user_agent import UserAgent
from agentscope.pipelines.functional import sequentialpipeline
from agentscope.utils.common import pipeline


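# The @pipeline decorator turns main() into a generator: every message produced
# during the conversation is yielded back to the caller for streaming.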
@pipeline
def main() -> None:
    """A basic conversation demo"""

    agentscope.init(
        model_configs=[
            {
                "model_type": "dashscope_chat",
                "config_name": "qwen-max",
                "model_name": "qwen-max",
                "api_key": os.getenv("DASHSCOPE_API_KEY"),
                "stream": True,
            },
        ],
        project="Multi-Agent Conversation",
        save_api_invoke=True,
    )

    # Init two agents
    dialog_agent = DialogAgent(
        name="Assistant",
        sys_prompt="You're a helpful assistant.",
        model_config_name="qwen-max",  # replace by your model config name
    )
    user_agent = UserAgent()

    # start the conversation between user and assistant
    x = None
    while x is None or x.content != "exit":
        x = sequentialpipeline([user_agent, dialog_agent], x)


if __name__ == "__main__":
    for index, msg in enumerate(main()):
        print(index, msg.name, msg.content)

Checklist

Please check the following items before the code is ready to be reviewed.

  • Code has passed all tests
  • Docstrings have been added/updated in Google Style
  • Documentation has been updated
  • Code is ready for review

rayrayraykk (Collaborator, Author)

@pan-x-c Please help me check the compatibility in distributed mode, thanks.

def wrapper(*args: Any, **kwargs: Any) -> Generator:
    from ..logging import get_msg_instances, clear_msg_instances

    thread_id = "pipeline" + str(uuid.uuid4())

DavdGao (Collaborator) commented Mar 3, 2025


Why do we need to use a thread here?

rayrayraykk (Collaborator, Author) commented Mar 3, 2025


When the generator serves as an API, we need to keep the main program running in a child thread and push its messages to a request-isolated store (i.e., _MSG_INSTANCE[thread_id]). The main thread then retrieves those messages from the store and yields them.
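
A minimal sketch of that design, for illustration only (the store name _MSG_INSTANCE and the polling loop are assumptions based on this thread, not the exact code in the PR):

import threading
import time
import uuid
from typing import Any, Callable, Generator

_MSG_INSTANCE: dict = {}  # hypothetical request-isolated message store


def pipeline(func: Callable) -> Callable:
    """Sketch: turn an agentscope app into a generator of messages."""

    def wrapper(*args: Any, **kwargs: Any) -> Generator:
        thread_id = "pipeline" + str(uuid.uuid4())
        _MSG_INSTANCE[thread_id] = []

        def run() -> None:
            # The app runs in a child thread; agents are expected to push
            # their messages into _MSG_INSTANCE[thread_id] while it executes.
            func(*args, **kwargs)

        child = threading.Thread(target=run, daemon=True)
        child.start()

        # The main thread drains the store and yields messages as they arrive.
        while child.is_alive() or _MSG_INSTANCE[thread_id]:
            if _MSG_INSTANCE[thread_id]:
                yield _MSG_INSTANCE[thread_id].pop(0)
            else:
                time.sleep(0.01)
        _MSG_INSTANCE.pop(thread_id, None)

    return wrapper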

DavdGao (Collaborator) left a comment


Please see inline comments.

Besides, if we only want to obtain the application-level streaming output, how about using a hook function within the agent? So that

  1. we don't need to use threading (which may be dangerous in distributed mode).
  2. we can extend it to TTS easily.
  3. it's explicit for the developers and users.
    E.g.
agent = Agent(
    # xxx
)

agent.register_speak_pre_hook()
agent.register_speak_post_hook()
agent.register_reply_pre_hook()
agent.register_reply_post_hook()
# ...

# for application level output
def send_to_somewhere(agent, input):
    """
    Args: 
        agent (`AgentBase`):
            The agent module itself
        input (`message | Generator`):
            # ...
    """
    # ...

agent.register_speak_pre_hook(
    send_to_somewhere
)

That way we can re-use this module when we implement TTS functionality (e.g., speaking the message aloud when the speak function is called).

rayrayraykk (Collaborator, Author)


Using the hook function to send messages is a good approach. However, converting the application into a generator may still require concurrent threads or processes (one side sends, the other gets). Additionally, the model's response generator can only be consumed once, and that already happens in the speak function, so this change would require modifying the existing structure.
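
A quick plain-Python illustration of the constraint that a streaming response generator can only be consumed once (not agentscope code, just the underlying behaviour):

def model_response():
    """Stand-in for a model's streaming response."""
    yield "Hel"
    yield "lo"


gen = model_response()
print(list(gen))  # ['Hel', 'lo'] -- e.g. already consumed inside speak()
print(list(gen))  # [] -- a second consumer gets nothing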

DavdGao (Collaborator) commented Mar 7, 2025


In this case, we just need to create a "during" hook (perhaps under some other name).

def speak(self, msg: Union[Msg, Generator]):
    if isinstance(msg, Generator):
        for chunk in msg:
            # call the during-speak hook functions on each chunk
            for func in self.__hooks_during_speak:
                func(self, chunk)
            # normal processing
            log_stream_msg(xxx)
            # ...
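
A hedged sketch of how such a during-speak hook could feed an application-level generator, covering the "one side sends, the other gets" point above. The hook-registration call and the queue bridge are illustrative assumptions, not an existing agentscope API:

import queue
import threading

chunk_queue: "queue.Queue" = queue.Queue()
_SENTINEL = object()  # marks the end of the stream


def push_chunk(agent, chunk) -> None:
    """Hypothetical during-speak hook: forward every chunk to the queue."""
    chunk_queue.put(chunk)


def run_app() -> None:
    # agent.register_speak_during_hook(push_chunk)  # hypothetical registration
    # main()                                        # run the agentscope app here
    chunk_queue.put(_SENTINEL)


def stream_app():
    """Application-level generator: the app thread sends, this thread gets."""
    threading.Thread(target=run_app, daemon=True).start()
    while True:
        chunk = chunk_queue.get()
        if chunk is _SENTINEL:
            return
        yield chunk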

rayrayraykk (Collaborator, Author)

After discussion, we should implement pushing the msg elsewhere in a hook-like function within AgentBase. I will do it after hongyi's PR.
