Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Release 1.1.5 #2202

Merged
merged 13 commits into from
Feb 24, 2025
Merged

Release 1.1.5 #2202

merged 13 commits into from
Feb 24, 2025

Conversation

dirkbrnd
Copy link
Contributor

@dirkbrnd dirkbrnd commented Feb 21, 2025

Changelog

New Features:

  • Audio Responses: Agents can now deliver audio responses (both with streaming and non-streaming).
    • The audio is in the agent.run_response.response_audio.

    • This only works with OpenAIChat with the gpt-4o-audio-preview model. See their docs for more on how it works. For example

      from agno.agent import Agent
      from agno.models.openai import OpenAIChat
      from agno.utils.audio import write_audio_to_file
      
      agent = Agent(
          model=OpenAIChat(
              id="gpt-4o-audio-preview",
              modalities=["text", "audio"],  # Both text and audio responses are provided.
              audio={"voice": "alloy", "format": "wav"},
          ),
      )
      agent.print_response(
          "Tell me a 5 second story"
      )
      if agent.run_response.response_audio is not None:
          write_audio_to_file(
              audio=agent.run_response.response_audio.base64_audio, filename=str(filename)
          )
    • See the audio_conversation_agent cookbook to test it out on the Agent Playground.

  • Image understanding support for Together.ai and XAi: You can now give images to agents using models from XAi and Together.ai.
  • Webex Tool: Added a tool for sending messages on Webex.
  • Upstash Vector DB: Added support for Upstash.

Improvements:

  • Automated Tests: Added integration tests for all models. Most of these will be run on each pull request, with a suite of integration tests run before a new release is published.
  • Grounding and Search with Gemini: Grounding and Search can be used to improve the accuracy and recency of responses from the Gemini models.

Bug Fixes:

  • Structured output updates: Fixed various cases where native structured output was not used on models
  • Ollama tool parsing: Fixed cases for Ollama with tools with optional parameters
  • Gemini Memory Summariser: Fixed cases where Gemini models were used as the memory summariser.
  • Gemini Auto Tool Calling: Enabled automatic tool calling when tools are provided, aligning behavior with other models.
  • FixedSizeChunking issue with overlap: Fixed issue where chunking would fail if overlap was set.
  • Claude Tools with Multiple Types: Fixed an issue where Claude tools would break when handling a union of types in parameters.
  • JSON Response Parsing: Fixed cases where JSON model responses returned quoted strings within dictionary values.

@dirkbrnd dirkbrnd requested a review from a team as a code owner February 21, 2025 19:25
@dirkbrnd dirkbrnd merged commit b25448f into main Feb 24, 2025
19 checks passed
@dirkbrnd dirkbrnd deleted the release-1.1.5 branch February 24, 2025 15:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants