Skip to content

v1.1.5

Compare
Choose a tag to compare
@dirkbrnd dirkbrnd released this 24 Feb 15:54
· 5 commits to main since this release
b25448f

Changelog

New Features:

  • Audio Responses: Agents can now deliver audio responses (both with streaming and non-streaming).
    • The audio is in the agent.run_response.response_audio.

    • This only works with OpenAIChat with the gpt-4o-audio-preview model. See their docs for more on how it works. For example

      from agno.agent import Agent
      from agno.models.openai import OpenAIChat
      from agno.utils.audio import write_audio_to_file
      
      agent = Agent(
          model=OpenAIChat(
              id="gpt-4o-audio-preview",
              modalities=["text", "audio"],  # Both text and audio responses are provided.
              audio={"voice": "alloy", "format": "wav"},
          ),
      )
      agent.print_response(
          "Tell me a 5 second story"
      )
      if agent.run_response.response_audio is not None:
          write_audio_to_file(
              audio=agent.run_response.response_audio.base64_audio, filename=str(filename)
          )
    • See the audio_conversation_agent cookbook to test it out on the Agent Playground.

  • Image understanding support for Together.ai and XAi: You can now give images to agents using models from XAi and Together.ai.
  • Webex Tool: Added a tool for sending messages on Webex.
  • Upstash Vector DB: Added support for Upstash.

Improvements:

  • Automated Tests: Added integration tests for all models. Most of these will be run on each pull request, with a suite of integration tests run before a new release is published.
  • Grounding and Search with Gemini: Grounding and Search can be used to improve the accuracy and recency of responses from the Gemini models.

Bug Fixes:

  • Structured output updates: Fixed various cases where native structured output was not used on models
  • Ollama tool parsing: Fixed cases for Ollama with tools with optional parameters
  • Gemini Memory Summariser: Fixed cases where Gemini models were used as the memory summariser.
  • Gemini Auto Tool Calling: Enabled automatic tool calling when tools are provided, aligning behavior with other models.
  • FixedSizeChunking issue with overlap: Fixed issue where chunking would fail if overlap was set.
  • Claude Tools with Multiple Types: Fixed an issue where Claude tools would break when handling a union of types in parameters.
  • JSON Response Parsing: Fixed cases where JSON model responses returned quoted strings within dictionary values.

What's Changed

New Contributors

Full Changelog: v1.1.4...v1.1.5