Add Voice Capabilities to Nevron (ElevenLabs Integration) #135

Open
gromdimon opened this issue Feb 11, 2025 · 0 comments

gromdimon commented Feb 11, 2025

Description

We need to add voice capabilities to Nevron by integrating ElevenLabs for text-to-speech (TTS) generation. This will allow the agent to respond with audio, making interactions more engaging and accessible.

The feature should:

  1. Convert agent responses into speech using ElevenLabs’ TTS API.

Requirements

1. ElevenLabs API Integration

  • Use the ElevenLabs TTS API to generate voice responses.
  • Support different voices and accents provided by ElevenLabs.
  • Ensure API keys and settings are configurable via .env.
  • Implement this in a new Executor.
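
A minimal sketch of how the points above could fit together, assuming the `.env` values have already been loaded into the process environment (for example by a dotenv loader). The variable names `ELEVENLABS_API_KEY`, `ELEVENLABS_VOICE_ID`, and `ELEVENLABS_MODEL_ID`, and the helpers `load_settings`, `build_tts_request`, and `synthesize`, are hypothetical and not part of Nevron. The endpoint path and the `xi-api-key` header follow the public ElevenLabs REST API and should be checked against the current API reference before use.

```python
import json
import os
import urllib.request
from dataclasses import dataclass
from typing import Mapping, Optional

# Public ElevenLabs TTS endpoint; verify against the current API reference.
ELEVENLABS_TTS_URL = "https://api.elevenlabs.io/v1/text-to-speech/{voice_id}"


@dataclass(frozen=True)
class ElevenLabsSettings:
    api_key: str
    voice_id: str
    model_id: str


def load_settings(env: Optional[Mapping[str, str]] = None) -> ElevenLabsSettings:
    """Read settings from the environment (variable names are assumptions)."""
    env = os.environ if env is None else env
    for name in ("ELEVENLABS_API_KEY", "ELEVENLABS_VOICE_ID"):
        if not env.get(name):
            raise ValueError(f"{name} is not set")
    return ElevenLabsSettings(
        api_key=env["ELEVENLABS_API_KEY"],
        voice_id=env["ELEVENLABS_VOICE_ID"],
        model_id=env.get("ELEVENLABS_MODEL_ID", "eleven_multilingual_v2"),
    )


def build_tts_request(text: str, settings: ElevenLabsSettings) -> urllib.request.Request:
    """Build the POST request without sending it, so it can be unit-tested offline."""
    body = json.dumps({"text": text, "model_id": settings.model_id}).encode("utf-8")
    return urllib.request.Request(
        ELEVENLABS_TTS_URL.format(voice_id=settings.voice_id),
        data=body,
        headers={
            "xi-api-key": settings.api_key,
            "Content-Type": "application/json",
            "Accept": "audio/mpeg",
        },
        method="POST",
    )


def synthesize(text: str, settings: ElevenLabsSettings, timeout: float = 30.0) -> bytes:
    """Send the request and return the raw audio bytes."""
    with urllib.request.urlopen(build_tts_request(text, settings), timeout=timeout) as resp:
        return resp.read()
```

A new Executor could then call `synthesize(response_text, load_settings())` and hand the returned bytes to whatever delivers audio to the user. Keeping request construction separate from the network call makes the integration testable without an API key.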

2. Voice Configuration

  • Allow users to choose different voices (e.g., Rachel, Adam, etc.).
  • Support custom speed, pitch, and tone settings.
  • Ensure API limits (rate limits and usage quotas) are handled gracefully.
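
One way the voice configuration and limit handling might look, as a sketch only. `VoiceConfig`, `VOICES`, and `call_with_backoff` are hypothetical names, the voice IDs are placeholders to be replaced with real IDs from the ElevenLabs voice library, and the default values are arbitrary. The `stability`, `similarity_boost`, and `style` fields correspond to the `voice_settings` object of the ElevenLabs TTS API; how "speed, pitch, and tone" map onto the available settings is left open here and should be checked against the API reference. The retry helper assumes rate limiting is signalled with HTTP 429.

```python
import time
import urllib.error
from dataclasses import dataclass
from typing import Callable, TypeVar

T = TypeVar("T")


@dataclass(frozen=True)
class VoiceConfig:
    voice_id: str
    stability: float = 0.5          # arbitrary default
    similarity_boost: float = 0.75  # arbitrary default
    style: float = 0.0              # arbitrary default

    def voice_settings(self) -> dict:
        """Return the settings as a dict, rejecting values outside 0.0-1.0."""
        settings = {
            "stability": self.stability,
            "similarity_boost": self.similarity_boost,
            "style": self.style,
        }
        for name, value in settings.items():
            if not 0.0 <= value <= 1.0:
                raise ValueError(f"{name} must be between 0.0 and 1.0, got {value}")
        return settings


# Placeholder IDs: look up the real ones in the ElevenLabs voice library.
VOICES = {
    "rachel": VoiceConfig(voice_id="<rachel-voice-id>"),
    "adam": VoiceConfig(voice_id="<adam-voice-id>"),
}


def call_with_backoff(
    call: Callable[[], T],
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Retry `call` with exponential backoff while the API answers HTTP 429."""
    for attempt in range(max_attempts):
        try:
            return call()
        except urllib.error.HTTPError as err:
            # Re-raise anything that is not a rate limit, or the final failure.
            if err.code != 429 or attempt == max_attempts - 1:
                raise
            sleep(base_delay * 2 ** attempt)
    raise ValueError("max_attempts must be at least 1")
```

The selected voice's `voice_settings()` dict would be sent as the `voice_settings` field of the request body, and the TTS call would be wrapped as `call_with_backoff(lambda: ...)`. Injecting `sleep` keeps the retry logic testable without real delays.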

3. Testing

  • Validate the API response times and audio quality.
  • Ensure audio files are stored securely and cleared after use.
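
For the storage requirement, a small sketch of one possible approach: write the audio to a temporary file and delete it as soon as the caller is done. `temporary_audio_file` is a hypothetical helper, and the `.mp3` suffix assumes MP3 output. `tempfile.mkstemp` creates the file so that only the creating user can read and write it, which covers the "stored securely" part on a single-user host; anything stricter is outside this sketch.

```python
import os
import tempfile
from contextlib import contextmanager
from typing import Iterator


@contextmanager
def temporary_audio_file(audio: bytes, suffix: str = ".mp3") -> Iterator[str]:
    """Write audio to a private temp file and remove it when the block exits."""
    fd, path = tempfile.mkstemp(suffix=suffix)
    try:
        with os.fdopen(fd, "wb") as handle:
            handle.write(audio)
        yield path
    finally:
        # Runs on normal exit and on exceptions, so no audio is left behind.
        if os.path.exists(path):
            os.remove(path)
```

A test can then assert that the file exists inside the `with` block and is gone afterwards, including when the block raises.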
@gromdimon gromdimon added the feature New feature or request label Feb 11, 2025
@gromdimon gromdimon added this to the v0.2.2 milestone Feb 11, 2025