feat: add tts-streaming config and feature #5492

Open
wants to merge 11 commits into
base: main


ic-xu
Contributor

@ic-xu ic-xu commented Jun 21, 2024

Description

Add TTS streaming configuration and feature support. Text to speech is added to the feature configuration options; click Configure to get started.

Fixes #5251

Type of Change

  • New feature (non-breaking change which adds functionality)

How Has This Been Tested?

Simply configure the streaming TTS option and test it

@dosubot dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. 💪 enhancement New feature or request labels Jun 21, 2024
@crazywoola crazywoola requested a review from takatost June 22, 2024 03:54
@crazywoola
Member

@charli117 Please review this pr as well.

@charli117
Contributor

@crazywoola ic-xu is a colleague on our team, and the version submitted here is the one we are currently running. One optimization is still outstanding: a blocking-mode output path for models that are not compatible with, or do not support, streaming TTS. We have discussed this with each other and will merge that change later.
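
The blocking-mode fallback mentioned in this comment could look roughly like the sketch below. It is only an illustration, not code from this PR: the `TtsModel` type, its `supportsStreaming` flag, the `speak` helper, and the sentence splitter are all hypothetical, and a string stands in for audio bytes.

```typescript
// Hypothetical model descriptor; `supportsStreaming` is an assumed flag,
// not a field taken from this PR.
type TtsModel = {
  name: string
  supportsStreaming: boolean
  // Returns the audio for a piece of text (a string stands in for audio bytes).
  synthesize: (text: string) => string
}

// Split text into sentence-like pieces so a streaming model can start early.
function splitSentences(text: string): string[] {
  const parts = text.match(/[^.!?]+[.!?]*/g) || []
  return parts.map(p => p.trim()).filter(p => p.length > 0)
}

// Emit audio through `onChunk`: one chunk per sentence when the model can
// stream, otherwise a single chunk for the whole text (blocking mode).
function speak(model: TtsModel, text: string, onChunk: (audio: string) => void): void {
  if (model.supportsStreaming) {
    splitSentences(text).forEach(sentence => onChunk(model.synthesize(sentence)))
    return
  }
  onChunk(model.synthesize(text))
}
```

With this shape the caller does not need to know which mode was used; it only receives one or more chunks in order.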

api/controllers/web/audio.py (outdated, resolved)
api/core/app/apps/base_app_queue_manager.py (outdated, resolved)
…e of streaming to implement TTS functionality
It has been extracted into a separate module, and the processing logic has been added to the corresponding module in GenerateTaskPipeline.process(). The expiration time has also been set to a small value, and Redis does not store whole message objects, only the portion of the data that is required, so the stored data will not be very large.
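
The caching approach described in this commit message (keep only the needed fields, expire them quickly) can be sketched as follows. This is an assumption-heavy illustration: a plain in-memory object stands in for Redis, and the `Message` shape, the `TtsTextCache` class, and the choice of `answer` as the only stored field are hypothetical rather than taken from the PR.

```typescript
// Hypothetical full message object; only `answer` is assumed to be needed for TTS.
type Message = {
  id: string
  conversationId: string
  answer: string
  metadata: Record<string, unknown>
}

type CacheEntry = { answer: string; expiresAt: number }

// In-memory stand-in for Redis with a per-entry time to live.
class TtsTextCache {
  private entries: { [id: string]: CacheEntry } = {}

  constructor(private ttlMs: number) {}

  // Store only the field TTS needs, not the whole message object.
  put(message: Message, now: number = Date.now()): void {
    this.entries[message.id] = { answer: message.answer, expiresAt: now + this.ttlMs }
  }

  // Return the cached text, or undefined if it is missing or has expired.
  get(id: string, now: number = Date.now()): string | undefined {
    const entry = this.entries[id]
    if (!entry) return undefined
    if (now >= entry.expiresAt) {
      delete this.entries[id]
      return undefined
    }
    return entry.answer
  }
}
```

Passing `now` explicitly keeps the expiry logic easy to test; real Redis would handle expiry itself through a key TTL.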
@crazywoola crazywoola requested a review from takatost July 3, 2024 11:31
web/types/app.ts (outdated, resolved)
@dosubot dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. and removed size:XL This PR changes 500-999 lines, ignoring generated files. labels Jul 4, 2024
@@ -79,6 +79,7 @@ export type TextToSpeechConfig = {
enabled: boolean
voice?: string
language?: string
autoPlay?: string
Member


Suggested change
- autoPlay?: string
+ autoPlay?: ttsAutoPlay

@@ -14,6 +14,7 @@ export type SuggestedQuestionsAfterAnswer = EnabledOrDisabled
export type TextToSpeech = EnabledOrDisabled & {
language?: string
voice?: string
autoPlay?: string
Member


Use the ttsAutoPlay enum here as well.
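
A minimal sketch of what the review asks for, typing `autoPlay` with an enum instead of `string`. The reviewer only names `ttsAutoPlay`; the PascalCase spelling, the `enabled`/`disabled` members, and the `shouldAutoPlay` helper below are assumptions made for illustration.

```typescript
// Assumed enum members; the review comment does not list them.
enum TtsAutoPlay {
  enabled = 'enabled',
  disabled = 'disabled',
}

// Same fields as the TextToSpeechConfig hunk above, with autoPlay narrowed to the enum.
type TextToSpeechConfig = {
  enabled: boolean
  voice?: string
  language?: string
  autoPlay?: TtsAutoPlay
}

// Hypothetical helper: auto-play only when TTS is on and autoPlay is explicitly enabled.
function shouldAutoPlay(config: TextToSpeechConfig): boolean {
  return config.enabled && config.autoPlay === TtsAutoPlay.enabled
}
```

With the enum in place, a misspelled value such as `autoPlay: 'enable'` is rejected by the compiler instead of silently disabling auto-play at runtime.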

Member

@crazywoola crazywoola left a comment


See comments

Labels
💪 enhancement New feature or request size:XXL This PR changes 1000+ lines, ignoring generated files.
4 participants