Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Configurable silence between audio chunks #55

Open
wants to merge 1 commit into
base: dev
Choose a base branch
from

Conversation

bi1101
Copy link

@bi1101 bi1101 commented Aug 28, 2024

No description provided.

@matatonic
Copy link
Owner

How does this work for you? Satisfied?

Given that the most common user right now is open-webui which also splits into sentences before sending to openai-speech, I'll probably add the silence pad to all output, not skipping the last.

@matatonic
Copy link
Owner

Can you add a command line option also, which takes the environment variable as the default? If you don't get to it, I will. Also - can you make the PR against the current dev branch instead? It's soon to be released.

@bi1101
Copy link
Author

bi1101 commented Aug 28, 2024

It works well.

I think addding padding to all sentences could be a better all-rounded approach for most usecases

@bi1101 bi1101 changed the base branch from main to dev August 28, 2024 19:53
@thiswillbeyourgithub
Copy link

How does this work for you? Satisfied?

Given that the most common user right now is open-webui which also splits into sentences before sending to openai-speech, I'll probably add the silence pad to all output, not skipping the last.

Hi, FYI since last release (0.3.16 i think) of openwebui we can set the tts split to be based on punctuation, paragraph or don't split

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants