[Feature request] Text to Speech #315

Closed
HyunjunA opened this issue Sep 18, 2023 · 4 comments · Fixed by #345
Labels
enhancement New feature or request

Comments

@HyunjunA

Name of the feature
In general, the feature you want added should be supported by HuggingFace's transformers library:

  • If requesting a model, it must be listed here.
  • If requesting a pipeline, it must be listed here.
  • If requesting a task, it must be listed here.

Model: SpeechT5_TTS (Text-to-Speech)
The model is available here on HuggingFace's platform.

Reason for request
Why is it important that we add this feature? What is your intended use case? Remember, we are more likely to add support for models/pipelines/tasks that are popular (e.g., many downloads), or contain functionality that does not exist (e.g., new input type).

Usefulness: The SpeechT5_TTS model is designed to convert text into spoken audio. This model could serve multiple purposes across different sectors, including but not limited to education, automation, and accessibility.

Popularity: As a Microsoft model, it is backed by significant research and development, making it one of the more robust and versatile options available for text-to-speech.

New Functionality: Though text-to-speech is not a new technology, the advanced capabilities of this model could offer more natural and clear speech, which is especially valuable in applications where voice clarity and natural intonation are important.

Additional context
Add any other context or screenshots about the feature request here.

HyunjunA added the enhancement label Sep 18, 2023
@xenova
Collaborator

xenova commented Sep 18, 2023

Hi there 👋 See #59 and #279 for existing feature requests. We are currently waiting for Optimum to support exporting speecht5 (and bark) to ONNX. Perhaps @fxmarty can provide an update?

@fxmarty

fxmarty commented Sep 19, 2023

Thank you. Given the interest, I could add support this week and do a release. Which architecture should take priority?

@HyunjunA
Author

I want to use this model: microsoft/speecht5_tts.

xenova linked a pull request Oct 23, 2023 that will close this issue
@aiascq

aiascq commented Dec 23, 2023

I want to use this model: bark.

4 participants