Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

speed of generating speech samples #19

Open
dengyan opened this issue Jul 27, 2017 · 1 comment
Open

speed of generating speech samples #19

dengyan opened this issue Jul 27, 2017 · 1 comment

Comments

@dengyan
Copy link

dengyan commented Jul 27, 2017

I found that SampleRNN need to be run in parallel to get fast generation speed. It takes only about 500 seconds for generating 200 utterances, each with a length of 8 seconds speech. But it will be very time costing if only run one sentence in generation, more than 40 seconds for 1 second speech. It seems it's not faster than Wavenet. Does anyone have some ideas on speeding up it?

@dengyan dengyan changed the title speed of genrating speech samples speed of generating speech samples Jul 27, 2017
@Cortexelus
Copy link

Cortexelus commented Dec 7, 2017

Using a p3x16large AWS instance
NVIDIA Tesla V100
CUDA 9

This appears to run 10x the speed of dengyan's setup.

It takes us 1000 seconds to generate 4 minute audio files.

If we generate 100 of these in parallel
that's 24 seconds of generative audio for every 1 second of processing

If we generate 1 of these:
That's 0.24 seconds of generative audio for every 1 second of processing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants