speed of generating speech samples #19

dengyan · 2017-07-27T06:17:07Z

I found that SampleRNN need to be run in parallel to get fast generation speed. It takes only about 500 seconds for generating 200 utterances, each with a length of 8 seconds speech. But it will be very time costing if only run one sentence in generation, more than 40 seconds for 1 second speech. It seems it's not faster than Wavenet. Does anyone have some ideas on speeding up it?

Cortexelus · 2017-12-07T23:11:36Z

Using a p3x16large AWS instance
NVIDIA Tesla V100
CUDA 9

This appears to run 10x the speed of dengyan's setup.

It takes us 1000 seconds to generate 4 minute audio files.

If we generate 100 of these in parallel
that's 24 seconds of generative audio for every 1 second of processing

If we generate 1 of these:
That's 0.24 seconds of generative audio for every 1 second of processing

dengyan changed the title ~~speed of genrating speech samples~~ speed of generating speech samples Jul 27, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speed of generating speech samples #19

speed of generating speech samples #19

dengyan commented Jul 27, 2017

Cortexelus commented Dec 7, 2017 •

edited

Loading

speed of generating speech samples #19

speed of generating speech samples #19

Comments

dengyan commented Jul 27, 2017

Cortexelus commented Dec 7, 2017 • edited Loading

Cortexelus commented Dec 7, 2017 •

edited

Loading