what is the best combination for low-latency? #4

sergenti · 2023-07-20T16:02:21Z

Best TTS + STT pair?
Is there one with medium-to-high quality and a short wait between responses?

lugia19 · 2023-07-20T16:55:48Z

In terms of TTS, the only real option is ElevenLabs, and it's already about as latency-optimized as it can be.
In terms of STT, I'd say local Whisper (if your GPU is up to the task of running it).

The smaller the model, the faster the voice recognition runs, but the less accurate it is. For English, I've found the medium size works well enough.
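To see the speed/accuracy trade-off on your own hardware, a rough sketch like the one below may help. It assumes the `openai-whisper` package (`pip install openai-whisper`) and times each model size on the same clip. The helper names (`load_whisper_transcriber`, `time_transcription`, `compare_sizes`) are made up for illustration and aren't part of any library. Model loading is kept outside the timed section, since in a real app you'd load the model once.

```python
import time
from typing import Callable, Dict, Iterable, Tuple

Transcriber = Callable[[str], str]


def load_whisper_transcriber(model_size: str) -> Transcriber:
    """Hypothetical helper: wrap a local Whisper model as path -> text."""
    # Imported lazily so the timing helpers work without the package installed.
    import whisper  # assumption: the openai-whisper package

    model = whisper.load_model(model_size)  # e.g. "tiny", "base", "small", "medium"
    return lambda audio_path: model.transcribe(audio_path)["text"]


def time_transcription(transcribe: Transcriber, audio_path: str) -> Tuple[str, float]:
    """Run one transcription and return (text, elapsed seconds)."""
    start = time.perf_counter()
    text = transcribe(audio_path)
    return text, time.perf_counter() - start


def compare_sizes(
    audio_path: str,
    sizes: Iterable[str] = ("tiny", "base", "small", "medium"),
    loader: Callable[[str], Transcriber] = load_whisper_transcriber,
) -> Dict[str, Tuple[str, float]]:
    """Transcribe the same clip with each model size and record the timings."""
    results: Dict[str, Tuple[str, float]] = {}
    for size in sizes:
        transcribe = loader(size)  # model load is not included in the timing
        results[size] = time_transcription(transcribe, audio_path)
    return results
```

Calling `compare_sizes("clip.wav")` returns a dict mapping each size to its transcript and elapsed seconds, so you can compare the text by eye and pick the smallest model whose output is still good enough. The first call for each size also downloads the model weights if they aren't cached yet.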
