TTS Microservice

TTS (Text-To-Speech) microservice helps users convert text to speech. When building a talking bot with LLM, users might need an LLM generated answer in audio format. This microservice is built for that conversion stage.

1.2 Start SpeechT5 Service/Test

Xeon CPU

cd dependency/
nohup python speecht5_server.py --device=cpu &
curl http://localhost:7055/v1/tts -XPOST -d '{"text": "Who are you?"}' -H 'Content-Type: application/json'

Gaudi2 HPU

pip install optimum[habana]

cd dependency/
nohup python speecht5_server.py --device=hpu &
curl http://localhost:7055/v1/tts -XPOST -d '{"text": "Who are you?"}' -H 'Content-Type: application/json'

1.3 Start TTS Service/Test

python tts.py

curl http://localhost:9088/v1/audio/speech -XPOST -d '{"text": "Who are you?"}' -H 'Content-Type: application/json'

🚀2. Start Microservice with Docker (Option 2)

Alternatively, you can start the TTS microservice with Docker.

2.1 Build Images

2.1.1 SpeechT5 Server Image

Xeon CPU

cd ../../../
docker build -t opea/speecht5:latest --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/tts/speecht5/dependency/Dockerfile .

Gaudi2 HPU

cd ../../../
docker build -t opea/speecht5-gaudi:latest --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/tts/speecht5/dependency/Dockerfile.intel_hpu .

2.1.2 TTS Service Image

docker build -t opea/tts:latest --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/tts/speecht5/Dockerfile .

2.2 Start SpeechT5 and TTS Service

2.2.1 Start SpeechT5 Server

Xeon

docker run -p 7055:7055 --ipc=host -e http_proxy=$http_proxy -e https_proxy=$https_proxy opea/speecht5:latest

Gaudi2 HPU

docker run -p 7055:7055 --runtime=habana -e HABANA_VISIBLE_DEVICES=all -e OMPI_MCA_btl_vader_single_copy_mechanism=none --cap-add=sys_nice --ipc=host -e http_proxy=$http_proxy -e https_proxy=$https_proxy opea/speecht5-gaudi:latest

2.2.2 Start TTS service

ip_address=$(hostname -I | awk '{print $1}')

docker run -p 9088:9088 --ipc=host -e http_proxy=$http_proxy -e https_proxy=$https_proxy -e TTS_ENDPOINT=http://$ip_address:7055 opea/tts:latest

2.2.3 Test

curl http://localhost:7055/v1/tts -XPOST -d '{"text": "Who are you?"}' -H 'Content-Type: application/json'

# openai protocol compatible
# voice can be 'male' or 'default'
curl http://localhost:7055/v1/audio/speech -XPOST -d '{"input":"Who are you?", "voice": "male"}' -H 'Content-Type: application/json' --output speech.wav

curl http://localhost:9088/v1/audio/speech -XPOST -d '{"text": "Who are you?"}' -H 'Content-Type: application/json'

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

TTS Microservice

1.2 Start SpeechT5 Service/Test

1.3 Start TTS Service/Test

🚀2. Start Microservice with Docker (Option 2)

2.1 Build Images

2.1.1 SpeechT5 Server Image

2.1.2 TTS Service Image

2.2 Start SpeechT5 and TTS Service

2.2.1 Start SpeechT5 Server

2.2.2 Start TTS service

2.2.3 Test

Files

README.md

Latest commit

History

README.md

File metadata and controls

TTS Microservice

1.2 Start SpeechT5 Service/Test

1.3 Start TTS Service/Test

🚀2. Start Microservice with Docker (Option 2)

2.1 Build Images

2.1.1 SpeechT5 Server Image

2.1.2 TTS Service Image

2.2 Start SpeechT5 and TTS Service

2.2.1 Start SpeechT5 Server

2.2.2 Start TTS service

2.2.3 Test