This code starts a voice chat with an LLM. Voice activity detection (VAD) is used to detect the start and end of speech. The language and voice models used are:
- OpenAI Whisper for audio-to-text transcription;
- Llama3-8b, served through the Groq API for faster responses;
- OpenAI TTS for text-to-speech.
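
To illustrate what detecting the start and end of speech means, here is a minimal sketch of an energy-threshold VAD. This is an assumption for illustration only: this README does not say which VAD the app uses, and the function name `detect_speech` and the parameters `threshold` and `hangover` are hypothetical, not taken from app.py.

```python
def detect_speech(frame_energies, threshold=0.5, hangover=2):
    """Return (start, end) frame index pairs for speech segments.

    Illustrative energy-threshold VAD, not the app's actual detector.
    A frame counts as speech when its energy exceeds `threshold`.
    A segment ends once `hangover` consecutive quiet frames are seen.
    `end` is exclusive (one past the last speech frame).
    """
    segments = []
    start = None   # index where the current segment began
    silent = 0     # consecutive quiet frames seen inside a segment
    for i, energy in enumerate(frame_energies):
        if energy > threshold:
            if start is None:
                start = i          # start of speech
            silent = 0
        elif start is not None:
            silent += 1
            if silent >= hangover:
                # end of speech: drop the trailing quiet frames
                segments.append((start, i - silent + 1))
                start, silent = None, 0
    if start is not None:
        # audio ended while a segment was still open
        segments.append((start, len(frame_energies) - silent))
    return segments
```

The `hangover` count keeps a short pause inside a sentence from being treated as the end of the utterance. In a pipeline like this one, each detected segment would then be passed to transcription.
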
- Create a .env file containing your API keys:
OPENAI_API_KEY=
GROQ_API_KEY=
- Run the following commands:
pip install -r requirement.txt
python app.py
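
If the app fails at startup, a common cause is a missing key. A minimal sketch of a check, assuming the values from the .env file have already been loaded into the process environment (how app.py loads them is not shown here); `missing_keys` is a hypothetical helper, not part of this repository.

```python
import os

# The two keys listed in the .env file above.
REQUIRED_KEYS = ("OPENAI_API_KEY", "GROQ_API_KEY")


def missing_keys(env=None):
    """Return the required keys that are unset or empty in `env`.

    `env` defaults to the current process environment; passing a dict
    makes the check easy to try without touching real variables.
    """
    env = os.environ if env is None else env
    return [key for key in REQUIRED_KEYS if not env.get(key)]
```

Calling `missing_keys()` with no arguments checks the real environment; an empty list means both keys are set.
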