Speech-OraAgent is an open-source Automated-Agent Driven aackend for real-time speech-to-speech conversion, leveraging state-of-the-art models from the Hugging Face ecosystem to create interactions similar to GPT-like conversations.
This Project implements a speech-to-speech Server-Side/Client-side backend calling APis System design with the following components based Open source model used to built so :
- Voice Activity Detection (VAD): Powered by Silero VAD v5.
- Speech to Text (STT): Uses Whisper models from the Hugging Face hub.
- Language Model (LM): Any Hugging Face instruct model can be used.
- Text to Speech (TTS): Uses Parler-TTS for speech synthesis.
Note : you can re-load a different Open Source model Or using APIs support Model Via Key APIs by Providers
The pipeline is modular and flexible, allowing customization at each stage:
- VAD: Integrates the Silero VAD.
- STT: Compatible with any Whisper model, including Distil-Whisper and multilingual variants.
- LM: Swap language models easily via the Hugging Face model ID.
- TTS: Uses Parler-TTS architecture but supports multiple checkpoints, including multilingual ones.
All components are implemented as independent classes for easy customization.
-
Clone the repository:
git clone https://github.com/deep-matter/Speech-Ora cd Speech-Ora
-
Install the required dependencies:
pip install -r requirements.txt
here stpes to run docker-Image of full the code run top on Containter .
Before make to have Access into GPU Local host access into your container
- Follow the Commands line to setup your own Access Key GPU Locally to used by Docker Image
distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list
- install Nvidia-Toolkit container
sudo apt-get install -y nvidia-container-toolkit
- Now, configure the Docker daemon to recognize the NVIDIA Container Runtime:
sudo nvidia-ctk runtime configure --runtime=docker