Voice-to-Voice Translation System

Voice-to-Voice Translation System

A Gradio-based application for end-to-end voice translation. It uses pyannote for speaker diarization, NVIDIA Canary-1b-v2 for translation, and Piper-TTS for voice synthesis.

Installation

Prerequisites

NVIDIA GPU with CUDA Toolkit 12.8 installed.
Python 3.10 (ensure it's added to your system's PATH).

Setup Steps

Clone the repository:

git clone https://github.com/Juste-Leo2/VoiceToVoice-Translation.git
cd VoiceToVoice-Translation

Create and activate a virtual environment:

python -m venv .venv

# On Windows
.venv\Scripts\activate

# On Linux / macOS
# source .venv/bin/activate

Install dependencies using uv:

python -m pip install --upgrade pip
pip install uv
uv pip install -r requirements.txt --no-deps --index-strategy unsafe-best-match

Usage

Run the application:
```
python app.py
```
Open the local URL provided in the terminal (e.g., http://127.0.0.1:7860) in your browser.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
README_FR.md		README_FR.md
V2.zip		V2.zip
app.py		app.py
downloader.py		downloader.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Voice-to-Voice Translation System

Installation

Prerequisites

Setup Steps

Usage

About

Uh oh!

Releases

Packages

Languages

License

Juste-Leo2/VoiceToVoice-Translation

Folders and files

Latest commit

History

Repository files navigation

Voice-to-Voice Translation System

Installation

Prerequisites

Setup Steps

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages