GitHub - NoahBishop/index-tts

👉🏻 IndexTTS2 👈🏻

IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech

IndexTTS

A Windows-focused deployment of the IndexTTS project that uses a Conda environment for dependency management instead of uv. This setup is designed for users running Windows with an NVIDIA GPU.

Note: This project is a modified setup of the original IndexTTS repository. All credit for the core model and research goes to the original authors.

🚀 Key Changes from Official Setup

Conda Environment: Uses Anaconda for managing Python packages and dependencies, which is a familiar tool for many Windows users in the ML community.

ModelScope Downloads: Fetches the model weights from ModelScope rather than Hugging Face.

Create the Conda environment:

conda create -n index-tts -y python=3.10
conda activate index-tts
conda install -c conda-forge ffmpeg

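Clone the project and set up the environment. Note: a plain 'pip install torch' now installs the CPU-only build, so pass '-f https://mirrors.aliyun.com/pytorch-wheels/<your-cuda-version>/' to make pip find the GPU build of torch. The commands below use cu126 (CUDA 12.6); substitute the directory that matches your CUDA version.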

git clone https://github.com/NoahBishop/index-tts.git
cd index-tts
pip install torch -f https://mirrors.aliyun.com/pytorch-wheels/cu126/
pip install torchaudio -f https://mirrors.aliyun.com/pytorch-wheels/cu126/
pip install -r requirements.txt -i https://mirrors.aliyun.com/pypi/simple/ --trusted-host=mirrors.aliyun.com
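
Because the point of the '-f' flag is to avoid the CPU-only wheel, it can help to confirm which build was actually installed. The following is a minimal sketch, not part of the repository; the function name describe_torch is hypothetical, and it only uses torch.__version__, torch.version.cuda and torch.cuda.is_available().

```python
import importlib.util


def describe_torch():
    """Return a small report about the torch build in the active environment."""
    # find_spec avoids an ImportError when torch is not installed at all.
    if importlib.util.find_spec("torch") is None:
        return {
            "installed": False,
            "version": None,
            "cuda_build": None,
            "cuda_available": False,
        }

    import torch

    return {
        "installed": True,
        "version": torch.__version__,
        # torch.version.cuda is None on CPU-only wheels.
        "cuda_build": torch.version.cuda,
        # True only if a CUDA build is installed and a usable GPU is visible.
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    for key, value in describe_torch().items():
        print(f"{key}: {value}")
```

If cuda_build is None, pip most likely picked the CPU wheel and the torch install step should be repeated with the wheel directory that matches your CUDA version.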

Download the models to local directories:

modelscope download --model IndexTeam/IndexTTS-2 --local_dir checkpoints
modelscope download --model facebook/w2v-bert-2.0 --local_dir models/facebook/w2v-bert-2.0
modelscope download --model amphion/MaskGCT semantic_codec/model.safetensors --local_dir models/amphion/MaskGCT
modelscope download --model iic/speech_campplus_sv_zh-cn_16k-common campplus_cn_common.bin --local_dir models/iic/speech_campplus_sv_zh-cn_16k-common
modelscope download --model nv-community/bigvgan_v2_22khz_80band_256x bigvgan_generator.pt --local_dir models/nv-community/bigvgan_v2_22khz_80band_256x
modelscope download --model nv-community/bigvgan_v2_22khz_80band_256x config.json --local_dir models/nv-community/bigvgan_v2_22khz_80band_256x
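
After the downloads finish, a quick check that every expected file landed in place can save a failed start later. This is a sketch under one stated assumption: that modelscope keeps each file's repository-relative path (for example semantic_codec/model.safetensors) beneath the given --local_dir. The names EXPECTED and missing_paths are hypothetical helpers, not part of the project.

```python
from pathlib import Path

# Paths derived from the --local_dir arguments in the commands above.
# Assumption: files keep their repository-relative path under --local_dir.
EXPECTED = [
    "checkpoints",
    "models/facebook/w2v-bert-2.0",
    "models/amphion/MaskGCT/semantic_codec/model.safetensors",
    "models/iic/speech_campplus_sv_zh-cn_16k-common/campplus_cn_common.bin",
    "models/nv-community/bigvgan_v2_22khz_80band_256x/bigvgan_generator.pt",
    "models/nv-community/bigvgan_v2_22khz_80band_256x/config.json",
]


def missing_paths(root="."):
    """Return the entries of EXPECTED that are absent (or empty) under root."""
    root = Path(root)
    missing = []
    for rel in EXPECTED:
        path = root / rel
        if path.is_dir():
            # A directory only counts if it contains at least one entry.
            present = any(path.iterdir())
        else:
            present = path.is_file()
        if not present:
            missing.append(rel)
    return missing


if __name__ == "__main__":
    gaps = missing_paths(".")
    if gaps:
        print("Missing or empty:")
        for rel in gaps:
            print("  " + rel)
    else:
        print("All expected model paths are present.")
```

Run it from the index-tts folder. It checks presence only and does not verify file sizes or checksums.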

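Run the web UI. 'where python' lists the Python interpreters on your PATH; use the one that belongs to the index-tts environment: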

where python
"your env python path" webui.py
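
As an alternative to copying the interpreter path by hand, a small launcher can reuse whichever interpreter is running it. This is a hedged sketch, not something the repository ships: webui_command is a hypothetical helper, and it relies on the index-tts environment being active so that sys.executable points at that environment's python.

```python
import subprocess
import sys
from pathlib import Path


def webui_command(script="webui.py"):
    """Build the launch command using the interpreter that runs this script."""
    # sys.executable is the absolute path of the current Python interpreter,
    # which is the path that 'where python' is otherwise used to look up.
    return [sys.executable, script]


if __name__ == "__main__":
    if Path("webui.py").is_file():
        # Hand the web UI's exit code back to the shell.
        raise SystemExit(subprocess.run(webui_command()).returncode)
    print("webui.py not found; run this from the index-tts folder.")
```

Save it next to webui.py and start it with python from the activated environment.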

🙏 Acknowledgments

Original IndexTTS project: https://github.com/index-tts/index-tts/

