This repository is a fork of Coqui-AI's 🐸TTS, used for research on expressive TTS in our AI-Unicamp-CPQD group. The original code is kept in the "main" branch, which is not the default branch of this repository.
We use the "unicamp" branch as our default branch, while the "main" branch mirrors the original project and is kept up to date with it. The original README.md can be found on the "main" branch.
We are an expressive TTS research group located at Unicamp and CPQD (Brazil).
- Tacotron 2
- FastPitch
- EmoV-DB
- IEMOCAP
- ESD
- Look-Up
- Reference Encoder (Coarse/Fine-Grained)
- GST
- VAE
- VQ-VAE
- VAE+Flow
- Diffusion
- Style Classifier
- Speaker Classifier + GRL (Gradient Reversal Layer)
- Pitch
- Energy
- Mel-Spectrogram
- Sum, Concat or AdaIN
- Orthogonal Loss
- CLIP Loss
- Cycle Consistency Loss (*)