LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,903 197 Updated Apr 19, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 11,549 1,616 Updated Apr 25, 2025

chomeyama / wavehax

Official repository of Wavehax vocoder

Python 46 3 Updated Nov 30, 2024

lucidrains / mmdit

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch

Python 345 9 Updated Jan 12, 2025

dmlguq456 / NeXt_TDNN_ASV

Official repository of NeXt-TDNN for speaker verification

Python 70 7 Updated Oct 10, 2024

RVC-Boss / GPT-SoVITS

As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning)

Python 45,568 5,032 Updated Apr 25, 2025

supertone-inc / super-monotonic-align

Python 143 9 Updated Sep 19, 2024

lucidrains / e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 462 44 Updated Mar 12, 2025

sh-lee-prml / PeriodWave

The official Implementation of PeriodWave and PeriodWave-Turbo

Python 187 11 Updated Apr 14, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 21,493 1,520 Updated Feb 6, 2025

Haoqiu-Yan / PerceptiveAgent

Code for "Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction" (ACL24)

Python 44 2 Updated Aug 6, 2024

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,213 642 Updated May 31, 2024

bootphon / phonemizer

Simple text to phones converter for multiple languages

Python 1,369 184 Updated Sep 26, 2024

scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction

Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)

Python 61 4 Updated Apr 4, 2024

cokelaer / spectrum

Spectral Analysis in Python

Python 354 91 Updated Jan 27, 2025

jwj7140 / Bert-VITS2-Korean

VITS2 backbone with multilingual BERT (Korean support)

Python 26 1 Updated Apr 6, 2024

Hyungchan Yoon (윤형찬) hcy71o

Stars

nari-labs / dia

morpho-org / morpho-blue-liquidation-bot

EraX-AI / viF5TTS

reppy4620 / x-vits

line / promptttspp

THUDM / GLM-4-Voice

zama-ai / concrete

NousResearch / DisTrO

liutaocode / TTS-arxiv-daily

yl4579 / StyleTTS-ZS

Xiaobin-Rong / gtcrn

yxlu-0102 / MP-SENet

zhenye234 / FlashSpeech

keonlee9420 / evaluate-zero-shot-tts

ictnlp / LLaMA-Omni