leonardoboulitreau

🤖

Leonardo Boulitreau

Speech Researcher @CPqD.

14 followers · 40 following

CPqD
São Paulo, Brasil
leonardoboulitreau.github.io/

Lists (2)

Datasets 🛢

1 repository

Speech2Speech

1 repository

Stars

thuhcsi / SECap

Python · 149 stars · 13 forks · Updated Jul 9, 2024

schufo / umss

Unsupervised Music Source Separation Using Differentiable Parametric Source Models

Python · 61 stars · 9 forks · Updated Mar 19, 2023

gwx314 / TechSinger

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching

Python · 17 stars · 1 fork · Updated Dec 15, 2024

MLOps-Courses / mlops-coding-course

Learn how to create, develop, and maintain a state-of-the-art MLOps code base

Python · 396 stars · 57 forks · Updated Dec 21, 2024

Takaaki-Saeki / DiscreteSpeechMetrics

Reference-aware automatic speech evaluation toolkit

Python · 135 stars · 10 forks · Updated Dec 5, 2024

Danny-NUS / SinTechSVS

Forked from yamathcy/ISMIR2022J-POP

Supplementary Materials of paper "SinTechSVS: A Singing Technique Controllable Singing Voice Synthesis System" by Junchuan Zhao, Low Qi Hong Chetwin, Ye Wang.

4 stars · Updated May 16, 2024

imxtx / awesome-controllabe-speech-synthesis

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".

103 stars · 2 forks · Updated Dec 10, 2024

hongwen-sun / AudioFormer

AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes. SOTA in AudioSet

5 stars · Updated Aug 15, 2023

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python · 1,908 stars · 135 forks · Updated Dec 31, 2024

DISTRHO / Cardinal

Virtual modular synthesizer plugin

C++ · 2,342 stars · 155 forks · Updated Dec 31, 2024

fishaudio / fish-speech

SOTA Open Source TTS

Python · 17,948 stars · 1,346 forks · Updated Dec 29, 2024

huggingface / smol-course

A course on aligning smol models.

Jupyter Notebook · 3,732 stars · 1,193 forks · Updated Dec 30, 2024

Stability-AI / stable-codec

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

191 stars · 4 forks · Updated Dec 3, 2024

voidful / Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

Python · 236 stars · 22 forks · Updated Nov 1, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python · 16,834 stars · 1,669 forks · Updated Dec 19, 2024

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python · 1,678 stars · 117 forks · Updated Dec 12, 2024

kyutai-labs / moshi

Python · 7,079 stars · 551 forks · Updated Dec 20, 2024

msplabresearch / MSP-Podcast_Challenge_IS2025

MSP-Podcast Challenge Baseline Code for Interspeech 2025

Python · 19 stars · 5 forks · Updated Dec 4, 2024

cantabile-kwok / vec2wav2.0

Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995

Python · 63 stars · 5 forks · Updated Dec 3, 2024

unslothai / unsloth

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python · 19,874 stars · 1,405 forks · Updated Jan 1, 2025

kinggongzilla / ai-clone-whatsapp

Create an AI clone of yourself from your WhatsApp chats (using Llama 3)

Python · 362 stars · 43 forks · Updated Dec 13, 2024

meta-llama / llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook · 15,789 stars · 2,340 forks · Updated Dec 23, 2024