Skip to content
View leonardoboulitreau's full-sized avatar
🤖
🤖

Block or report leonardoboulitreau

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 149 13 Updated Jul 9, 2024

Unsupervised Music Source Separation Using Differentiable Parametric Source Models

Python 61 9 Updated Mar 19, 2023

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching

Python 17 1 Updated Dec 15, 2024

Learn how to create, develop, and maintain a state-of-the-art MLOps code base

Python 396 57 Updated Dec 21, 2024

Reference-aware automatic speech evaluation toolkit

Python 135 10 Updated Dec 5, 2024

Supplementary Materials of paper "SinTechSVS: A Singing Technique Controllable Singing Voice Synthesis System" by Junchuan Zhao, Low Qi Hong Chetwin, Ye Wang.

4 Updated May 16, 2024

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".

103 2 Updated Dec 10, 2024

AudioFormer:Audio Transformer learns audio feature representations from discrete acoustic codes.SOTA in AudioSet

5 Updated Aug 15, 2023

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,908 135 Updated Dec 31, 2024

Virtual modular synthesizer plugin

C++ 2,342 155 Updated Dec 31, 2024

SOTA Open Source TTS

Python 17,948 1,346 Updated Dec 29, 2024

A course on aligning smol models.

Jupyter Notebook 3,732 1,193 Updated Dec 30, 2024

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

191 4 Updated Dec 3, 2024

Audio Codec Speech processing Universal PERformance Benchmark

Python 236 22 Updated Nov 1, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,834 1,669 Updated Dec 19, 2024

A fast multimodal LLM for real-time voice

Python 1,678 117 Updated Dec 12, 2024
Python 7,079 551 Updated Dec 20, 2024

MSP-Podcast Challenge Baseline Code for Interspeech 2025

Python 19 5 Updated Dec 4, 2024

Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995

Python 63 5 Updated Dec 3, 2024

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 19,874 1,405 Updated Jan 1, 2025

Create an AI clone of yourself from your WhatsApp chats (using Llama 3)

Python 362 43 Updated Dec 13, 2024

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 15,789 2,340 Updated Dec 23, 2024

End-to-End Speech Processing Toolkit

Python 8,638 2,200 Updated Dec 31, 2024

Automatic headphone equalization from frequency responses

Python 13,632 2,481 Updated Jul 27, 2024

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

Go 105,191 8,413 Updated Jan 1, 2025

A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline

Python 102 2 Updated Dec 13, 2024

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Python 102 6 Updated Sep 20, 2024

[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

Python 107 8 Updated Oct 18, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,356 226 Updated Dec 31, 2024
Next