murGAN: Singing Voice to Murga Singing Style Transfer

This repository contains a simple approach to transfer the singing style of classical or pop singers to the unique style of Uruguayan Murga (of course, it can actually be trained to transfer to any style).

Murga is a modern musical expression native to Uruguay. Its singing has distinctive characteristics, especially when compared to pop singing or classical European choral singing. There are significant differences in spectral centroid and spectral flatness. Murga singing also often presents deviations from the fundamental frequency and little vibrato, among other particularities of intonation and vocal expression.
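To make the two spectral descriptors mentioned above concrete, here is a minimal sketch of how they can be computed for a single audio frame. It is an illustration only, not code from this repository: the function names and the naive DFT are assumptions chosen to keep the example dependency-free.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Magnitudes of the non-negative-frequency DFT bins of one audio frame."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def spectral_centroid(magnitudes, sample_rate, frame_length):
    """Magnitude-weighted mean frequency of the frame, in Hz."""
    total = sum(magnitudes)
    if total == 0:
        return 0.0  # silent frame: no meaningful centroid
    bin_hz = sample_rate / frame_length  # frequency spacing between DFT bins
    return sum(k * bin_hz * m for k, m in enumerate(magnitudes)) / total


def spectral_flatness(magnitudes, eps=1e-10):
    """Geometric mean divided by arithmetic mean of the power spectrum."""
    power = [m * m + eps for m in magnitudes]  # eps avoids log(0)
    geometric = math.exp(sum(math.log(p) for p in power) / len(power))
    arithmetic = sum(power) / len(power)
    return geometric / arithmetic


# Usage: a pure 800 Hz tone versus a single-sample click.
sample_rate, n = 6400, 64
tone = [math.sin(2 * math.pi * 800 * t / sample_rate) for t in range(n)]
click = [1.0] + [0.0] * (n - 1)
tone_mags = magnitude_spectrum(tone)
click_mags = magnitude_spectrum(click)
```

The tone has its centroid at about 800 Hz and a flatness close to 0, while the click has a flat spectrum and a flatness of 1. For real recordings, a library such as librosa (librosa.feature.spectral_centroid, librosa.feature.spectral_flatness) is a more practical choice than this naive DFT.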

I use a Generative Adversarial Network (GAN) architecture that borrows an idea from StarGAN: the discriminator learns not only to distinguish real from fake audio, but also to determine whether the output of the generator belongs to the desired domain.
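The two roles of the discriminator can be sketched as a loss with two terms: one for real versus fake, and one for domain classification. The code below is a hedged illustration, not the implementation in src/train.py. It assumes plain cross-entropy terms and a hypothetical weight lambda_cls, whereas the StarGAN paper trains with a Wasserstein objective with gradient penalty.

```python
import math

EPS = 1e-12  # keeps the logarithms finite


def binary_cross_entropy(prob, target):
    """Log loss for one real/fake prediction (target is 1.0 or 0.0)."""
    prob = min(max(prob, EPS), 1.0 - EPS)
    return -(target * math.log(prob) + (1.0 - target) * math.log(1.0 - prob))


def cross_entropy(domain_probs, true_domain):
    """Log loss for one domain prediction given class probabilities."""
    return -math.log(max(domain_probs[true_domain], EPS))


def discriminator_loss(real_prob, fake_prob, real_domain_probs, real_domain,
                       lambda_cls=1.0):
    """Real/fake term plus domain classification on real audio."""
    adversarial = binary_cross_entropy(real_prob, 1.0) + binary_cross_entropy(fake_prob, 0.0)
    classification = cross_entropy(real_domain_probs, real_domain)
    return adversarial + lambda_cls * classification


def generator_loss(fake_prob, fake_domain_probs, target_domain, lambda_cls=1.0):
    """Fool the real/fake head and be classified as the target domain."""
    adversarial = binary_cross_entropy(fake_prob, 1.0)
    classification = cross_entropy(fake_domain_probs, target_domain)
    return adversarial + lambda_cls * classification
```

Here domain 0 could stand for non-murga and domain 1 for murga. The generator is rewarded when the discriminator both accepts its output as real and classifies it as murga.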

Getting Started

Clone the repository:

```
git clone git@github.com:ivanlmh/murGAN.git
cd murGAN
```

Set up the environment (the environment file is spelled enviroment.yml in the repository):

```
conda env create -f enviroment.yml
conda activate murGAN
```

Install the project in development mode:
```
pip install -e .
```

Dataset Preparation

The datasets I use consist of two folders: one with "Murga" style singing and another with non-murga singing (classical, pop, etc.). Each track in the dataset should be at least 10 seconds long to ensure a consistent input size for the model. For the non-murga folder I use the vocals from MUSDB18 and the stems from the Choral Singing Dataset (see Acknowledgments).
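A quick way to check the 10-second requirement before training is sketched below. It is an illustration under assumptions, not part of the repository: it assumes the tracks are PCM WAV files, and the helper names are hypothetical.

```python
import wave
from pathlib import Path

MIN_SECONDS = 10.0  # minimum track length stated above


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(str(path), "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def split_by_duration(folder, min_seconds=MIN_SECONDS):
    """Return (long_enough, too_short) lists of WAV paths found under folder."""
    long_enough, too_short = [], []
    for path in sorted(Path(folder).rglob("*.wav")):
        if wav_duration_seconds(path) >= min_seconds:
            long_enough.append(path)
        else:
            too_short.append(path)
    return long_enough, too_short
```

Calling split_by_duration on each of the two dataset folders lists the tracks that would need to be removed or concatenated before training.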

In the scripts folder you will find a script that creates symlinks to audio files, so that you can add files from other projects to a local data folder and keep things tidy without using extra disk space.
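The idea behind such a script can be sketched as follows. This is not the script shipped in the scripts folder: the function name, the set of extensions, and the flat target layout are all assumptions made for illustration.

```python
import os
from pathlib import Path

AUDIO_EXTENSIONS = {".wav", ".flac", ".mp3"}  # assumed set of audio extensions


def link_audio_files(source_dir, target_dir):
    """Symlink every audio file found under source_dir into target_dir."""
    source_dir = Path(source_dir).resolve()  # absolute link targets survive moves of target_dir
    target_dir = Path(target_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    created = []
    for path in sorted(source_dir.rglob("*")):
        if not path.is_file() or path.suffix.lower() not in AUDIO_EXTENSIONS:
            continue
        link = target_dir / path.name
        if link.exists() or link.is_symlink():
            continue  # skip names already present rather than overwrite
        os.symlink(path, link)
        created.append(link)
    return created
```

Because the target folder is flat, two source files with the same name in different subfolders collide and only the first is linked. A real script may prefer to mirror the source directory structure instead.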

Model Architecture

The model leverages the StarGAN paradigm. For more details on the architecture, refer to the original StarGAN paper.

Training

To train the model, run:

```
python src/train.py
```

Ensure that both the murga and non-murga datasets are prepared and placed in their respective directories before training.

Inference

Once the model is trained, convert classical singing to the "Murga" style using:

```
python src/inference.py --input "path_to_input_classical_audio.wav" --output "path_to_output_murga_audio.wav"
```

Tests

To run the unit tests:

```
python -m unittest discover tests
```

Acknowledgments

Choral Singing Dataset: https://zenodo.org/records/2649950
MUSDB18: https://sigsep.github.io/datasets/musdb.html#musdb18-compressed-stems
StarGAN: https://arxiv.org/abs/1711.09020
