Personal Neural Codec

notes:

python academic/train.py -c exp_configs/experiment_soundstream_2_debug.yaml

scripts to get LibriTTS are in data/get_data_scripts
to run an experiment run python run_test.py -c ./exp_configs/sound_reprod_lucid.yaml -n test.
- c -config path
- n - experiment / run name
exp_configs contains degub flag to test on dummy dataset

TODO:

Add our loggin and wand db
Remove model saving from trainer, move it to logger
Add exp_name saving into model dict
Check coding from SoundStream
Remove installable ResidualVQ, add raw code files
Train on a bigger dataset
Compar results and metrics

Personal Neural Codec

This project aims to build a machine learning model for encoding and decoding voice data. The model focuses on learning efficient representations of voice while maintaining high-quality audio reconstruction.

Experiments

The following table summarizes the experiments conducted in this project:

Experiment Name	Experiment Description
Experiment 1	VQ-VAE model trained on mel spectrograms for audio encoding/decoding
Experiment 2	TBD
Experiment 3	TBD

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
academic		academic
data		data
debug_dataset		debug_dataset
exp_configs		exp_configs
pnc		pnc
.gitignore		.gitignore
README.md		README.md
inference.py		inference.py
requirements.txt		requirements.txt
run_test.py		run_test.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Personal Neural Codec

Experiments

About

Releases

Packages

Contributors 3

Languages

andorxornot/PersonalNeuralCodec

Folders and files

Latest commit

History

Repository files navigation

Personal Neural Codec

Experiments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages