Audio Progressive Resize

A repository for benchmarking audio tagging models and techniques, with the goal of reproducible and reliable results.

Dataset structure

baseline_dataset houses the abstract audio dataset class.

Each dataset builds its own file list and reads a CSV file containing the label(s) for each file.
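As a rough illustration of this layout, the sketch below shows an abstract base class whose subclasses decide how the file list is built, while label loading from a CSV is shared. All names here (BaseAudioDataset, WavFolderDataset, build_demo_dataset) and the CSV layout (fname and labels columns) are assumptions made for the example, not the repository's actual code.

```python
import csv
import tempfile
from abc import ABC, abstractmethod
from pathlib import Path


class BaseAudioDataset(ABC):
    """Hypothetical abstract dataset: subclasses decide how files are found."""

    def __init__(self, root, labels_csv):
        self.root = Path(root)
        self.files = self.list_files()
        self.labels = self.read_labels(labels_csv)

    @abstractmethod
    def list_files(self):
        """Return the audio files that belong to this dataset."""

    @staticmethod
    def read_labels(labels_csv):
        # Assumed CSV layout: an "fname" column and a comma-separated "labels" column.
        with open(labels_csv, newline="") as handle:
            return {
                row["fname"]: row["labels"].split(",")
                for row in csv.DictReader(handle)
            }

    def __len__(self):
        return len(self.files)

    def __getitem__(self, index):
        path = self.files[index]
        return path, self.labels[path.name]


class WavFolderDataset(BaseAudioDataset):
    """Example subclass: every .wav file directly under root is an item."""

    def list_files(self):
        return sorted(self.root.glob("*.wav"))


def build_demo_dataset():
    """Create a tiny throwaway dataset on disk so the sketch can be run."""
    root = Path(tempfile.mkdtemp())
    for name in ("a.wav", "b.wav"):
        (root / name).touch()
    labels_csv = root / "train.csv"
    with open(labels_csv, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["fname", "labels"])
        writer.writerow(["a.wav", "Bark"])
        writer.writerow(["b.wav", "Bark,Meow"])
    return WavFolderDataset(root, labels_csv)
```

Calling build_demo_dataset() returns a two-item dataset whose items are (path, labels) pairs. Audio loading is left out on purpose, since it depends on the audio backend in use.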

Normalization statistics are calculated inside __init__.
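A minimal sketch of that idea, assuming the statistics are a single global mean and standard deviation over all feature values. The class name NormalizedDataset and the use of plain Python lists in place of spectrogram tensors are assumptions made for illustration.

```python
import math


class NormalizedDataset:
    """Hypothetical dataset over precomputed features (lists of floats)."""

    def __init__(self, features, eps=1e-8):
        self.features = features
        self.eps = eps
        # Statistics are computed once, at construction time.
        values = [v for feature in features for v in feature]
        self.mean = sum(values) / len(values)
        variance = sum((v - self.mean) ** 2 for v in values) / len(values)
        self.std = math.sqrt(variance)

    def __len__(self):
        return len(self.features)

    def __getitem__(self, index):
        # Each item is normalized with the dataset-wide statistics.
        return [(v - self.mean) / (self.std + self.eps) for v in self.features[index]]
```

Computing the statistics in the constructor keeps every item consistent with the same mean and standard deviation. The cost is a full pass over the data when the dataset is created.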

Each dataset implements a change_resolution method. Depending on the type of dataset, this method changes the mel spectrogram transform, the resize transforms, or both.
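The sketch below illustrates how change_resolution could differ between dataset types. The small dataclasses stand in for real transform objects, and every class, parameter, and function name other than change_resolution is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class MelSpecConfig:
    """Stand-in for a mel spectrogram transform (assumed parameter: n_mels)."""
    n_mels: int


@dataclass
class ResizeConfig:
    """Stand-in for a resize transform (assumed parameter: size)."""
    size: tuple


class RawAudioDataset:
    """Computes spectrograms on the fly, so resolution lives in the mel transform."""

    def __init__(self, n_mels=32):
        self.melspec = MelSpecConfig(n_mels=n_mels)

    def change_resolution(self, resolution):
        self.melspec = MelSpecConfig(n_mels=resolution)


class PrecomputedSpecDataset:
    """Loads stored spectrograms, so only the resize transform changes."""

    def __init__(self, size=(32, 32)):
        self.resize = ResizeConfig(size=size)

    def change_resolution(self, resolution):
        self.resize = ResizeConfig(size=(resolution, resolution))


def run_schedule(dataset, resolutions):
    """Apply each resolution in turn and record the order in which it was set."""
    applied = []
    for resolution in resolutions:
        dataset.change_resolution(resolution)
        # A training loop for this stage would run here.
        applied.append(resolution)
    return applied
```

Because both classes expose the same method, a progressive resizing loop such as run_schedule(dataset, [32, 64, 128]) does not need to know which kind of dataset it is driving.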

TODO

Instantiate transforms in base dataset
Length normalizer in load audio

fColangelo/audio-tagging-benchmark
