Deep Learning Project: Audio Tagging Task

This project is about tagging audio clips that are classified with one of 41 classes. The project implements a few classifiers. We present audio classifiers based on CNNs and VGGish architectures. Each network can take log-mel, mfcc, or chroma features as input. Predictions (over 41 classes) provided by classifiers are considered as their output.

Getting Started

Please follow these instructions to install all requirements and use the software correctly.

Requirements

The project is written in Python 3.8.1 programming language. Neural networks are implemented using PyTorch framework. LibROSA package for music and audio analysis is used to extract the features. We also make use scikit-learn package to compute evaluation measures of proposed classifiers.

Datasets must be used from Zenodo and placed into folders as described in hparams.yaml. Namely, folder Datasets must include directory FSDKaggle2018.audio_train with training audios, FSDKaggle2018.audio_test with test audios, FSDKaggle2018.meta with csv meta-files train_post_competition.csv and test_post_competition_scoring_clips.csv. We highly recommend keeping this structure.

Installing

To install all required modules of the appropriate version, please run the following command:

pip install -r requirements.txt

Running

python main.py train --model={VGGish,CNN} --features={log_mel,mfcc,chroma} --validate --manually_verified_only --shuffle --cuda --verbose

python main.py validation --model={VGGish,CNN} --features={log_mel,mfcc,chroma} --epoch=EPOCH --validated --manually_verified_only --shuffle --cuda --verbose

python main.py test --model={VGGish,CNN} --features={log_mel,mfcc,chroma} --epoch=EPOCH --validated --manually_verified_only --cuda --verbose

Plotting the results

python plot.py train --model={VGGish,CNN} --features={log_mel,mfcc,chroma} --validated --manually_verified_only --latex --verbose

python plot.py validation --model={VGGish,CNN} --features={log_mel,mfcc,chroma} --epoch=EPOCH --validated --manually_verified_only --latex --verbose

python plot.py test --model={VGGish,CNN} --features={log_mel,mfcc,chroma} --epoch=EPOCH --validated --manually_verified_only --latex --verbose

Use python {main,plot}.py {train,validation,test} -h[--help] for detail.

Built With

Authors

Roman Overko
Dmytro Mishagli
Xuesong Zhang

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
notebooks		notebooks
src		src
.gitignore		.gitignore
README.md		README.md
hparams.yaml		hparams.yaml
main.py		main.py
plot.py		plot.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Learning Project: Audio Tagging Task

Getting Started

Requirements

Installing

Running

Plotting the results

Built With

Authors

About

Releases

Packages

Languages

roman1e2f5p8s/audio_classification

Folders and files

Latest commit

History

Repository files navigation

Deep Learning Project: Audio Tagging Task

Getting Started

Requirements

Installing

Running

Plotting the results

Built With

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages