GitHub - kamel402/MaqamNet: A model that can predict the mqam type of a given audio file.

MaqamNet

MaqamNet is a model that can predict the mqam type of a given audio file.
The model trained on 4 types out of 8 existed types, which are Risat, Hijaz, Sika, and Ajam.

Data

Because I didn't find any Dataset of maqamat, I collected a few audio files from YouTube, and because of the privacy issue I can't share this Dataset, but you can make your own Data.

Model

9 1D conv layers and input sample size of 59049 (~3 seconds)

Procedures

Fix config.py file
Data processing
- run python audio_processor.py : audio (to read audio signal from mp3s and save as npy)
- run python annot_processor.py : annotation (process redundant tags and select top N=4 tags)
  - this will create and save train/valid/test annotation files
Training
- You can set multigpu option by listing all the available devices
- Ex. python main.py --gpus 0 1
- Ex. python main.py will use 1 gpu if available as a default

Tag prediction

run python eval_tags.py --gpus 0 1 --mp3_file "path/to/mp3file/to/predict.mp3"

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
__pycache__		__pycache__
model		model
README.md		README.md
__init__.py		__init__.py
annot_processor.py		annot_processor.py
audio_processor.py		audio_processor.py
config.py		config.py
data_loader.py		data_loader.py
eval_tags.py		eval_tags.py
main.py		main.py
model.py		model.py
solver.py		solver.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MaqamNet

Data

Model

Procedures

Tag prediction

References

About

Releases

Packages

Contributors 2

Languages

kamel402/MaqamNet

Folders and files

Latest commit

History

Repository files navigation

MaqamNet

Data

Model

Procedures

Tag prediction

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages