Skip to content

Beagle-Boys/Music-Genre-Classification

Repository files navigation

Methodology

Analyse

  • Waveform

    Waveform

  • FFT Spectrum

    FFT

  • Log Spectogram

    Log Spectogram

  • MFCC

    MFCC


Preprocess

  • divided each .wav file into 5 segments to increse training data.
  • exctracted mfcc for each segment.
  • stored the complete dataset in format :
    {
        "mapping" : ["classical","blues",...],
        "mfcc": [[[...],[...],...,[...]],...],
        "labels": [0,2,...]
    }

Simple ANN

  • Architecture

    Archiecture

  • Result

    Plots

  • Comments :

    • As we can see, the model is overfitting.

Simple ANN with Dropout and Kernel Regularization

  • Architecture

    Architecture

  • Result

    Plots

  • Comments :

    • The model is not overfitting anymore.

Convolutional Neural Network

  • Architecture

    Architecture

  • Result

    Plots

  • Comments :

    • CNN performs very well.
    • The training data had to be reshaped since CNN required 3 dimensional input.
    • Fastest

Recurrent Neural Network (LSTM)

  • Architecture

    Architecture

  • Result

    Plots

  • Comments :

    • LSTM performs very well.
    • Slowest

Recurrent Neural Network (GRU)

  • Architecture

    Architecture

  • Result

    Plots

  • Comments :

    • GRU performs very well. Better than LSTM.
    • Slowest (as slow as LSTM)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published