DNN-HMM-based-Acoustic-modelling-on-TIMIT-dataset

Speech recognition based on deep neural network/hidden markov model:

Extracted MFCC features from each frame of phoneme.
Perform the GMM/HMM based Viterbi algorithm.
Prepare unique HMM state IDs. Use this unique HMM state ID to convert the all state sequence obtained in the step 2.

DNN training:

Predict the most likely digit for each utterance by selecting the largest likelihood digit.

Compute the accuracy (# of correct digits / # of test utterances * 100) by using whole training data.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
MLP.py		MLP.py
README.md		README.md
Screenshot from 2022-11-08 19-18-45.png		Screenshot from 2022-11-08 19-18-45.png
Screenshot from 2022-11-08 20-04-09.png		Screenshot from 2022-11-08 20-04-09.png
Util.py		Util.py
data_loader.py		data_loader.py
dnn_hmm_Figure_1.png		dnn_hmm_Figure_1.png
feature_extraction.mkv		feature_extraction.mkv
feature_extraction.py		feature_extraction.py
hmm_dnn.py		hmm_dnn.py
kaldi_60_48_39.map		kaldi_60_48_39.map
main.py		main.py
mapping.py		mapping.py
plot_conf_mat.py		plot_conf_mat.py
requirement.txt		requirement.txt
specs.txt		specs.txt
sphere_to_wav_conversion.mkv		sphere_to_wav_conversion.mkv
test_mlp.ipynb		test_mlp.ipynb
test_mlp.py		test_mlp.py
train.ipynb		train.ipynb

Provide feedback