Neural Network Model

Neural Network Model
Convolutional Neural Network
RNNs and GRUs and Search
Language Modeling using RNNs
Automatic Differentiation
Frame Level Classification of Speech
Face Classification & Verification using Convolutional Neural Networks
Utterance to Phoneme Mapping
Attention-based End-to-End Speech-to-Text Deep Neural Network

Neural Network Model

MLP.MLP.mytorch

Convolutional Neural Network

CNN.CNN.mytorch

RNNs and GRUs and Search

RNNs and GRUs and Search

Language Modeling using RNNs

Automatic Differentiation

Autograd
A framework for computing the derivatives of arbitrarily complex functions built from differentiable operations, by applying the chain rule automatically.
- Forward accumulation: applies the chain rule from the inside out, starting at the inputs and propagating derivatives toward the output.
- Reverse accumulation: applies the chain rule from the outside in, starting at the output and propagating derivatives back toward the inputs.
An autograd framework records the sequence of operations applied to the input data up to the final loss calculation. It then runs backpropagation over that record and computes all the necessary gradients.
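The recording-then-backpropagating behaviour described above can be sketched in a few lines of plain Python. This is only an illustrative sketch of reverse accumulation on scalars, not the code used in this repository; the class name `Value` and its attributes are hypothetical.

```python
class Value:
    """A scalar that remembers how it was computed (hypothetical helper)."""

    def __init__(self, data, parents=(), local_grads=()):
        self.data = data
        self.grad = 0.0
        self._parents = parents          # Values this one was computed from
        self._local_grads = local_grads  # d(self)/d(parent), one per parent

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        return Value(self.data + other.data, (self, other), (1.0, 1.0))

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        return Value(self.data * other.data, (self, other),
                     (other.data, self.data))

    def backward(self):
        # Order the recorded graph so that every node is handled
        # only after all nodes that consume it.
        order, seen = [], set()

        def visit(node):
            if id(node) not in seen:
                seen.add(id(node))
                for parent in node._parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0  # d(loss)/d(loss)
        for node in reversed(order):
            # Chain rule, applied from the output back toward the inputs.
            for parent, local in zip(node._parents, node._local_grads):
                parent.grad += local * node.grad


x = Value(2.0)
y = Value(3.0)
loss = x * y + x * x   # loss = x*y + x^2
loss.backward()
print(loss.data, x.grad, y.grad)  # d(loss)/dx = y + 2x, d(loss)/dy = x
```

Each operation stores its inputs and local derivatives at the time it runs, so `backward()` needs no knowledge of the expression that built `loss`.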

Frame Level Classification of Speech

Data
- MFCC data consisting of 15 features at each time step/frame
Model
- MLP
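As a rough illustration of frame-level classification, the sketch below pushes each 15-feature frame through a small two-layer MLP and takes the arg-max class. The hidden width, the number of classes, and the random (untrained) weights are assumptions made only for this example; the notebook's actual architecture is not described here.

```python
import random

NUM_FEATURES = 15  # from the text: 15 MFCC features per frame
HIDDEN = 8         # assumption: hidden width chosen only for illustration
NUM_CLASSES = 4    # assumption: the class count is not stated in this section


def make_layer(n_in, n_out, rng):
    """Create a (weights, biases) pair with small random weights."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def linear(x, layer):
    weights, biases = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, biases)]


def relu(x):
    return [max(0.0, v) for v in x]


def classify_frame(frame, layer1, layer2):
    """Return the index of the highest-scoring class for one frame."""
    logits = linear(relu(linear(frame, layer1)), layer2)
    return max(range(len(logits)), key=logits.__getitem__)


rng = random.Random(0)
layer1 = make_layer(NUM_FEATURES, HIDDEN, rng)
layer2 = make_layer(HIDDEN, NUM_CLASSES, rng)

# A stand-in utterance: 5 frames of synthetic features, not real MFCCs.
utterance = [[rng.gauss(0.0, 1.0) for _ in range(NUM_FEATURES)]
             for _ in range(5)]
predictions = [classify_frame(frame, layer1, layer2) for frame in utterance]
print(predictions)  # one class index per frame
```

Every frame is classified independently, which is what makes the task frame-level.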

Face Classification & Verification using Convolutional Neural Networks

Data
- VGGFace2 dataset
Goal
- Classification: assign each image the correct identity out of 7000 identities
- Verification: match an image of an unknown identity to a known identity
Model
- CNN-based architectures: ResNet, ConvNeXt
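One common way to do verification is to compare feature embeddings with cosine similarity and pick the closest known identity. The sketch below assumes the embeddings have already been produced by a trained CNN; the function names, identity names, and three-dimensional vectors are hypothetical stand-ins, and the notebook may use a different similarity measure.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def verify(unknown, known):
    """Return the known identity most similar to `unknown`, with its score."""
    best = max(known, key=lambda name: cosine_similarity(unknown, known[name]))
    return best, cosine_similarity(unknown, known[best])


# Hypothetical embeddings; real ones would come from the CNN's feature layer.
known_embeddings = {
    "id_a": [1.0, 0.0, 0.0],
    "id_b": [0.0, 1.0, 0.0],
}
match, score = verify([0.9, 0.1, 0.0], known_embeddings)
print(match, round(score, 3))
```

In practice a threshold on the score is often used to decide that an image matches none of the known identities.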

Utterance to Phoneme Mapping

Data
- MFCC data consisting of 15 features at each time step/frame and 43 phoneme labels
Goal
- Build a seq-to-seq model that handles the lack of time synchrony between input frames and output phonemes
- Simplify the problem to a time-synchronous one by introducing a /BLANK/ symbol
Decoding: from probabilities to a phoneme sequence
- Greedy decoding
- Beam search decoding
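Greedy decoding can be sketched as follows: take the most probable symbol at each frame, collapse consecutive repeats, then drop the /BLANK/ symbol. The blank index, the label set, and the probabilities below are assumptions made for illustration only.

```python
BLANK = 0  # assumption: index 0 is reserved for the /BLANK/ symbol


def greedy_decode(probs, labels):
    """Collapse the per-frame arg-max path into a label sequence."""
    best_path = [max(range(len(frame)), key=frame.__getitem__)
                 for frame in probs]
    decoded, previous = [], None
    for index in best_path:
        # Keep a symbol only if it differs from the previous frame's symbol
        # and is not the blank.
        if index != previous and index != BLANK:
            decoded.append(labels[index])
        previous = index
    return decoded


labels = ["/BLANK/", "AH", "B"]  # hypothetical label set
probs = [
    [0.10, 0.80, 0.10],  # AH
    [0.20, 0.70, 0.10],  # AH (repeat, collapsed)
    [0.90, 0.05, 0.05],  # blank (separates the two AH symbols)
    [0.10, 0.70, 0.20],  # AH
    [0.10, 0.20, 0.70],  # B
]
result = greedy_decode(probs, labels)
print(result)  # ['AH', 'AH', 'B']
```

The blank frame is what lets the same phoneme appear twice in a row in the output.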
CTC: Connectionist Temporal Classification
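Under CTC, many frame-level paths collapse to the same output sequence, so the single most probable path does not always give the most probable sequence. A minimal sketch of prefix beam search, which sums the probability of all paths that collapse to the same prefix, is shown below. It omits language-model scoring and log-space arithmetic, and the function name and blank index are assumptions.

```python
from collections import defaultdict


def ctc_beam_search(probs, beam_width, blank=0):
    """Return (label indices, probability) of the best collapsed sequence."""
    # Each prefix tracks two probabilities: paths ending in blank,
    # and paths ending in a non-blank symbol.
    beams = {(): (1.0, 0.0)}
    for frame in probs:
        next_beams = defaultdict(lambda: [0.0, 0.0])
        for prefix, (p_blank, p_non_blank) in beams.items():
            for symbol, p in enumerate(frame):
                if symbol == blank:
                    # A blank leaves the prefix unchanged.
                    next_beams[prefix][0] += (p_blank + p_non_blank) * p
                elif prefix and symbol == prefix[-1]:
                    # A repeat right after the same symbol is collapsed.
                    next_beams[prefix][1] += p_non_blank * p
                    # A repeat after a blank extends the prefix.
                    next_beams[prefix + (symbol,)][1] += p_blank * p
                else:
                    next_beams[prefix + (symbol,)][1] += (p_blank + p_non_blank) * p
        # Keep only the most probable prefixes.
        ranked = sorted(next_beams.items(),
                        key=lambda item: sum(item[1]), reverse=True)
        beams = {prefix: tuple(p) for prefix, p in ranked[:beam_width]}
    best = max(beams, key=lambda prefix: sum(beams[prefix]))
    return list(best), sum(beams[best])


# Two frames, two symbols: index 0 is blank, index 1 is a phoneme.
probs = [[0.6, 0.4], [0.6, 0.4]]
sequence, probability = ctc_beam_search(probs, beam_width=2)
print(sequence, probability)
```

In this example the arg-max path is blank, blank (probability 0.36), yet the three paths that collapse to the single phoneme sum to 0.64, so beam search returns `[1]` where greedy decoding would return an empty sequence.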
Model
- RNN, LSTM, GRU

Attention-based End-to-End Speech-to-Text Deep Neural Network

Data

aytechin/deep-learning
