DeepNeuralNetwork

This repo is a collection of neural network implementations from scratch in Python. The purpose of this repo is to understand the inner workings of neural networks and to implement them from scratch.

The idea of building neural nets from scratch mainly came from George Hotz's tinygrad and Andrej Karpathy's micrograd (I've created nanograd while studying it and the associated lecture series.).

Unlike nanograd I won't copy any tutorial directly, but I will use them as a reference to build my own neural network implementations.

References

Educational Material

Book: Dive into Deep Learning
- https://d2l.ai
- My notes and exercise solutions:
  - https://github.com/Daniel-Sinkin/d2ld
Andrej Karpathy
- LeCun et al., 1989 Reproducing
  - https://github.com/karpathy/lecun1989-repro
- micrograd
  - https://github.com/karpathy/micrograd
- Neural Networks: Zero to Hero
  - https://youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ&si=tgDHSPy87EMe6sI
tinygrad
- https://github.com/tinygrad/tinygray
3Blue1Brown:
- https://www.youtube.com/playlist?list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi
Book: Introduction to Statistical Learning
- https://www.statlearning.com
- My notes and exercise solutions:
  - https://github.com/Daniel-Sinkin/statlearning
Book: All of Statistics, Wassermann 2004
- https://link.springer.com/book/10.1007/978-0-387-21736-9
- My notes and exercises solutions:
  - https://github.com/Daniel-Sinkin/Research-and-Development/tree/main/Books-Courses-Exercises/Wasserman-Statistics
  - Currently part of my general RnD Repo, should move this to a seperate Repo soon.
Coursera: Deep Learning Specialization
- https://www.coursera.org/specializations/deep-learning

Literature

Bengio et al., 2003
- A Neural Probabilistic Language Model
- https://www.jmlr.org/papers/volume3/bengio03a/bengio03a.pdf
Vaswani et al., 2017
- Attention is all you need
- The original transformer architecture paper
- https://arxiv.org/abs/1706.03762
Kingma & Ba, 2014
- The paper that introduced the Adam optimizer
- Adam: A Method for Stochastic Optimization
- https://arxiv.org/abs/1412.6980
Redford et al., 2019
- The original GPT2 paper
- Language Models are Unsupervised Multitask Learners
- Code and models:
  - https://github.com/openai/gpt-2
LeCun et al., 1989
- Backpropagation Applied to Handwritten Zip Code Recognition
- http://yann.lecun.com/exdb/publis/pdf/lecun-89e.pdf
Sennrich et al., 2015
- Neural Machine Translation of Rare Words with Subword Units
- https://arxiv.org/abs/1508.07909

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
scrapbook		scrapbook
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.ipynb		main.ipynb
something.ipynb		something.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DeepNeuralNetwork

References

Educational Material

Literature

About

Releases

Packages

Languages

License

Daniel-Sinkin/DeepNeuralNetwork

Folders and files

Latest commit

History

Repository files navigation

DeepNeuralNetwork

References

Educational Material

Literature

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages