In this project I have implemented the model architecture and the training pipeline from scratch. The end goal is a fully functional GPT-2 model that replicates the results of NanoGPT and integrates cleanly with Hugging Face and Lightning for future flexibility.
This repository is a work in progress, and I will be updating it as I go along.
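As a rough sketch of what the Hugging Face / Lightning integration could look like (hypothetical class and hyperparameter names, not the repository's actual code):

```python
# A minimal sketch of wrapping a Hugging Face GPT-2 in a LightningModule.
# Names and hyperparameters below are illustrative assumptions.
import torch
import lightning as L
from transformers import GPT2Config, GPT2LMHeadModel

class GPT2LitModule(L.LightningModule):
    def __init__(self, lr: float = 6e-4):
        super().__init__()
        # Config sized like the 124M-parameter GPT-2 that NanoGPT targets
        config = GPT2Config(n_layer=12, n_head=12, n_embd=768)
        self.model = GPT2LMHeadModel(config)
        self.lr = lr

    def training_step(self, batch, batch_idx):
        input_ids, labels = batch
        # GPT2LMHeadModel shifts labels internally and returns the LM loss
        out = self.model(input_ids=input_ids, labels=labels)
        self.log("train_loss", out.loss)
        return out.loss

    def configure_optimizers(self):
        return torch.optim.AdamW(self.parameters(), lr=self.lr)
```

With a wrapper like this, the same module can be trained by a Lightning `Trainer` while the underlying weights stay compatible with the Hugging Face ecosystem.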
- Progress list:
  - [x] Replicate NanoGPT on tiny_shakespeare
  - [ ] Introduce MoE code
  - [ ] Implement Llama 2
  - [ ] Implement Llama 3
- Resources and references: //TODO//