Skip to content

ldr7/language_model_from_scratch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 

Repository files navigation

nanoGPT

language generation - Shakespeare

The nanoGPT folder contains an implementation of GPT completly from scratch using just numpy & torch.

  1. The first use case explored here is language generation. Specifically, the tiny Shakespeare dataset. Ablation studies are carried out to determine the relative importance of different components of the Transformer architecture using the perfromance of the model.
  2. Text Generation using this model is compared against ChatGPT, GPT-2 & Falcon7B-Instruct. The GPT-2 model is fine tuned first on the Shakespeare dataset.

About

Build a language model from scratch.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published