Build your LLM step by step

Download Anaconda Prompt
in python testing mkdir fcc-gpt-course
in python testing python -m venv cuda
in python testing cuda\Scripts\activate
in python testing pip3 install matplotlib numpy pylzma ipykernel
in python testing pip3 install torch --index-url https://download.pytorch.org/whl/cu118
check python version 3.8~3.11 using python --version in python testing
in fcc-gpt-course type jupyter notebook
in fcc-gpt-course python -m ipykernel install --user --name=cuda --display-name "cuda-gpt"
type jupyter notebook in fcc-gpt-course
check the kernal section in jupyter notebook to see if cuda success
download large tar file with all the txt to learn from you can get the file free from https://skylion007.github.io/OpenWebTextCorpus/
in fcc-gpt-course\openwebtext run python data-extract.py
use data-extract.py to extract the smaller files from the large files don't forget to change the folder path
wait for a very very long time until it is done (1~2hours)
if eval loss is smaller than train loss it means that the neurons are nearly done
pip3 install torch
run python chatbot.py -batch_size 32 to start the whole thing
play around with block size and batch size to try and fill gpu's memory (try not to use RAM) block_size=input+max_new_tokens if the total size is greater than max_new_tokens than there will be an error so don't forget to increase block_size when input is very large

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bigram.ipynb		bigram.ipynb
chatbot.py		chatbot.py
gpt-v1.ipynb		gpt-v1.ipynb
torch-examples.ipynb		torch-examples.ipynb
training.py		training.py

Provide feedback