-
Download Anaconda Prompt
-
in python testing mkdir fcc-gpt-course
-
in python testing python -m venv cuda
-
in python testing cuda\Scripts\activate
-
in python testing pip3 install matplotlib numpy pylzma ipykernel
-
in python testing pip3 install torch --index-url https://download.pytorch.org/whl/cu118
-
check python version 3.8~3.11 using python --version in python testing
-
in fcc-gpt-course type jupyter notebook
-
in fcc-gpt-course python -m ipykernel install --user --name=cuda --display-name "cuda-gpt"
-
type jupyter notebook in fcc-gpt-course
-
check the kernal section in jupyter notebook to see if cuda success
-
download large tar file with all the txt to learn from you can get the file free from https://skylion007.github.io/OpenWebTextCorpus/
-
in fcc-gpt-course\openwebtext run python data-extract.py
-
use data-extract.py to extract the smaller files from the large files don't forget to change the folder path
-
wait for a very very long time until it is done (1~2hours)
-
if eval loss is smaller than train loss it means that the neurons are nearly done
-
pip3 install torch
-
run python chatbot.py -batch_size 32 to start the whole thing
-
play around with block size and batch size to try and fill gpu's memory (try not to use RAM) block_size=input+max_new_tokens if the total size is greater than max_new_tokens than there will be an error so don't forget to increase block_size when input is very large
-
Notifications
You must be signed in to change notification settings - Fork 0
from the project llms from scratch
License
Chen1098/LLMs
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
from the project llms from scratch
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published