
# Naked LLaMA

Build LLaMA inference from scratch, using only torch/numpy base ops.

Inspired by karpathy's awesome repo [nanoGPT](https://github.com/karpathy/nanoGPT), this project re-implements a simple and clear LLaMA model from scratch.
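To give a feel for what "only torch base ops" means, here is a minimal sketch of RMSNorm, the normalization layer LLaMA uses in place of LayerNorm, written with nothing but elementary tensor operations. The function name and shapes are illustrative, not taken from this repo's code.

```python
import torch

def rms_norm(x: torch.Tensor, weight: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    # Normalize each hidden vector by its root-mean-square, then
    # apply a learned per-channel scale (no mean subtraction, no bias).
    variance = x.pow(2).mean(-1, keepdim=True)
    x = x * torch.rsqrt(variance + eps)
    return weight * x

x = torch.randn(2, 4, 8)   # (batch, seq_len, hidden_dim)
w = torch.ones(8)          # learned scale, initialized to 1
y = rms_norm(x, w)
print(y.shape)             # torch.Size([2, 4, 8])
```

With `weight` all ones, the output of each hidden vector has RMS ≈ 1, which is the whole job of the layer.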

## install

```shell
pip install "torch>=2.1.0"

# transformers is used for converting model weights and comparing results
pip install "transformers>=4.35.2"
```

## execute & result

```shell
git clone https://github.com/silencelamb/naked_llama.git

# convert the Hugging Face model weights (default model_size is 7b)
python convert_hf_to_pkl.py

# run the forward pass (default model_size is 7b)
python naked_llama_forward.py

# run the 70b model
python naked_llama.py --model_size 70b
```
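Comparing against the Hugging Face implementation, as the `transformers` dependency suggests, boils down to running the same input through both models and checking the logits agree. Below is a hedged sketch of that check; `my_llama_forward` stands in for whatever forward function this repo exposes (its real name and signature may differ), and the comparison itself uses only standard `transformers` and `torch` APIs.

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

# Reference model from transformers (assumes weights are available locally
# or downloadable; the exact checkpoint name is an illustrative choice).
model_name = "meta-llama/Llama-2-7b-hf"
tokenizer = AutoTokenizer.from_pretrained(model_name)
ref_model = AutoModelForCausalLM.from_pretrained(model_name)
ref_model.eval()

input_ids = tokenizer("Hello, world", return_tensors="pt").input_ids

with torch.no_grad():
    ref_logits = ref_model(input_ids).logits          # (1, seq_len, vocab)
    my_logits = my_llama_forward(input_ids)           # hypothetical: this repo's forward pass

# Allow small numerical drift from different op orderings.
assert torch.allclose(ref_logits, my_logits, atol=1e-3)
```

A looser tolerance (or comparing `argmax` token predictions instead of raw logits) is often needed when the two implementations accumulate floating-point error differently.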

## references