Prerequisite: have python3 installed.
```bash
python3 -m venv venv        # create the venv directory
source venv/bin/activate    # enter the virtual environment
pip install llama-cpp-python \
  --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cpu
```

The `--extra-index-url` points at prebuilt CPU-only wheels; remove that part if you want to run on GPU (pip will then build from source, where GPU back ends can be enabled).
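To sanity-check the install, a quick optional smoke test:

```python
# Optional smoke test: confirms the llama-cpp-python wheel imports cleanly.
import llama_cpp
print(llama_cpp.__version__)
```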
- Download the 8.6G model file `llama-2-13b.Q5_K_M.gguf` and place it in the `models/` directory (see the download sketch below).
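If you prefer to script the download, here is a hedged sketch using `huggingface_hub`. The repo id `TheBloke/Llama-2-13B-GGUF` is an assumption (one public repo that hosts a file with this exact name), not something this project prescribes; substitute your actual source.

```python
# Hypothetical download helper; requires `pip install huggingface-hub`.
# TheBloke/Llama-2-13B-GGUF is an assumed source repo, not part of this project.
from huggingface_hub import hf_hub_download

hf_hub_download(
    repo_id="TheBloke/Llama-2-13B-GGUF",
    filename="llama-2-13b.Q5_K_M.gguf",
    local_dir="models",
)
```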
```bash
python3 main.py 2>error.log
```
NOTE: by default the model writes a lot of diagnostic output to STDERR. The `2>error.log` redirection filters that into a file you can inspect later. If you want to see all output, drop the redirection and just run `python3 main.py`.
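For reference, here is a minimal sketch of what a script like `main.py` typically does with this setup; the real `main.py` in this repo may differ, and the prompt text is made up. Passing `verbose=False` to `Llama` is another way to silence the STDERR logging.

```python
# Minimal sketch, not the repo's actual main.py.
from llama_cpp import Llama

# verbose=False suppresses the llama.cpp diagnostics that would
# otherwise go to STDERR (the same output 2>error.log captures).
llm = Llama(model_path="models/llama-2-13b.Q5_K_M.gguf", verbose=False)

out = llm("Q: What is the capital of France? A:", max_tokens=32)
print(out["choices"][0]["text"])
```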
The `llama_cpp_python` library used here also supports pulling models directly from HuggingFace (link to howto), which makes it easy to experiment with other models.
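For example, `Llama.from_pretrained` fetches a GGUF file straight from a Hugging Face repo on first use and caches it locally (it requires `huggingface-hub` to be installed); the repo and file below are just an illustration:

```python
from llama_cpp import Llama

# Downloads the GGUF from Hugging Face on first use and caches it.
llm = Llama.from_pretrained(
    repo_id="TheBloke/Llama-2-13B-GGUF",   # assumed example repo
    filename="llama-2-13b.Q5_K_M.gguf",
)
```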
To run the RAG-from-PDF part, set up its environment the same way:

```bash
python3 -m venv venv        # create the venv directory
source venv/bin/activate    # enter the virtual environment
pip install -r requirements.txt
```
- Create a `data_rag_ru` directory in this project.
- Put the PDF files you want to draw answers from into it.
```bash
python3 rag_from_pdf.py
```

Then ask your questions at the prompt.
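Since `requirements.txt` isn't shown here, the following is only a rough sketch of what a script like `rag_from_pdf.py` might do, assuming `pypdf`, `sentence-transformers`, and `numpy` as dependencies; the real script may use an entirely different stack.

```python
# Rough, hypothetical sketch only; the real rag_from_pdf.py may differ.
# Assumed dependencies: pypdf, sentence-transformers, numpy, llama-cpp-python.
from pathlib import Path

import numpy as np
from pypdf import PdfReader
from sentence_transformers import SentenceTransformer
from llama_cpp import Llama

# 1. Extract text from every PDF in data_rag_ru/ and split it into chunks.
chunks = []
for pdf in Path("data_rag_ru").glob("*.pdf"):
    text = "".join(page.extract_text() or "" for page in PdfReader(pdf).pages)
    chunks += [text[i:i + 1000] for i in range(0, len(text), 1000)]

# 2. Embed all chunks once, up front (vectors are L2-normalized, so the
#    dot product below equals cosine similarity).
embedder = SentenceTransformer("all-MiniLM-L6-v2")
chunk_vecs = embedder.encode(chunks, normalize_embeddings=True)

# 3. For each question, retrieve the closest chunks and ask the model.
llm = Llama(model_path="models/llama-2-13b.Q5_K_M.gguf", n_ctx=4096, verbose=False)
while True:
    question = input("Question> ")
    q_vec = embedder.encode([question], normalize_embeddings=True)[0]
    top = np.argsort(chunk_vecs @ q_vec)[-3:]  # indices of the 3 best chunks
    context = "\n---\n".join(chunks[i] for i in top)
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}\nAnswer:"
    print(llm(prompt, max_tokens=256)["choices"][0]["text"].strip())
```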
Cheers.