This repo contains the source code for deploying Llama2 behind a REST API endpoint using FastAPI and Docker.
The project is intended as a foundation for deploying LLMs with FastAPI in production.
The steps below assume you have Pipenv installed on your machine.
Follow the steps below:

- Clone this repository.
- Install the dependencies:

  ```shell
  pipenv install
  ```

- Copy `.env-sample` to a new `.env` file and fill in the details.
- Start the services with Docker:

  ```shell
  docker-compose up -d
  ```

- Send a prompt to the `/prompt` endpoint to get a response.
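Once the stack is up, you can call the `/prompt` endpoint from any HTTP client. The sketch below uses only the Python standard library; the request payload key (`"prompt"`), the response being JSON, and the port (`8000`) are assumptions, so check the repo's FastAPI route for the actual schema and the compose file for the published port.

```python
# Minimal client sketch for the /prompt endpoint (standard library only).
# Assumptions: the endpoint accepts a JSON body like {"prompt": "..."} and
# returns a JSON response -- verify against the actual FastAPI route.
import json
import urllib.request


def send_prompt(base_url: str, prompt: str) -> dict:
    """POST a prompt to the API and return the decoded JSON response."""
    payload = json.dumps({"prompt": prompt}).encode("utf-8")
    request = urllib.request.Request(
        f"{base_url}/prompt",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the containers running, `send_prompt("http://localhost:8000", "Tell me a joke.")` would return the decoded JSON response (assuming the API is published on port 8000).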