Building a decision-making agent via scene understanding with iteratively composed LLMs and Tree of Thoughts strategy generation and evaluation.
In this iteration, the use case is geared towards video games, but the project was inspired by robot planning and execution in physical, real-world scenarios.
This project takes a microservices approach where each microservice is isolated in its own Docker container while exposing FastAPI endpoints.
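For illustration, here is a minimal sketch of what one such endpoint could look like; the route, request model, and service hostname are hypothetical, not the repo's actual API.

```python
# Hypothetical sketch of one microservice endpoint; route, model, and service
# names are illustrative, not the repo's actual API.
import httpx
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class DecisionRequest(BaseModel):
    scene_description: str

@app.post("/decide")
async def decide(req: DecisionRequest) -> dict:
    # Services stay isolated in their own containers and talk to each other
    # over HTTP with httpx, so the only shared surface is the FastAPI contract.
    async with httpx.AsyncClient() as client:
        resp = await client.post(
            "http://model-server:8000/generate",  # assumed container hostname/port
            json={"prompt": f"Scene: {req.scene_description}. What should happen next?"},
        )
    return {"next_action": resp.json().get("text", "")}
```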
- 🌐 Model Server - hosts local Hugging Face models and also connects to model APIs, primarily via OpenRouter.ai, with an emphasis on interoperability between different models.
- 🎮🔍 Game State - uses a more capable LLM (Gemma 7B-it) to iteratively understand the scene with a quick and efficient VQA model (BLIP); see the first sketch after this list.
- ⚙️ Action Decision - the decision-making engine, powered by an LLM (Gemma 7B-it), that builds Tree of Thoughts reasoning with self-critique to determine the next step; see the second sketch after this list.
- 🖥️✨ Frontend - a simple front end built with NextJS/React to show the status of interpreting the game state and making a decision.
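The scene-understanding loop could look roughly like the sketch below: the stronger LLM proposes the next question to ask, the BLIP VQA model answers it against the current frame, and each answer is folded back into the running description. The service URL, endpoint paths, and payload shapes are assumptions, not the repo's actual contracts.

```python
# Sketch of iterative scene understanding: an LLM drives a VQA model with
# follow-up questions. Service URLs and payload shapes are assumptions.
import httpx

MODEL_SERVER = "http://model-server:8000"  # hypothetical internal hostname

def understand_scene(image_b64: str, rounds: int = 3) -> str:
    description = ""
    question = "What is happening in this scene?"
    with httpx.Client() as client:
        for _ in range(rounds):
            # BLIP answers a pointed question about the current frame.
            answer = client.post(
                f"{MODEL_SERVER}/vqa",
                json={"image": image_b64, "question": question},
            ).json()["answer"]
            description += f"Q: {question} A: {answer}\n"
            # The larger LLM (Gemma 7B-it via OpenRouter) proposes the next question.
            question = client.post(
                f"{MODEL_SERVER}/generate",
                json={"prompt": f"Known so far:\n{description}\nAsk one new question about the scene."},
            ).json()["text"]
    return description
```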
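And a rough sketch of the Tree of Thoughts decision step: the LLM proposes several candidate actions, critiques each one, and the best-scoring branch becomes the decision. Again, the prompts, endpoint, and scoring scheme are illustrative assumptions rather than the project's exact implementation.

```python
# Sketch of a single-depth Tree of Thoughts step with self-critique.
# Prompts, endpoint, and scoring are illustrative assumptions.
import httpx

MODEL_SERVER = "http://model-server:8000"  # hypothetical internal hostname

def generate(prompt: str) -> str:
    with httpx.Client() as client:
        return client.post(f"{MODEL_SERVER}/generate", json={"prompt": prompt}).json()["text"]

def decide_next_action(scene: str, n_branches: int = 3) -> str:
    # Branch: propose several candidate actions for the current scene.
    candidates = [
        generate(f"Scene: {scene}\nPropose one possible next action.")
        for _ in range(n_branches)
    ]

    # Evaluate: have the LLM critique each candidate and extract a 0-10 score.
    def score(action: str) -> float:
        critique = generate(
            f"Scene: {scene}\nProposed action: {action}\n"
            "Critique this action and end with 'Score: <0-10>'."
        )
        try:
            return float(critique.rsplit("Score:", 1)[1].strip().split()[0])
        except (IndexError, ValueError):
            return 0.0

    # Select: keep the highest-scoring branch as the decision.
    return max(candidates, key=score)
```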
- LLMs
  - BLIP VQA (hosted locally via HF)
  - GPT2 (hosted locally via HF)
  - Gemma 2b-it (hosted locally via HF)
  - Gemma 7b-it (through OpenRouter.ai)
- Langfuse: observability and prompt engineering
- Docker / Poetry / Python
- FastAPI / httpx
- NextJS 14 / React / TS / TailwindCSS
Special shoutout to Langfuse and OpenRouter.ai for working with me to update their products so this project could happen.
- Clone and navigate to the repository
- Create a `.env` file in the `shared` dir with the following entries (visit langfuse.com and openrouter.ai to sign up and generate keys):
HF_TOKEN_GEMMA=""
LANGFUSE_SECRET_KEY=""
LANGFUSE_PUBLIC_KEY=""
LANGFUSE_HOST="https://cloud.langfuse.com"
OPENROUTER_API_KEY=""
- Run `docker-compose up --build`
- Open a browser and head to `localhost:3000/dashboard` to begin
- Use Langfuse's new decorators (which this project helped inspire): https://github.com/orgs/langfuse/discussions/1009#discussioncomment-8682887 — see the sketch after this list
- Iteration on VQA model and prompts for more descriptive output
- Improved testing and error handling for APIs
- Add a production flag for Docker images and the front end
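For the first item, a minimal sketch of the decorator-based tracing, assuming the `observe` decorator from the Langfuse Python SDK described in the linked discussion; the traced function and its logic are hypothetical, and the client picks up the `LANGFUSE_*` keys from the `.env` entries above.

```python
# Minimal sketch of Langfuse decorator-based tracing; the traced function
# is hypothetical. Reads LANGFUSE_* keys from the environment.
from langfuse.decorators import observe

@observe()  # records this call as a trace in Langfuse
def interpret_game_state(frame_description: str) -> str:
    # ... call the Game State service here ...
    return f"Interpreted: {frame_description}"

if __name__ == "__main__":
    print(interpret_game_state("player near a locked door"))
```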