Stackfish is an open-source, LLM-powered pipeline that automatically solves competitive programming problems. It achieved the highest ranking in the final round of the 2024 Hacker Cup AI Closed Track, solving 2 of the 6 problems.
(See the demo of GPT-4o-mini solving a batch of problems. For harder problems, o1-mini is recommended.)
- Generate Extra Tests: The LLM reads the problem and creates additional sample tests to cover corner cases.
- Form a Hypothesis: The LLM suggests a verbal solution approach.
- RAG: The LLM retrieves relevant code snippets of advanced algorithms/data structures from a curated library.
- Coding: The LLM writes a C++ solution.
- QA & Retry: The solution is tested against sample tests. If it fails, the LLM revises until it works.
- Full Execution: Once tests pass, it’s run on the full input set.
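The steps above boil down to a hypothesis → code → QA & retry loop. The sketch below illustrates that control flow only — the function names (`proposeApproach`, `writeSolution`, `runSampleTests`) and their mocked behavior are assumptions for illustration, not Stackfish's actual API:

```typescript
// Illustrative sketch of the QA & retry loop described above.
// All names and mock behaviors here are assumptions, not Stackfish's real API.

type TestCase = { input: string; expected: string };

// Mock LLM calls: a real pipeline would prompt the configured model instead.
function proposeApproach(problem: string): string {
  return `verbal approach for ${problem}`;
}
function writeSolution(_approach: string, feedback?: string): string {
  // Pretend the first attempt is buggy and the revised attempt (with feedback) passes.
  return feedback === undefined ? "buggy-solution" : "fixed-solution";
}
function runSampleTests(solution: string, _tests: TestCase[]): boolean {
  return solution === "fixed-solution"; // mock judge
}

// QA & retry: keep revising until the sample tests pass or the budget runs out.
function solve(problem: string, tests: TestCase[], maxRetries = 3): string | null {
  const approach = proposeApproach(problem);
  let feedback: string | undefined;
  for (let attempt = 0; attempt < maxRetries; attempt++) {
    const solution = writeSolution(approach, feedback);
    if (runSampleTests(solution, tests)) return solution; // passed QA: ready for full input
    feedback = `attempt ${attempt + 1} failed on sample tests`; // feed failure into next try
  }
  return null; // exhausted the retry budget
}
```

Under these mocks, `solve("Problem A", [])` fails once, revises with feedback, and returns the fixed solution on the second attempt.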
- Front-end: Next.js app to manage parallel agents and monitor progress.
- LLMs: OpenAI models (e.g. GPT-4o-mini), or Llama 3.3 / Qwen 32B via Together.ai.
- Compute & Testing: Google Cloud Run to safely run and validate solutions at scale.
- `./www`: Next.js control panel/front-end.
- `./cloud-run-worker`: Container for Google Cloud Run to execute code in the cloud.
- `./PROBLEMS`: Problems in Hacker Cup format (`statement.txt`, `sample_in.txt`, `sample_out.txt`, `full_in.txt`).
- `./SOLUTIONS`: Automatically generated solutions are placed here.
- `./www/app/services/algo_rag_data`: Implementations of ~200 advanced algorithms/data structures, collected from various sources.
- **Set Keys:** Add `OPENAI_API_KEY` or `TOGETHER_API_KEY` in `www/config.env`.
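  For example, `www/config.env` might look like this (placeholder values shown — substitute your real keys):

  ```shell
  # www/config.env — set whichever key matches your provider
  OPENAI_API_KEY=sk-...
  TOGETHER_API_KEY=...
  ```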
- **Problems Setup:** Put your Hacker Cup-format problems into `./PROBLEMS/` (see examples).
- **Run Locally:**

  ```shell
  cd www
  npm install
  npm run dev
  ```

  Then open http://localhost:3000.
- **Launch Agents:**
  - Select a problem from the list
  - Click "Let's go!" to start
  - Multiple agents will work in parallel to solve it
- **Models & Config:**
  - Default model: GPT-4o-mini
  - See `www/app/config.ts` to:
    - Switch between different LLM models
    - Adjust agent settings and parameters
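  As a rough sketch, the options in `www/app/config.ts` might look something like this — the field names and values below are illustrative assumptions, not the actual file:

  ```typescript
  // Illustrative sketch only — consult www/app/config.ts for the real fields.
  export const config = {
    model: "gpt-4o-mini", // default model (per the README)
    provider: "openai",   // or "together" for Llama 3.3 / Qwen 32B
    parallelAgents: 4,    // hypothetical: agents launched per problem
    maxRetries: 3,        // hypothetical: QA & retry budget per agent
  };
  ```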
- **Scale with Cloud Run:**
  - For higher rate limits and better scaling, deploy your own worker
  - Follow the setup guide in `./cloud-run-worker/README.md`
Enjoy, and happy hacking! 🐟