📣 New: Meet mini
, the 100 line AI agent that still gets 65% on SWE-bench verified!
This organization contains the source code for several projects in the SWE-* open source ecosystem, including:
- SWE-bench, a benchmark for evaluating AI systems on real world GitHub issues.
- SWE-agent, a system that automatically solves GitHub issues using an LM agent.
- SWE-smith, a toolkit for generating SWE training data at scale.
- mini, an AI agent written in just 100 lines of code that scores 65% on SWE-bench verified
Also check out the supporting infrastructure for working with SWE-* projects