GitHub - OpenPipe/art-notebooks: Notebooks to demonstrate ART (Agent Reinforcement Trainer) in practice!

📒 Notebooks

Agent Task	Example Notebook	Description	Comparative Performance
ART•E [Serverless]	🏋️ Train agent	Qwen 3 14B learns to search emails using RULER	benchmarks
2048	🏋️ Train agent	Qwen 3 14B learns to play 2048	benchmarks
ART•E LangGraph	🏋️ Train agent	Qwen 2.5 7B learns to search emails using LangGraph	[Link coming soon]
MCP•RL	🏋️ Train agent	Qwen 2.5 3B masters the NWS MCP server	[Link coming soon]
Temporal Clue	🏋️ Train agent	Qwen 2.5 7B learns to solve Temporal Clue	[Link coming soon]
Tic Tac Toe	🏋️ Train agent	Qwen 2.5 3B learns to play Tic Tac Toe	benchmarks
Codenames	🏋️ Train agent	Qwen 2.5 3B learns to play Codenames	benchmarks
AutoRL [RULER]	🏋️ Train agent	Train Qwen 2.5 7B to master any task	[Link coming soon]

🧩 Supported Models

ART should work with most causal language models that are compatible with vLLM and Hugging Face Transformers, or at least with those supported by Unsloth. Gemma 3 does not appear to be supported for the time being. If any other model isn't working for you, please let us know on Discord or open an issue on GitHub!
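
As a rough pre-check before trying a new model, the two statements above (causal language model, not Gemma 3) can be tested against the model's Hugging Face `config.json`. The sketch below is not part of ART: the function name `looks_supported` is hypothetical, and the heuristic (an architecture name ending in `ForCausalLM` and a `model_type` that does not start with `gemma3`) is an assumption for illustration. It does not check vLLM or Unsloth support.

```python
import json


def looks_supported(config: dict) -> bool:
    """Heuristic only (hypothetical helper, not an ART API).

    Returns True when the config declares a causal-LM architecture
    and the model type is not a Gemma 3 variant.
    """
    architectures = config.get("architectures") or []
    model_type = str(config.get("model_type", "")).lower()

    # Hugging Face causal language model classes end in "ForCausalLM".
    is_causal_lm = any(name.endswith("ForCausalLM") for name in architectures)
    # Assumption: Gemma 3 configs use a model_type starting with "gemma3".
    is_gemma3 = model_type.startswith("gemma3")

    return is_causal_lm and not is_gemma3


# Trimmed-down example of the fields found in a model's config.json.
qwen_config = json.loads(
    '{"architectures": ["Qwen2ForCausalLM"], "model_type": "qwen2"}'
)
print(looks_supported(qwen_config))
```

A passing check only means the model is worth trying; actually running one of the notebooks with it remains the real test.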

🤝 Contributing

ART is in active development, and contributions are most welcome! Please see the CONTRIBUTING.md file for more information.

⚖️ License

This repository's source code is available under the Apache-2.0 License.

🙏 Credits

ART stands on the shoulders of giants. While we owe many of the ideas and early experiments that led to ART's development to the open source RL community at large, we're especially grateful to the authors of the following projects:

Finally, thank you to our partners who've helped us test ART in the wild! We're excited to see what you all build with it.

Folders and files (76 commits)

assets/benchmarks
examples
licenses
scripts
.env.example
.gitignore
.python-version
LICENSE
README.md
THIRD-PARTY-NOTICES
main.py
pyproject.toml
skypilot-config.yaml
uv.lock
