Skip to content

Conversation

@thwu1
Copy link
Contributor

@thwu1 thwu1 commented Nov 2, 2025

This pull request introduces a new example for training math agents using the Tinker, with a focus on matching the behavior and configuration of the original tinker-cookbook rl_loop recipe.

Wandb report compare the rl_loop and rllm tinker intergration.

@jeffreysijuntan jeffreysijuntan marked this pull request as ready for review November 3, 2025 18:42
@jeffreysijuntan jeffreysijuntan changed the title Tinker Implementation [feature] Support Tinker as a backend Nov 3, 2025
@jeffreysijuntan jeffreysijuntan merged commit 8e51f12 into rllm-org:nightly Nov 3, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants