OLMo 2 Retrofit #895

mnoukhov · 2025-08-14T20:33:42Z

No description provided.

…trofit

* Add Claude Code GitHub Workflow (#959) * "Claude PR Assistant workflow" * "Claude Code Review workflow" * Fix DPO script (#956) * minor fix * minor fix * lint * update debug script * update debug script --------- Co-authored-by: Finbarr Timbers <[email protected]> Co-authored-by: Hamish Ivison <[email protected]>

claude · 2025-08-28T22:57:26Z

PR Review: OLMo 2 Retrofit

I've reviewed this pull request that retrofits the codebase for OLMo 2.5 support. Here's my analysis:

✅ Positive Changes

Proper Flash Attention Migration: The change from use_flash_attention_2=True/False to attn_implementation="flash_attention_2"/"eager" in dpo_tune_cache.py:626 follows the newer transformers API pattern and is consistent with other files in the codebase.
Import Cleanup: Moving from setup_logger to logger_utils.setup_logger() in dpo_tune_cache.py:527 improves code organization.
GitHub Workflows: The added Claude Code review workflows (.github/workflows/claude-code-review.yml and .github/workflows/claude.yml) will provide automated code review capabilities.

⚠️ Areas of Concern

Dependency Management Issues

External Git Dependencies:
- transformers @ git+https://github.com/2015aroras/transformers.git@shanea/olmo2-retrofit (pyproject.toml:26)
- Custom vLLM fork cloned in Dockerfile:68 and Makefile:21
Risk: External forks may become stale, unsupported, or introduce security vulnerabilities. Consider:
- Documenting the specific changes needed from these forks
- Creating a plan to migrate back to official releases
- Regular security audits of these dependencies
Pinned Ray Version: Changed from >=2.44.1 to ==2.46.0 (pyproject.toml:22) - this is good for reproducibility but ensure compatibility testing.

Configuration & Deployment

Hard-coded Paths: scripts/train/rlvr/grpo_olmo25.sh contains hard-coded paths like /weka/oe-training-default/ai2-llm/checkpoints/... (line 19). Consider using environment variables or configuration files.
Docker Build Dependencies: The Dockerfile clones a git repository during build (line 68), which could fail if the repository becomes unavailable during builds.

Code Quality

Missing Error Handling: The GRPO script doesn't include error handling for the git clone operations or path validations.
Resource Configuration: The training script specifies very high resource requirements (4 nodes, 24 vLLM engines) without validation or fallback options.

🔧 Recommendations

Dependency Security:
- Add version pinning for the git dependencies
- Consider using commit SHAs instead of branch names for better reproducibility
- Document the specific patches/changes needed
Configuration Management: Replace hard-coded paths with environment variables
Error Handling: Add validation for critical paths and dependencies in scripts.
Documentation: Consider adding a README section explaining the OLMo 2.5 retrofit changes and setup requirements.

🧪 Test Coverage

The PR doesn't include new tests for the OLMo 2.5 specific functionality. Consider adding:

Integration tests for the new model loading paths
Validation tests for the attention implementation changes

🔒 Security

The GitHub workflow tokens and Docker operations appear secure
No hardcoded secrets detected
External dependencies should be monitored for security updates

Overall Assessment

This is a functional retrofit for OLMo 2.5 support, but the heavy reliance on external forks and hard-coded configurations presents maintenance challenges. The code changes themselves are technically sound and follow established patterns in the codebase.

Recommendation: ✅ Approve with conditions - address dependency management and add error handling before merging to production environments.

…trofit

try to avoid concurrency issues with them

…trofit

additional_env_vars is a List[Dict] not a Dict so we need to extract the "name" field to check if its in the list

…trofit

docker image doesn't have jq so get username without it update comment to remove reference to removed flag --oe_eval_log_to_wandb

cursor · 2025-09-26T05:44:16Z

scripts/eval/oe-eval.sh

 WANDB_ARG=""
 if [[ -n "$WANDB_RUN_PATH" ]]; then
-    beaker_user=$(beaker account whoami --format json | jq -r '.[0].name')
+    beaker_user=$(beaker account whoami --format text | awk 'NR==2 {print $2}')


Bug: Username Extraction Reliability Issue

The beaker username extraction switched from robust JSON parsing (jq) to fragile text parsing (awk). This new approach relies on a specific text output format (line 2, column 2), making it vulnerable to breaking if the beaker account whoami command's output changes.

cursor · 2025-09-26T05:44:17Z

scripts/train/rlvr/grpo_olmo25_debug.sh

+    --gpus ${NUM_GPUS} \
+    --budget ai2/oe-adapt \
+    -- \
+source configs/beaker_configs/ray_node_setup.sh \&\& \


Bug: Shell Script Command Chaining Error

The shell commands use \&\& instead of &&. The backslash escapes the ampersand, which prevents proper command chaining and will cause the script to fail.

Additional Locations (1)

scripts/train/rlvr/grpo_olmo25.sh#L37-L38

all changes from olmo3 but for olmo2.5

77f91c3

mnoukhov force-pushed the olmo2-retrofit branch from ea52adf to 77f91c3 Compare August 14, 2025 20:44

mnoukhov and others added 26 commits August 14, 2025 20:44

example script

13c057b

fix path and uv lock

ee61222

olmo2 retrofit naming

1423264

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

ed2ec83

…trofit

updated script

a663287

makefile delete old image

abe3902

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

7d74b69

…trofit

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

e5002cc

…trofit

resumable

8dcf71c

fix for 4 nodes maybe

2ea2e37

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

f2d6e97

…trofit

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

93b88e2

…trofit

revert change, 3 - 1 node still not working

667963b

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

e2925ec

…trofit

custom vllm in pyproject no need to clone

18e3f7c

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

5820756

…trofit

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

512651d

…trofit

vllm is extra dependency

ebcac11

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

6cbafa3

…trofit

make vllm a dependency either way but do local vllm as extra

1e5e1f9

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

24c8dc8

…trofit

back to basics, make setup to git clone

77048d2

editable

cb87b45

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

d0b6bfc

…trofit

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

6cac122

…trofit

debug script

3ed8657

mnoukhov added 15 commits September 4, 2025 17:25

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

20354e3

…trofit

synchronous weight sync

ef9e855

start generate thread trigger event

bd28584

single weight sync and generate thread

e43aa8e

try to avoid concurrency issues with them

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

5aef3bd

…trofit

sync weight sync

1bea281

cleanup

ce3fec0

fix env var check

ee243ef

additional_env_vars is a List[Dict] not a Dict so we need to extract the "name" field to check if its in the list

temporary logging

755ac15

disable log stats

782ac53

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

ad89b37

…trofit

fix lock file and revert extra logging

54ed043

un-revert weight sync

d9dd800

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

54c9a39

…trofit

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

551f58c

…trofit

This comment was marked as outdated.

Sign in to view

good r1-zero script and olmo simple thinker template

4b3dd54

This comment was marked as outdated.

Sign in to view

mnoukhov added 6 commits September 14, 2025 22:47

2 epochs

3f21704

deepseek evals

5455f64

shorter run

f188425

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

7177b0a

…trofit

filtering vllm top p

e00ef62

fix copy since we need the folder

cfc9b8d

This comment was marked as outdated.

Sign in to view

mnoukhov added 2 commits September 25, 2025 21:44

better defaults

4e0082d

Merge branch 'main' of github.com:allenai/open-instruct into olmo2-re…

04ee33d

…trofit

This comment was marked as outdated.

Sign in to view

fix beaker whoami and update warning message

7e564d1

docker image doesn't have jq so get username without it update comment to remove reference to removed flag --oe_eval_log_to_wandb

cursor bot reviewed Sep 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

OLMo 2 Retrofit #895

OLMo 2 Retrofit #895

Uh oh!

mnoukhov commented Aug 14, 2025

Uh oh!

claude bot commented Aug 28, 2025

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

cursor bot Sep 26, 2025

Uh oh!

cursor bot Sep 26, 2025

Uh oh!

Uh oh!

OLMo 2 Retrofit #895

Are you sure you want to change the base?

OLMo 2 Retrofit #895

Uh oh!

Conversation

mnoukhov commented Aug 14, 2025

Uh oh!

claude bot commented Aug 28, 2025

PR Review: OLMo 2 Retrofit

✅ Positive Changes

⚠️ Areas of Concern

Dependency Management Issues

Configuration & Deployment

Code Quality

🔧 Recommendations

🧪 Test Coverage

🔒 Security

Overall Assessment

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

cursor bot Sep 26, 2025

Choose a reason for hiding this comment

Bug: Username Extraction Reliability Issue

Uh oh!

cursor bot Sep 26, 2025

Choose a reason for hiding this comment

Bug: Shell Script Command Chaining Error

Uh oh!

Uh oh!