Pixels-Based Sim2Real Demo for Aloha Peg Insertion #76

Andrew-Luo1 · 2025-03-07T23:52:50Z

A demo of using Madrona MJX as a "real-world adapter". A standard playground-style state-based peg-insertion policy is first trained in sim. Madrona MJX is then used to distill this policy into a deployable pixel-based policy in about 3 minutes.

Note that this PR relies on a concurrent Brax PR that implements Online Dagger for behaviour cloning.
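For readers unfamiliar with the idea, here is a toy sketch of online DAgger-style distillation. It is not the Brax implementation (which is not shown in this thread): the scalar dynamics, the linear student, and the names `teacher_action`, `rollout`, and `train_student` are all made up for illustration. The point is only the loop structure: the student drives the rollout, the teacher labels the visited states, and the student is regressed onto those labels.

```python
def teacher_action(state):
  # Hypothetical stand-in for the state-based teacher: a proportional controller.
  return -2.0 * state


def rollout(weight, initial_state, num_steps=5, dt=0.1):
  """Rolls out the student and records (observation, teacher action) pairs."""
  state, pairs = initial_state, []
  for _ in range(num_steps):
    obs = state  # Stand-in for a rendered image of the scene.
    pairs.append((obs, teacher_action(state)))
    state = state + dt * weight * obs  # The student's action drives the system.
  return pairs


def train_student(num_iterations=50, learning_rate=1.0):
  weight = 0.0  # Student policy: action = weight * obs.
  for _ in range(num_iterations):
    # Collect fresh data under the current student (the "online" part).
    data = []
    for x0 in (-1.0, -0.5, 0.5, 1.0):
      data.extend(rollout(weight, x0))
    # One gradient step on the mean squared error to the teacher's actions.
    grad = sum((weight * obs - act) * obs for obs, act in data) / len(data)
    weight -= learning_rate * grad
  return weight


student_weight = train_student()
```

In this toy setting the student's weight approaches the teacher's gain of -2.0, because the teacher keeps correcting the student on the states the student itself visits.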

Please see the technical report for more details on the Aloha Peg Insertion task.

Both phases of the teacher policy show stable training.

The distillation to the student is also stable. (2.5e6 samples correspond to 3m30s of wall-clock time.)
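As a quick sanity check on those numbers, the implied throughput can be worked out directly. This uses only the two figures quoted above and assumes the 3m30s covers the full 2.5e6 samples:

```python
samples = 2.5e6
wall_clock_seconds = 3 * 60 + 30  # 3m30s
samples_per_second = samples / wall_clock_seconds
print(f"{samples_per_second:.0f} samples/s")
```

That works out to roughly 11,900 samples per second.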

Looking forward to incorporating any feedback!

btaba

Hey Andrew, this is really awesome. First two high-level comments are: [1] move/rename s2r to follow the same pattern we have elsewhere in the repo (potentially nix aloha single_peg that exists currently, since this is a superior version for the teacher policy), and [2] nix the param files, or convert the visionMLP to orbax (it's small enough that I think it's OK to include in git history).
Really excited to help get this checked in!

btaba

Thanks @Andrew-Luo1 ! LGTM modulo the use of pickles and small nits. I'm happy to make the pkl change once I get a good GPU again and can run this locally (for now I'm a bit bottlenecked on the hardware side). If you get to making the changes, we can merge this in.

btaba · 2025-04-11T05:10:29Z

mujoco_playground/_src/manipulation/aloha/s2r/depth_noise.py

+
+
+def apply_line_noise(img, line_noise):
+  return _or_reduce(jp.stack([img, line_noise]), axis=0)


could this just be jp.where(line_noise != 0, line_noise, img) ?

@Andrew-Luo1 same question as before

Yes nice catch
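A minimal sketch of the suggested simplification follows. The definition of `_or_reduce` is not shown in this thread, so the sketch assumes that `line_noise` is zero wherever no line is drawn and that line pixels should overwrite the image. The sample arrays are made up for illustration.

```python
try:
  import jax.numpy as jp
except ImportError:  # NumPy fallback so the sketch runs without JAX installed.
  import numpy as jp


def apply_line_noise(img, line_noise):
  # Where the noise is non-zero, take the noise value; otherwise keep img.
  return jp.where(line_noise != 0, line_noise, img)


# Illustrative inputs: one noisy pixel in the first row.
img = jp.array([[0.5, 0.5, 0.5], [0.25, 0.25, 0.25]])
line_noise = jp.array([[0.0, 1.0, 0.0], [0.0, 0.0, 0.0]])
noisy = apply_line_noise(img, line_noise)
```

This avoids stacking the two arrays and reducing over the new axis, and `jp.where` stays jit-friendly because the output shape does not depend on the data.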

btaba · 2025-04-23T23:04:45Z

learning/train_jax_ppo.py

@@ -361,6 +355,12 @@ def progress(num_steps, metrics):
    print(f"Time to JIT compile: {times[1] - times[0]}")
    print(f"Time to train: {times[-1] - times[1]}")

+  if _SAVE_PARAMS_PATH.value is not None:
+    model.save_params(epath.Path(_SAVE_PARAMS_PATH.value).resolve(), params)


@Andrew-Luo1 would really like to not use the pkl stuff, is this absolutely necessary?

btaba · 2025-04-23T23:11:34Z

mujoco_playground/_src/manipulation/aloha/base.py

@@ -37,6 +37,9 @@ def get_assets() -> Dict[str, bytes]:
  path = mjx_env.ROOT_PATH / "manipulation" / "aloha" / "xmls"
  mjx_env.update_assets(assets, path, "*.xml")
  mjx_env.update_assets(assets, path / "assets")
+  path = mjx_env.ROOT_PATH / "manipulation" / "aloha" / "xmls" / "s2r"


no longer needed FWIU

btaba · 2025-04-23T23:17:19Z

mujoco_playground/_src/manipulation/aloha/distillation.py

+
+  f_pick_teacher = pathlib.Path(__file__).parent / 'params' / 'AlohaPick.prms'
+  f_insert_teacher = (
+      pathlib.Path(__file__).parent / 'params' / 'AlohaPegInsertion.prms'


I guess these were removed and not converted to orbax? Nbd, I would still merge, but at least add a comment or more informative error
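One possible shape for the more informative error is sketched below. The helper name `require_teacher_params` and the message wording are hypothetical, and the sketch assumes the teacher parameter files are not shipped with the repository, as the comment above suggests.

```python
import pathlib


def require_teacher_params(path: pathlib.Path) -> pathlib.Path:
  """Returns `path` if it exists, else raises an error explaining the fix."""
  if not path.exists():
    raise FileNotFoundError(
        f"Teacher parameters not found at {path}. These files may not be"
        " checked into the repository; train the teacher policy first and"
        " save its parameters to this location."
    )
  return path
```

In `distillation.py` this could wrap each path, for example `require_teacher_params(f_pick_teacher)`, so a missing file fails early with instructions instead of a bare load error.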

btaba · 2025-04-23T23:18:41Z

mujoco_playground/_src/manipulation/aloha/peg_insertion.py

+import pathlib
+from typing import Any, Dict, Optional, Tuple, Union
+
+from brax.io import model as brax_loader


would really like brax.io.model nixed from this PR if it can be redone using the existing checkpointing mechanism

btaba · 2025-04-23T23:22:10Z

mujoco_playground/_src/manipulation/aloha/xmls/mjx_aloha_single.xml

Naming is getting a bit confusing. "single" is overloaded to mean single peg and single arm

Maybe name it "mjx_aloha_single_arm.xml" and similar

maybe mjx_half_aloha and mjx_half_scene? Fine with the single_arm convention too

SGTM for half!

Andrew-Luo1 · 2025-06-09T01:58:05Z

Apologies for the very late update - life's been hectic. It'd be great to get another review!

Andrew-Luo1 added 9 commits March 16, 2025 17:29

aloha sim to real code first pass

60cc7df

Update README.md

d390cf8

code formatting

bf98c98

Update README.md

e7ebcd5

Update README.md

6782db2

pass code quality checks

4dd6616

update brax dagger api

7bfd533

clean up domain randomization

2cce751

clean up train_dagger.py

b4ae622

Andrew-Luo1 force-pushed the main branch from 2be5bc5 to b4ae622 Compare March 16, 2025 21:31

Andrew-Luo1 added 2 commits March 16, 2025 22:29

Update README.md

bc3bfb0

Revert README.md

c534d57

btaba requested changes Apr 10, 2025

Andrew-Luo1 added 5 commits April 14, 2025 11:21

update visionmlp to use orbax checkpointing, update for compat with new brax

ccace15

merge the s2r into the main aloha folder

25d8db7

Merge branch 'main' of https://github.com/Andrew-Luo1/mujoco_playground_new

ec030db

add frozen encoder orbax checkpoint

6c31bc2

Update README.md

aea048c

btaba requested changes Apr 23, 2025

Andrew-Luo1 added 3 commits April 23, 2025 20:47

remove unnecessary helper function, rename files

9b3d755

remove learning/train_jax_ppo.py dependency on brax.io.model

d40eb3f

everything working with orbax checkpointing

fb34541
