Add gym make support for Meta-World envs #498

reginald-mclean · 2024-08-29T20:38:11Z

This PR is built on top of #499 to add gym.make support for the environments in Meta-World. They are organized as follows:

gym.make('Meta-World/env-name'): This creates a single environment with a single goal.
gym.make('Meta-World/MT1-env-name'): This creates a version of the env-name environment with multiple goals but no testing goals. It is typically used to test how well an RL algorithm can acquire skills in a goal-conditioned fashion.
gym.make('Meta-World/ML1-train-env-name') or gym.make('Meta-World/ML1-test-env-name'): These create the training or testing environment for meta-learning problems with a single environment.

And there are also gym.make_vec commands that return multiple environments wrapped in a sync or async wrapper:

gym.make_vec('Meta-World/MT10-sync') or gym.make_vec('Meta-World/MT10-async'): This returns the MT10 set of environments in either sync or async mode.
gym.make_vec('Meta-World/ML10-train-sync') or gym.make_vec('Meta-World/ML10-train-async'): This returns the ML10 set of training environments in either sync or async mode.
gym.make_vec('Meta-World/ML10-test-sync') or gym.make_vec('Meta-World/ML10-test-async'): This returns the ML10 set of testing environments in either sync or async mode.
gym.make_vec('Meta-World/ML45-train-sync') or gym.make_vec('Meta-World/ML45-train-async'): This returns the ML45 set of training environments in either sync or async mode.
gym.make_vec('Meta-World/ML45-test-sync') or gym.make_vec('Meta-World/ML45-test-async'): This returns the ML45 set of testing environments in either sync or async mode.
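
The naming scheme described in this comment can be summarised in a short sketch. The helpers `single_env_ids` and `benchmark_vec_ids` are hypothetical and not part of Meta-World; they only compose the ID strings as listed here and never call Gymnasium, so the result says nothing about which IDs are actually registered.

```python
from __future__ import annotations


def single_env_ids(env_name: str) -> dict[str, str]:
    """Compose the gym.make IDs described above for one environment name."""
    return {
        "single_goal": f"Meta-World/{env_name}",
        "mt1": f"Meta-World/MT1-{env_name}",
        "ml1_train": f"Meta-World/ML1-train-{env_name}",
        "ml1_test": f"Meta-World/ML1-test-{env_name}",
    }


def benchmark_vec_ids() -> list[str]:
    """Compose the gym.make_vec IDs described above, sync and async."""
    stems = ["MT10", "ML10-train", "ML10-test", "ML45-train", "ML45-test"]
    modes = ("sync", "async")
    return [f"Meta-World/{stem}-{mode}" for stem in stems for mode in modes]


if __name__ == "__main__":
    for kind, env_id in single_env_ids("reach-v3").items():
        print(kind, env_id)
    for env_id in benchmark_vec_ids():
        print(env_id)
```

Each string would then be passed to `gym.make` or `gym.make_vec` respectively, assuming the registrations in this PR are in place.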

rainx0r · 2024-08-30T10:52:04Z

Few things I noticed:

It's not entirely clear to me why _make_single_env() has ML1-related code when _make_single_ml() exists.
For MT envs, they should also have an AutoTerminateOnSuccess wrapper but it should be toggled to the initial state specified by the terminate_on_success flag as done here.
use_one_hot is set to False for all registered envs but it should be True for MT10 and MT50.
The way seed is added to the TaskSelect wrappers could cause some weird issues, as it would reinitialise the global numpy RNG state multiple times during env instantiation. In general it isn't necessary, so it should probably be removed: the wrappers use the underlying env's np_random anyway, so they don't need to be seeded. Maybe the seed passed into init_each_env should just be used on the env directly:
```
env = env_cls()
# Compare against None so that seed=0 is not silently skipped.
if seed is not None:
    env.seed(seed)
```
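
To illustrate the concern about reseeding shared RNG state, here is a minimal sketch. It uses Python's standard `random` module as a stand-in for NumPy's global RNG (a substitution made only so the example has no dependencies). `ToyEnv` and both helper functions are hypothetical, and the per-env seed offset is an assumption of this sketch rather than something proposed in the thread.

```python
from __future__ import annotations

import random


class ToyEnv:
    """Hypothetical stand-in for an environment that owns its own RNG."""

    def __init__(self) -> None:
        self.rng = random.Random()

    def seed(self, seed: int) -> None:
        # Seeds only this env's generator; the global RNG is untouched.
        self.rng = random.Random(seed)


def draws_with_global_reseed(n: int, seed: int) -> list[float]:
    """Reseed the global RNG once per env, as a wrapper-level seed might."""
    draws = []
    for _ in range(n):
        random.seed(seed)  # resets the shared stream every time
        draws.append(random.random())  # every global consumer sees a repeat
    return draws


def make_envs_with_local_seed(n: int, seed: int | None) -> list[ToyEnv]:
    """Seed each env directly and leave the global RNG alone."""
    envs = []
    for i in range(n):
        env = ToyEnv()
        if seed is not None:  # `if seed:` would skip seed=0
            env.seed(seed + i)  # per-env offset: an assumption of this sketch
        envs.append(env)
    return envs


if __name__ == "__main__":
    print(draws_with_global_reseed(3, 0))
    print([env.rng.random() for env in make_envs_with_local_seed(3, 0)])
```

In the first helper every draw is identical because the shared stream is reset before each one; in the second, the global state is never modified.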

Also, for simplicity, I think it should be possible to have a single definition of init_each_env that is used for both MT and ML. It would take env_cls, tasks, task_select_method, and possibly seed, max_episode_steps, and use_one_hot / env_id / num_tasks, with branching logic for OneHotWrapper and the task select method. MT envs just provide those one-hot parameters and use all tasks, while ML envs handle the task splitting a bit differently but otherwise use the same wrappers and logic.
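
A rough sketch of what such a unified function might look like follows. Everything here is an assumption made for illustration: the wrapper classes are empty stand-ins that only record their arguments, the `"random"` / `"pseudorandom"` method names and the `RandomTaskSelect` / `PseudoRandomTaskSelect` classes are hypothetical, and the wrapper order is a guess, not the order used in Meta-World.

```python
from __future__ import annotations

from typing import Any, Callable, Sequence


class Wrapper:
    """Minimal stand-in for a Gymnasium-style wrapper (hypothetical)."""

    def __init__(self, env: Any, **options: Any) -> None:
        self.env = env
        self.options = options


# Empty stand-ins named after wrappers mentioned in the thread.
class TimeLimit(Wrapper): ...
class AutoTerminateOnSuccess(Wrapper): ...
class OneHotWrapper(Wrapper): ...
class RandomTaskSelect(Wrapper): ...
class PseudoRandomTaskSelect(Wrapper): ...


TASK_SELECTORS = {
    "random": RandomTaskSelect,
    "pseudorandom": PseudoRandomTaskSelect,
}


class ToyEnv:
    """Hypothetical env exposing only the seed() hook used below."""

    def __init__(self) -> None:
        self.seed_value: int | None = None

    def seed(self, seed: int) -> None:
        self.seed_value = seed


def init_each_env(
    env_cls: Callable[[], Any],
    tasks: Sequence[Any],
    task_select_method: str = "random",
    seed: int | None = None,
    max_episode_steps: int | None = None,
    use_one_hot: bool = False,
    env_id: int | None = None,
    num_tasks: int | None = None,
    terminate_on_success: bool = False,
) -> Any:
    """Build one wrapped env; shared by MT and ML in this sketch."""
    if task_select_method not in TASK_SELECTORS:
        raise ValueError(f"unknown task_select_method: {task_select_method!r}")
    env = env_cls()
    if seed is not None:
        env.seed(seed)  # seed the env itself, not the wrappers
    if max_episode_steps is not None:
        env = TimeLimit(env, max_episode_steps=max_episode_steps)
    env = AutoTerminateOnSuccess(env, enabled=terminate_on_success)
    if use_one_hot:
        if env_id is None or num_tasks is None:
            raise ValueError("use_one_hot requires env_id and num_tasks")
        env = OneHotWrapper(env, env_id=env_id, num_tasks=num_tasks)
    return TASK_SELECTORS[task_select_method](env, tasks=list(tasks))
```

Under this design an MT caller would pass all tasks plus the one-hot parameters, while an ML caller would pass only its train or test split and leave `use_one_hot` off.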

reginald-mclean · 2024-08-30T15:49:34Z

@rainx0r

> It's not entirely clear to me why _make_single_env() has ML1-related code when _make_single_ml() exists.

Good catch, remnant of previous attempt at creating ML envs.

> For MT envs, they should also have an AutoTerminateOnSuccess wrapper but it should be toggled to the initial state specified by the terminate_on_success flag as done

Will update

> use_one_hot is set to False for all registered envs but it should be True for MT10 and MT50.

I don't know if I agree with this. We can include the wrapper for completeness and show examples of enabling it, but I think we shouldn't influence users to use the wrapper by default.

> Maybe the seed passed into init_each_env should just be used on the env directly

Agreed

> Also I think for simplicity it should be possible to just have a single definition of init_each_env

It would be possible but I think it might become a bit of a convoluted function to write/maintain. Unless there's a clean way of merging them, I think keeping them separate makes the most sense for maintenance reasons.

pseudo-rnd-thoughts · 2024-09-02T15:24:21Z

From experience, having tonnes of registered environments can make life easier, but it also means many more environments to maintain. For future-proofing, an alternative approach is:

We can keep MetaWorld/env-name; this is good.
Then for MT1, an alternative is gym.make("MetaWorld/MT1", env_name="env_name", mode="train/test"). This allows flexibility with env_name (to add more) and uses mode to specify whether to train or test.

Similarly for MT10, we can use env_names=[...]

Is there any reason for supporting 1 and 10 only? Could we make it a MultiTask environment with env_names that is flexible to any number?

I'm purely spitballing ideas, you don't need to take any of them

reginald-mclean · 2024-09-05T16:37:56Z

@pseudo-rnd-thoughts
I actually implemented something along the lines of what you suggested but forgot to include it. In addition to the above gym.make_vec commands there is also:

gym.make_vec('Meta-World/custom-mt-envs-sync', envs_list=['env_name1-v3', 'env_name2-v3', ...])  # or custom-mt-envs-async
gym.make_vec('Meta-World/custom-ml-envs-sync', envs_list=['env_name1-v3', 'env_name2-v3', ...])  # or custom-ml-envs-async

Both of the above commands give the user control over the environments that they want to use in a multi-task or meta-RL setting, instead of the predefined ones.

I agree, it is a LOT of environments to add, but there are also lots of different use cases for MW environments. Some of them are single environments (e.g. 'Meta-World/reach-v3'), some of them are the smaller MT/ML environments (e.g. 'Meta-World/ML1-train-reach-v3'), and some of them are the pre-defined environment sets (MT10, MT50, ML10, ML45).

pseudo-rnd-thoughts · 2024-09-18T12:37:53Z

@reginald-mclean In my opinion, I would work to keep the number of environments to a minimum.
Personally, I would only have gym.make for the individual environments, i.e., gym.make("Meta-World/reach-v3"),
alongside the generic MultiTask and MetaTask single and vector environments.
Then finally, keep the original gym.make("MetaWorld/MT50")-style IDs for MT10, MT50, ML10, and ML45.

This reduces the mess you need to maintain and gives users more options in what they do.
If a user wants a custom multi-task setup with envs x, y, z, then they can make it directly, rather than finding a close but not equivalent version among the 100s of environments that could be registered.

Environment parameters are your friend here, minimising what you need to maintain while adding flexibility for users.

I say this as I remove over 800 environments from Atari; currently there are over 1000 environments registered for only 100 games. For ALE, there are 14 environments registered for each game, which in my opinion is crazy, and very few people actually use the extra / special registered environments, which can be accessed through parameters.

reginald-mclean · 2024-11-02T01:43:25Z

@pseudo-rnd-thoughts OK, I have thought about it and I definitely agree with your comment about having minimal environments. I've moved the environments into their base environments (i.e. MT1, MT10, MT50) and exposed the various features via arguments. A few extra commits have slipped into this PR; I'm just trying to figure out how to remove them.

reginald-mclean and others added 21 commits May 2, 2024 13:17

refactored project and removed v1 files/references/envs

f1a106b

fixing tests from refactor

af157ff

creating envs via gym.make

281a124

pre-commit

3b7da2c

fixing pre-commit

a923060

update to v3

31fd8e2

updates

55ead70

moving wrappers to own folder, fixing ML10/ML45 env creation to match CleanRL method

7b7038a

Merge branch 'removeV1_refactor' of github.com:reginald-mclean/Metaworld into reginald-mclean-removeV1_refactor

54814df

Merge branch 'reginald-mclean-removeV1_refactor'

cfecdaf

pre-commit

16480c0

updating to use refactored MW

f9190f9

updating gymnasium requirement to install alpha 2 (Farama-Foundation#482)

2d953e3

docs work (Farama-Foundation#483)

4a315be

Pages work (Farama-Foundation#484)

def7959

missing goal fixes (Farama-Foundation#485)

de0000d

expert actions page

8e29dd5

updates

3f953b9

pre-commit

5703841

merging

14df6bc

pre-commit

852ea81

reginald-mclean added 7 commits August 30, 2024 12:26

pre-commit

2b57f0e

pre-commit

7d37933

pre-commit & type hinting

3cfbad0

update action

3a1d74f

type hinting

19f2062

type hinting again

99ba23a

type hinting again x 2

4c96658

reginald-mclean added 4 commits August 30, 2024 13:05

env-names in env-dict

eda5ba4

updating paths

a63a024

updating tests

0f8d38e

revert mt50 test classes = None

50f9a21

frankroeder force-pushed the addGymMake branch from fd0201e to 50f9a21 Compare September 3, 2024 13:36

rainx0r added 9 commits September 8, 2024 01:21

Refactor gym make env init code

3455d97

Add evaluation code without the buffer

9a96319

Finish first draft of evaluation code

7a3d23c

Test evaluation

fc1dd5b

Fix refactored gym make

9fe624c

Make tests deterministic & finalize eval tests

acfc579

Add missing future imports

befba60

Update ML evaluation tests to not use partial observability

96c52d7

Add gym make tests

e3350d1

rainx0r added 3 commits October 25, 2024 16:36

Merge branch 'addGymMake' into add_evaluation

437b893

Fix minor issues with gym make and eval

32d069c

Finish ML1

d330e05

reginald-mclean force-pushed the addGymMake branch from 506576f to d330e05 Compare November 5, 2024 22:06

reginald-mclean added 6 commits November 5, 2024 18:26

streamlining gym.make for creating envs

836de39

Merge remote-tracking branch 'farama/master' into addGymMake

b677a18

pre-commit

279136c

pre-commit

e1b74d2

updating dependencies to not use head of master gymnasium branch

0b499b7

updating evaluation tests for gym.make changes

b32454e
