Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

can amp model here reproduce the result of different tasks of ase? #31

Open
xjturobocon opened this issue Feb 16, 2023 · 11 comments
Open

Comments

@xjturobocon
Copy link

xjturobocon commented Feb 16, 2023

I try to use the following command to train amp model to complete the task just like location
python3 ase/run.py --task HumanoidStrike --cfg_env ase/data/cfg/humanoid_sword_shield_strike.yaml --cfg_train ase/data/cfg/train/rlg/amp_humanoid_task.yaml --motion_file ase/data/motions/reallusion_sword_shield/dataset_strike_amp.yaml

However, the result is worse than which produced by ase. For example, The agent trembles when walking. Do I need to adjust some parameters?

@xbpeng
Copy link
Collaborator

xbpeng commented Feb 18, 2023

Do you have a video of the behavior from the model? The strike task can be a bit challenging, and the behavior of the model can vary quite a bit between different runs. Have you try just training again?

@xjturobocon
Copy link
Author

here is a demo of location task. I try to train heading, location and strike, the common problem is the "locomotion" skill of these task, just like the demo.

amp_task_location-2023-02-20_11.15.21.mp4

@xbpeng
Copy link
Collaborator

xbpeng commented Feb 20, 2023

In principal it should be possible to use AMP for these tasks as well, but I haven't really tried tuning AMP for the tasks in this codebase before. Lookin at the video, it kind of seems like the weigh on the discriminator reward might need to be higher in order to encourage more natural behaviors:

@Murrol
Copy link

Murrol commented Mar 9, 2023

In my case, higher disc learning rate while lower disc update frequency might help, in addition to balance reward weights.

@xjturobocon
Copy link
Author

In my case, higher disc learning rate while lower disc update frequency might help, in addition to balance reward weights.

Can you show me a copy of your setting? Thank you.

@xizaoqu
Copy link

xizaoqu commented Mar 23, 2023

Maybe dataset_strike_amp.yaml doesn't contain enough locomotion clips? Replacing it by dataset_reallusion_sword_shield.yaml seems work well.

@xjturobocon
Copy link
Author

Maybe dataset_strike_amp.yaml doesn't contain enough locomotion clips? Replacing it by dataset_reallusion_sword_shield.yaml seems work well.

I also used all motions to train amp location task, but it also didn't work. Have you try it?

@xizaoqu
Copy link

xizaoqu commented Mar 23, 2023

Maybe dataset_strike_amp.yaml doesn't contain enough locomotion clips? Replacing it by dataset_reallusion_sword_shield.yaml seems work well.

I also used all motions to train amp location task, but it also didn't work. Have you try it?

strick.mp4

@xizaoqu
Copy link

xizaoqu commented Mar 23, 2023

image

@xjturobocon
Copy link
Author

image

Can you give me a contact information(WeChat or something else)? I want to ask you for specific information. Thank you.

@xizaoqu

This comment was marked as outdated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants