Run ./TerrainRL_Optimizer -arg_file= args/dog_slopes_mixed_args.txt command #32

Open
wenyijiang opened this issue May 21, 2017 · 10 comments


@wenyijiang

Hi!
When I run the ./TerrainRL_Optimizer -arg_file= args/dog_slopes_mixed_args.txt command, my computer starts computing something and prints values labelled Episodes, Cycles, and Avg dist. What do these values mean?
Thanks a lot! Best wishes!

@xbpeng
Owner

xbpeng commented May 21, 2017

That arg file is mainly intended to be used by the TerrainRL.exe app. It doesn't do any training; it just runs a policy. For training, use the arg files with "train" in the name.

@wenyijiang
Author

I want to train. When I run the ./TerrainRL_Optimizer -arg_file= args/opt_args_train_mace.txt command, it shows:

[libprotobuf ERROR google/protobuf/text_format.cc:274] Error parsing text-format caffe.NetParameter: 14:19: Message type "caffe.MemoryDataParameter" has no field named "label_size".
F0522 11:48:02.638092 14423 upgrade_proto.cpp:88] Check failed: ReadProtoFromTextFile(param_file, param) Failed to parse NetParameter file: data/policies/dog/nets/dog_mace3_train.prototxt

Could you please tell me what is going wrong when training starts? Apart from training, the program runs normally.
Thanks a lot!

@nwcora

nwcora commented Sep 10, 2017

Hello, when I run ./TerrainRL_Optimizer -arg_file= args/dog_slopes_mixed_args.txt, the terminal displays output like this:
Action: 2
Val: (0.993) 0.989 0.961 0.993
What do those values mean?
Also, if it is running a policy, why is no simulation scene shown, as there is when I run ./TerrainRL -arg_file= args/sim_dog_args.txt? How can I see the simulation?

@xbpeng
Owner

xbpeng commented Sep 10, 2017

Those lines print the value of each critic and show which actor was selected. In this case, Actor 2 is selected since its corresponding critic has the highest value. TerrainRL_Optimizer does not support any rendering, since it is mainly used for offline training. TerrainRL is the app for visualizing the policies.
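
For readers following along, the selection rule described above can be sketched in a few lines of Python. This is only an illustration, not the project's actual code: the function names `select_actor` and `format_log` are hypothetical, and it assumes that actors are indexed from zero and that the value in parentheses is the selected critic's value, which is what the log above suggests.

```python
def select_actor(critic_values):
    """Return (index, value) of the critic with the highest value.

    Assumption: actors are zero-indexed, so index 2 is the third critic.
    """
    best = max(range(len(critic_values)), key=lambda i: critic_values[i])
    return best, critic_values[best]


def format_log(critic_values):
    """Mimic the two log lines quoted in the thread (format is assumed)."""
    actor, value = select_actor(critic_values)
    all_values = " ".join(f"{v:.3f}" for v in critic_values)
    return f"Action: {actor}\nVal: ({value:.3f}) {all_values}"


# Critic values taken from the log quoted in the thread.
print(format_log([0.989, 0.961, 0.993]))
# Action: 2
# Val: (0.993) 0.989 0.961 0.993
```

With the three values from the log, the third critic (index 2) has the highest value, 0.993, which matches "Action: 2".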

@nwcora

nwcora commented Sep 10, 2017

I have run TerrainRL -dog_slopes_mixed.txt. So there is no training process, and it is just running a policy? Where is the policy network? And what is the difference between these two files: dog_slopes_mixed_args.txt and dog_slopes_mixed.txt?

@xbpeng
Owner

xbpeng commented Sep 10, 2017

Yes, that is just running a policy. For training, use the opt_args_train_*.txt files. You can find which policy those files are running by looking at the path specified by "-policy_model=".
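
As a rough aid for the lookup described above, here is a minimal Python sketch that pulls the "-policy_model=" path out of an arg file's text. It assumes the arg files are plain text made of `-key= value` tokens, with optional spaces after the `=` (as in the commands quoted in this thread). The helper name `find_policy_models` and the sample contents, including the path, are hypothetical and not taken from the repository.

```python
import re

# Match "-policy_model=" at the start of a token, then optional spaces or
# tabs, then the path. Newlines are not skipped, so an empty value does not
# swallow the next key.
POLICY_MODEL_RE = re.compile(r"(?<!\S)-policy_model=[ \t]*(\S+)")


def find_policy_models(arg_text):
    """Return every path given via -policy_model= in the arg file text."""
    return POLICY_MODEL_RE.findall(arg_text)


# Hypothetical arg file contents, for illustration only.
sample = """
-scenario= example_scenario
-policy_model= data/policies/example/model.h5
"""

print(find_policy_models(sample))
# ['data/policies/example/model.h5']
```

To check a real file, read it first, for example `find_policy_models(open(path).read())`, where `path` points at one of the arg files.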

@nwcora

nwcora commented Sep 10, 2017

Yes, you are right, the policy is the file you describe. But I don't understand the policy file, which is an .h5 file. What does the binary data in it mean? I assumed a policy would be something like target joint angles, so I am confused. Maybe it holds the network parameters?

@xbpeng
Owner

xbpeng commented Sep 10, 2017

The policy files contain the network weights. The output actions from the policies consist of joint angles and other parameters for the FSM.

@nwcora

nwcora commented Sep 11, 2017

I see. But args/sim_dog_args.txt doesn't specify any policy, so it seems the control is just the default output action, right? In the dog_character file there are three controllers: fast run, slow run, and jump. I guess they are used to form the initial actions. What do the dog_motion file and the state_file represent? The motion file is 24-dimensional and forms a loop, so maybe it is the action loop? The state_file is 23-dimensional for both pose and velocity, and I am not sure what the numbers stand for.

@xbpeng
Owner

xbpeng commented Sep 11, 2017

Yes, args/sim_dog_args.txt doesn't have a policy; it just runs the default FSM. And yes, the dog_character file specifies the initial actions for the fast run, slow run, and jump. The motion files were used for some other things we were working on, so they are not relevant here. The state file represents the initial state of the character at the start of each episode (pose and velocity).
