Pull requests: xeviknal/aidl-2021-wo-rl
Pull requests list
[Final experiments - REINFORCE] experiment #1 with seed 7081960
#67
opened Apr 18, 2021 by
jaimepedretp
[Final experiments - RL-Baseline] experiment #2 with seed 1000
#66
opened Apr 18, 2021 by
jaimepedretp
Hyperparam tuning - few epochs, high vf coeff (c1) - medium entropy coeff (c2)
#64
opened Apr 18, 2021 by
xeviknal
[PPO - early-step / green penalty] Value Function coeff c1 = 2.0, entropy coeff c2 = 0.08
#63
opened Apr 18, 2021 by
xeviknal
[Final experiments - RL-Baseline] experiment #3 with seed 190421
#58
opened Apr 15, 2021 by
ziritrion
[Final experiments - RL-Baseline] experiment #1 with seed 7081960
#57
opened Apr 15, 2021 by
ziritrion
[Final experiments - REINFORCE] experiment #3 with seed 190421
#53
opened Apr 13, 2021 by
ziritrion
[Final experiments - REINFORCE] experiment #2 with seed 1000
#52
opened Apr 13, 2021 by
ziritrion
PPO-early-stop: finish the episode after 50 steps if avg reward is negative
#51
opened Apr 10, 2021 by
xeviknal