Hi, this is a nice project for a hybrid action space, and I see you mention PDQN/HPPO in README.md. Do you have any experimental results for these algorithms in this environment? If not, we would like to invite you to implement the related algorithms and benchmarks together with us in our repo DI-engine; we will offer the corresponding support. Would you be willing to help build a hybrid action space RL benchmark? Other comments are also welcome.
Thank you very much for your feedback!
Unfortunately, these days I am very busy and cannot take care of it.
I did implement P-QLearning in my q-learning-algorithms repo in the past, but I do not remember whether it converged or what score it reached.
Note: algorithms now use architectures that need to know which parameters are related to which action (e.g. MP-DQN). I think it may be better to change the way the action space is handled. I am not completely sure yet what the best way to do it is. Even though it would definitely future-proof the repository, it would also break any agent that used this env... gym-platform uses one tuple of spaces per parameter-action pair; I did not test how inconvenient it is to have an empty tuple (e.g. for the break action).
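For reference, here is a minimal sketch of that gym-platform-style layout (one parameter space per discrete action) using gym's spaces. The action names, bounds, and the empty-tuple slot for the parameterless break action are illustrative assumptions rather than the environment's actual definition, and the empty `Tuple` is exactly the untested case mentioned above:

```python
import numpy as np
from gym import spaces

# Hypothetical hybrid action space in the gym-platform style:
# a Discrete choice over actions, paired with one parameter space
# per action. Names and bounds are placeholders, not the real env.
action_space = spaces.Tuple((
    spaces.Discrete(3),  # 0: accelerate, 1: turn, 2: break
    spaces.Tuple((
        spaces.Box(low=0.0, high=1.0, shape=(1,), dtype=np.float32),   # accelerate amount
        spaces.Box(low=-1.0, high=1.0, shape=(1,), dtype=np.float32),  # turn angle
        spaces.Tuple(()),  # break takes no parameter: the untested "empty tuple" case
    )),
))

# A sampled action would then look like
# (action_id, (accelerate_param, turn_param, ())),
# which is the structure downstream agents would have to unpack.
sample = action_space.sample()
```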