Cannot get the paper result #3
Comments
I have the same question. What's your specific situation? When I train the model, the reward barely changes, and when I test on the TMs, it's as if the training never learned anything.
Yes, it learns nothing. But you should hold the TM fixed when testing, instead of re-sampling it as in the training stage.
Sorry to bother you, but I don't really understand what you mean. How do I fix the TMs?
The author's code doesn't do any testing, so you have to write the test code yourself to reproduce the figure results in the paper. A sketch of what that could look like is below.
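For reference, a minimal sketch of such an evaluation loop, holding each TM fixed and acting greedily. Note that `agent.act`, `env.reward`, and `test_tms` are illustrative assumptions, not this repository's actual API:

```python
import numpy as np

def evaluate(agent, env, test_tms):
    """Score a trained agent on a held-out set of traffic matrices (TMs).

    Unlike training, each TM is held fixed while the agent acts on it,
    and exploration is disabled.
    """
    scores = []
    for tm in test_tms:
        action = agent.act(tm, explore=False)  # hypothetical deterministic-policy call
        scores.append(env.reward(tm, action))  # hypothetical reward query
    return float(np.mean(scores))
```

Averaging deterministic scores over a held-out TM set is presumably what a figure like the paper's would be built from; the exact reward definition has to come from the paper itself.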
Sorry to bother you!
Excuse me, I have a similar question. I can't understand why the state (TMs) and the new_state (TMs) are randomly generated in the step function in Environment.py. That doesn't follow the logic of DRL.
Has anyone gotten the same results as in the paper? My model is not converging.
I have the same question. I don't understand why the old state and the new state are randomly generated in Environment.py.
Did you run the whole simulation or not?
Excuse me, I ran the whole simulation. But in my experience, the STATE in reinforcement learning is normally changed by the ACTION, whereas in this paper's code (Environment.py) the NEW STATE and the OLD STATE are randomly generated, which does not seem to follow the logic of reinforcement learning. Could anyone clear up my confusion? Thank you very much. A sketch contrasting the two behaviours is below.
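For comparison, here is a rough sketch of the two `step` behaviours being discussed. This is purely illustrative and not the actual code of Environment.py; `compute_reward` and `transition` are hypothetical placeholders:

```python
import numpy as np

class EnvSketch:
    """Illustrative contrast only; not the repository's Environment.py."""

    def __init__(self, state_shape):
        self.state_shape = state_shape
        self.state = np.random.uniform(size=state_shape)

    def step_as_reported(self, action):
        # What this thread describes: the next state is sampled at random,
        # independent of the action, so no state dynamics can be learned.
        reward = self.compute_reward(self.state, action)
        self.state = np.random.uniform(size=self.state_shape)
        return self.state, reward

    def step_conventional(self, action):
        # Standard MDP logic: the next state depends on the current state
        # and the action just taken, via some transition model.
        reward = self.compute_reward(self.state, action)
        self.state = self.transition(self.state, action)
        return self.state, reward

    def compute_reward(self, state, action):
        raise NotImplementedError  # task-specific, e.g. -max link utilization

    def transition(self, state, action):
        raise NotImplementedError  # task-specific dynamics
```

One possible reading: if every TM is an independent demand sample, the task is closer to a contextual bandit (state = TM, action = routing, immediate reward) than a sequential MDP, which could explain the random sampling, but that would still deserve an explanation from the author.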
I've also noticed this problem. I think the author needs to provide some explanation; it violates the basic logic of reinforcement learning. @gissimo
Hello, may I ask how to run the whole simulation? Could you tell me the approximate steps? Thank you very much!
Has anyone gotten the Fig. 2 result in the paper? My model doesn't converge.