FIRE action wrapper from gym_wrappers #768

jbuckman · 2021-11-09T05:32:06Z

jbuckman
Nov 9, 2021

Hi, can someone please explain to me the following logic? From https://github.com/PyTorchLightning/lightning-bolts/blob/master/pl_bolts/models/rl/common/gym_wrappers.py#L45. My understanding is that this is intended to fix the issue of some games not starting unless the agent takes the FIRE action, leading to some agents staying in the start state forever. But what are these lines of code actually doing?

    def reset(self):
        """reset the env."""
        self.env.reset()
        obs, _, done, _ = self.env.step(1)
        if done:
            self.env.reset()
        obs, _, done, _ = self.env.step(2)
        if done:
            self.env.reset()
        return obs

I don't understand the need to do 3 resets, the choices of actions 1 and 2, and the if done clauses. Also, I don't see how this fully fixes the issue, since for many environments (for example Breakout) the FIRE action must be taken once again after each life loss. Can anyone explain?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIRE action wrapper from gym_wrappers #768

{{title}}

Replies: 0 comments

Select a reply

FIRE action wrapper from gym_wrappers #768

jbuckman Nov 9, 2021

Replies: 0 comments

jbuckman
Nov 9, 2021