Skip to content

Commit

Permalink
updated the link to Four Rooms environment
Browse files Browse the repository at this point in the history
  • Loading branch information
amsks committed Nov 9, 2023
1 parent 55c9e56 commit 53f07be
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion rl_exercises/week_4/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,4 +19,4 @@ Use the [Hydra SMAC sweeper](https://github.com/automl/hydra-smac-sweeper.git) t

## Level 3
### Implementing TD($\lambda$)
In the same format as the SARSA code, implement the TD($\lambda(n)$) algorithm on the [Gridcore environment](https://github.com/automl/TabularTempoRL/blob/master/grid_envs.py). Make $n$ a configurable parameter signifying the number of lookahead steps. Try to ablate the peformance for multiple values of $n$ and verify the theoretical claims in the lecture.
In the same format as the SARSA code, implement the TD($\lambda(n)$) algorithm on etiher the [Gridcore environment](https://github.com/automl/TabularTempoRL/blob/master/grid_envs.py), or the [Four Rooms environment](https://github.com/Farama-Foundation/Minigrid/blob/master/minigrid/envs/fourrooms.py). Make $n$ a configurable parameter signifying the number of lookahead steps. Try to ablate the peformance for multiple values of $n$ and verify the theoretical claims in the lecture.

0 comments on commit 53f07be

Please sign in to comment.