125/96 Episodes with malfunction for benchmarking and regression tests. 8 Policy abstraction. #131

chenkins · 2025-02-23T17:23:32Z

Changes

Extract Trajectory data structure from benchmark_episodes.py.
Deprecate duplicate env_generator in rail_env_utils.py. Add initial observations and infos from reset() to env_generator interface
Add cli for generating trajectories from policy (policy abstraction)
Add cli for validating and evaluating trajectories
data model and flow documentation

Related issues

Closes #125
Closes to #8
Closes #96

Checklist

Tests are included for relevant behavior changes.
Documentation is added in the flatland-book repo for relevant behavior changes.
If you made important user-facing changes, describe them under the [Unreleased] tag in CHANGELOG.md.
New package dependencies are declared in the pyproject.toml file.
Requirement files have been updated by running tox -e requirements.
Code works with all supported Python versions (3.10, 3.11 and 3.12). Checks run with all three version and are
required to run successfully.
Code is formatted according to PEP 8 (an IDE like PyCharm can do this for you).
Technical guidelines listed in CONTRIBUTING.md are followed.

…bservations and infos from reset() to env_generator interface.

…ut loading the reset env directly..

…tic policy

flatland/trajectories/trajectories.py

chenkins · 2025-02-26T15:07:26Z

flatland/trajectories/trajectories.py

+
+
+class Policy:
+    def act(self, handle: int, observation: Any, **kwargs) -> RailEnvActions:


@manuschn Not sure we want/need the handle as well? The policy's acting should be based on the observation only and not depend on the handle?

aiAdrian

Do we break all python version below 3.9 !!!!
TypeError: Type subscription requires python >= 3.9

(flatland-rl-test) u216993@K57156:~/flatland/test/flatland_solver_policy$ python example/flatland_dynamics/example_flatland_dynamics.py
Traceback (most recent call last):
File "example/flatland_dynamics/example_flatland_dynamics.py", line 4, in
from environment.flatland_railway_extension.flatland_dynamics import FlatlandDynamicsEnvironment
File "/home/u216993/flatland/test/flatland_solver_policy/environment/flatland_railway_extension/flatland_dynamics.py", line 3, in
from flatland_railway_extension.FlatlandEnvironmentHelper import FlatlandEnvironmentHelper
File "/home/u216993/flatland/test/flatland_railway_extension/flatland_railway_extension/FlatlandEnvironmentHelper.py", line 6, in
from flatland.envs.malfunction_generators import MalfunctionParameters, ParamMalfunctionGen
File "/home/u216993/flatland/test/flatland-rl/flatland/envs/malfunction_generators.py", line 8, in
from flatland.envs import persistence
File "/home/u216993/flatland/test/flatland-rl/flatland/envs/persistence.py", line 10, in
from flatland.envs import rail_env
File "/home/u216993/flatland/test/flatland-rl/flatland/envs/rail_env.py", line 28, in
from flatland.utils import seeding
File "/home/u216993/flatland/test/flatland-rl/flatland/utils/seeding.py", line 97, in
HashableRandomState = Tuple[str, np.ndarray[np.uint], int, int, float]
TypeError: Type subscription requires python >= 3.9

…to 125-episodes-refactoring

chenkins · 2025-03-10T12:45:29Z

flatland/envs/persistence.py

+        # it's not sufficient to store random_seed, as seeding from random_seed is done
+        # at start of reset (before rail/line/timetable (re-)generation,
+        # hence np_random depends on rail/line/timetable generation
+        # TODO would it be better to have env_generation without reset? Conceptually, the env should be initialised with a state incl. the seed


Discuss/resolve TODO.
Current approach: reset calls rail/line/timetable generators -> random state at start depends on these generators.
Desireable approach (anticipated by env_generator): the env starts is already reset in initial state, therefore need to persist np_random after reset is called.

Is the current implementation self-consistent? Add test

save env without reset

save env after reset
load both and reset w/ and w/o reset -> the result should be the same in all 4 cases. Is this enough?

chenkins · 2025-03-10T12:46:14Z

flatland/trajectories/README.md

+============
+
+
+TODO move to `flatland-book`?


@manuschn are these diagrams helpful ?

chenkins · 2025-03-10T12:49:25Z

flatland/trajectories/README.md

+
+```
+
+Flow Env Step


@manuschn idea is to refactor step/reset accordingly reflecting this flow structure -> #138

…to 125-episodes-refactoring

Extract Trajectory data structure from benchmark_episodes.py.

d6d8f2b

chenkins added this to the 4.0.6 milestone Feb 23, 2025

chenkins added 3 commits February 23, 2025 20:39

Deprecate duplicate env_generator in rail_env_utils.py. Add initial o…

be366cb

…bservations and infos from reset() to env_generator interface.

Add running submission to generate a trajectory (WiP).

ddbad94

Add Mermaid data model (WiP).

93f5e08

chenkins changed the title ~~Extract Trajectory data structure from benchmark_episodes.py.~~ 125 Extract Trajectory data structure from benchmark_episodes.py. Feb 23, 2025

chenkins changed the title ~~125 Extract Trajectory data structure from benchmark_episodes.py.~~ 125 Episodes with malfunction for benchmarking and regression tests. Policy abstraction. Feb 23, 2025

chenkins added 15 commits February 24, 2025 10:10

Verify running submission to generate a trajectory (WiP).

4c0f467

Add snapshoting

8ecdcb2

Add progress bar.

d38e532

Cleanup.

4988c44

Persist rail_env's np_random in order not to rely on calling reset, b…

6dc4d6a

…ut loading the reset env directly..

Add cli.

ffed198

Remove position collection from Trajectory.run().

88b1004

Cleanup.

3ef0240

Re-encode regression episodes.

87e0694

Add trajectory generation from metadata.csv.

ce795d7

Generate trajectories with malfunction from deadlock avoidance heuris…

9b7ee73

…tic policy

Run installed demo cli in tox.

081beec

Fix typo.

d5f1fbe

Add flow chart for Trajectory Runs.

499c2ec

Add flow chart env.reset and env.step

583c007

chenkins marked this pull request as ready for review February 26, 2025 10:06

chenkins requested a review from a team as a code owner February 26, 2025 10:06

chenkins changed the title ~~125 Episodes with malfunction for benchmarking and regression tests. Policy abstraction.~~ 125/96 Episodes with malfunction for benchmarking and regression tests. 8 Policy abstraction. Feb 26, 2025

Cleanup.

cc88aae

chenkins commented Feb 26, 2025

View reviewed changes

flatland/trajectories/trajectories.py Show resolved Hide resolved

chenkins commented Feb 26, 2025

View reviewed changes

aiAdrian approved these changes Feb 26, 2025

View reviewed changes

chenkins mentioned this pull request Mar 3, 2025

Refactor step()/reset(). #138

Open

aiAdrian reviewed Mar 4, 2025

View reviewed changes

chenkins requested a review from a team March 6, 2025 20:11

chenkins added 3 commits March 6, 2025 21:14

Use flatland-scenarios instead of data.flatland.cloud for trajectories.

69391c8

Merge branch 'main' of github.com:flatland-association/flatland-rl in…

3fbf119

…to 125-episodes-refactoring

Merge branch 'main' of github.com:flatland-association/flatland-rl in…

3f319fb

…to 125-episodes-refactoring

chenkins modified the milestones: 4.0.6, 4.0.5 Mar 10, 2025

chenkins commented Mar 10, 2025

View reviewed changes

chenkins mentioned this pull request Mar 10, 2025

140 Rail, Line and Timetable from File Generators. #141

Open

7 tasks

Merge branch 'main' of github.com:flatland-association/flatland-rl in…

85556cc

…to 125-episodes-refactoring

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

125/96 Episodes with malfunction for benchmarking and regression tests. 8 Policy abstraction. #131

125/96 Episodes with malfunction for benchmarking and regression tests. 8 Policy abstraction. #131

chenkins commented Feb 23, 2025 •

edited

Loading

chenkins Feb 26, 2025

aiAdrian left a comment

chenkins Mar 10, 2025 •

edited

Loading

chenkins Mar 10, 2025 •

edited

Loading

chenkins Mar 10, 2025



		class Policy:
		def act(self, handle: int, observation: Any, **kwargs) -> RailEnvActions:


		```

		Flow Env Step

125/96 Episodes with malfunction for benchmarking and regression tests. 8 Policy abstraction. #131

Are you sure you want to change the base?

125/96 Episodes with malfunction for benchmarking and regression tests. 8 Policy abstraction. #131

Conversation

chenkins commented Feb 23, 2025 • edited Loading

Changes

Related issues

Checklist

chenkins Feb 26, 2025

Choose a reason for hiding this comment

aiAdrian left a comment

Choose a reason for hiding this comment

chenkins Mar 10, 2025 • edited Loading

Choose a reason for hiding this comment

chenkins Mar 10, 2025 • edited Loading

Choose a reason for hiding this comment

chenkins Mar 10, 2025

Choose a reason for hiding this comment

chenkins commented Feb 23, 2025 •

edited

Loading

chenkins Mar 10, 2025 •

edited

Loading

chenkins Mar 10, 2025 •

edited

Loading