Separate interior state and boundary forcing to only predict state #84

joeloskarsson · 2024-10-31T08:58:20Z

Describe your changes

The goal of this PR is to establish a clear separation between the (predicted) state in the interior region and the boundary forcing coming from outside (and potentially overlapping with) the limited area. This PR makes these changes on the modeling side, that is, from the point where each batch is fetched from the dataset class and onward. This includes how the tensors are propagated through the model, loss calculation, evaluation and plotting. It should be complemented by a separate PR handling the data-loading side of things, where the boundary forcing could come from a separate dataset. That change should then build upon #66, once merged.

Note: To allow work on this before the change has been made on the data-loading side, this PR currently includes changes to the MEPS npy dataset class that already separate state and boundary there. This defines the interface between the dataset and the model (currently missing #64 from #66, but that can easily be added later) and allows the two sides to be worked on separately.
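As a rough illustration of the interface described above, the sketch below splits per-node features into an interior state and a boundary forcing using a boolean mask. The function name `split_state_and_boundary`, the list-based data layout and the non-overlapping mask are assumptions made for this example; the dataset class in this PR works on tensors and may differ.

```python
from typing import List, Sequence, Tuple


def split_state_and_boundary(
    full_field: Sequence[Sequence[float]],
    interior_mask: Sequence[bool],
) -> Tuple[List[List[float]], List[List[float]]]:
    """Split per-node features into interior state and boundary forcing.

    full_field: one feature row per grid node, shape (num_nodes, num_features).
    interior_mask: True for nodes inside the limited area, False for boundary.
    Hypothetical helper, not part of neural-lam.
    """
    if len(full_field) != len(interior_mask):
        raise ValueError("mask must have one entry per grid node")
    # Nodes flagged True form the state that the model predicts
    interior_state = [
        list(row) for row, inside in zip(full_field, interior_mask) if inside
    ]
    # All remaining nodes are only ever used as input (boundary forcing)
    boundary_forcing = [
        list(row) for row, inside in zip(full_field, interior_mask) if not inside
    ]
    return interior_state, boundary_forcing


# Four grid nodes with one feature each; the two middle nodes are interior
full_field = [[1.0], [2.0], [3.0], [4.0]]
interior_mask = [False, True, True, False]
interior_state, boundary_forcing = split_state_and_boundary(
    full_field, interior_mask
)
```

With this kind of split, the model and loss only ever see `interior_state` as a prediction target, while `boundary_forcing` is passed along as an additional input.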

After this change:

The model will only predict outputs within the limited area considered
Plots will only include points within the limited area (this could be expanded in a future PR to plot also boundary fields, but would require a mapping between state and boundary forcing dimensions to plot together)
Graphs will have to be created using a boundary mask, as in mllam/weather-model-graphs#34 ("Add a decoding mask option to only include subset of grid nodes in m2g"), to make sure that the m2g component only maps to the interior nodes.
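The masking idea in the last point can be sketched as follows: keep only those mesh-to-grid edges whose target grid node is interior, and re-index the remaining grid nodes so that they count interior nodes only. This is a simplified stand-in written for illustration, not the weather-model-graphs API; the name `mask_m2g_edges` and the edge representation as `(mesh_idx, grid_idx)` tuples are assumptions.

```python
from typing import Dict, List, Sequence, Tuple

Edge = Tuple[int, int]  # (mesh node index, grid node index)


def mask_m2g_edges(
    m2g_edges: Sequence[Edge], decode_mask: Sequence[bool]
) -> List[Edge]:
    """Drop m2g edges that decode to boundary nodes and re-index grid nodes.

    decode_mask: True for grid nodes that should receive model output.
    Hypothetical helper, not the weather-model-graphs implementation.
    """
    # Map original grid index -> position among interior nodes only
    interior_index: Dict[int, int] = {}
    for grid_idx, keep in enumerate(decode_mask):
        if keep:
            interior_index[grid_idx] = len(interior_index)
    # Keep edges into interior nodes, using the new interior-only indices
    return [
        (mesh_idx, interior_index[grid_idx])
        for mesh_idx, grid_idx in m2g_edges
        if grid_idx in interior_index
    ]


decode_mask = [False, True, True, False]
m2g_edges = [(0, 0), (0, 1), (1, 2), (1, 3)]
masked_edges = mask_m2g_edges(m2g_edges, decode_mask)
```

Here the edges into grid nodes 0 and 3 are dropped, and the two remaining edges point at interior indices 0 and 1.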

Dependencies:

This introduces a dependency on https://github.com/mllam/weather-model-graphs. In particular, this dependency should be adjusted before merging to require a version after mllam/weather-model-graphs#34 has been merged.

Issue Link

There is no issue specific to the separation of interior state. This relates to the overall rework of Reading Training Data, but it would be good to add it as a separate point on the roadmap.

This includes graph-creation with wmg (#83).

Type of change

🐛 Bug fix (non-breaking change that fixes an issue)
✨ New feature (non-breaking change that adds functionality)
💥 Breaking change (fix or feature that would cause existing functionality to not work as expected)
📖 Documentation (Addition or improvements to documentation)

Checklist before requesting a review

My branch is up-to-date with the target branch - if not update your fork with the changes from the target branch (use pull with --rebase option if possible).
I have performed a self-review of my code
For any new/modified functions/classes I have added docstrings that clearly describe its purpose, expected inputs and returned values
I have placed in-line comments to clarify the intent of any hard-to-understand passages of my code
I have updated the README to cover introduced code changes
I have added tests that prove my fix is effective or that my feature works
I have given the PR a name that clearly describes the change, written in imperative form (context).
I have requested a reviewer and an assignee (assignee is responsible for merging). This applies only if you have write access to the repo, otherwise feel free to tag a maintainer to add a reviewer and assignee.

Checklist for reviewers

Each PR comes with its own improvements and flaws. The reviewer should check the following:

the code is readable
the code is well tested
the code is documented (including return types and parameters)
the code is easy to maintain

Author checklist after completed review

I have added a line to the CHANGELOG describing this change, in a section
reflecting type of change (add section where missing):
- added: when you have added new functionality
- changed: when default behaviour of the code has been changed
- fixes: when your contribution fixes a bug

Checklist for assignee

PR is up to date with the base branch
the tests pass
author has added an entry to the changelog (and designated the change as added, changed or fixed)
Once the PR is ready to be merged, squash commits and merge the PR.

joeloskarsson · 2024-11-14T08:29:29Z

This is now ready for a first review. As mentioned in the description, this only changes the modeling side of things, and I have tweaked the existing MEPS data loading to be able to test these changes. We will have to see whether the changes to data loading (building on top of #66) should go in a separate PR or just be added to this one. I could see both as good solutions.

There are two things preventing the tests from passing:

This changes the grid_shape_state entry in the config to mean only the interior, not the boundary. Thus the grid_shape_state in the reduced MEPS data used for tests is wrong. How/if we want to change this depends on how we want to order this PR relative to #66 ("Add "datastores" to represent input data from zarr, npy, etc").
This introduces graph creation using weather-model-graphs, and in particular relies on the masking functionality in mllam/weather-model-graphs#34 ("Add a decoding mask option to only include subset of grid nodes in m2g"). That PR needs to be merged for the tests to construct the correct graph.
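To make the first point concrete, here is a small consistency check one could imagine for the new meaning of grid_shape_state: the product of its entries should equal the number of interior grid nodes, not the size of the full domain. The helper name `grid_shape_matches_interior` and the flat boolean mask are assumptions for this sketch and do not exist in the code base.

```python
from math import prod
from typing import Sequence


def grid_shape_matches_interior(
    grid_shape_state: Sequence[int], interior_mask: Sequence[bool]
) -> bool:
    """Return True if grid_shape_state describes exactly the interior nodes.

    Hypothetical check, written only to illustrate the config semantics.
    """
    num_interior = sum(1 for inside in interior_mask if inside)
    return prod(grid_shape_state) == num_interior


# A 4x4 full domain with a one-node-wide boundary leaves a 2x2 interior
interior_mask = [
    1 <= i <= 2 and 1 <= j <= 2 for i in range(4) for j in range(4)
]
interior_ok = grid_shape_matches_interior((2, 2), interior_mask)
full_domain_ok = grid_shape_matches_interior((4, 4), interior_mask)
```

In this toy setup, a config that still lists the full 4x4 domain would fail the check, which is analogous to the situation with the reduced MEPS test data.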

My idea is that this could get a quick review right now, just considering the current changes, and then we can make a plan w.r.t. the merging or continued work on this. Don't spend time on the changes to the MEPS data loading made here, as those will be replaced by #66 anyway.

joeloskarsson marked this pull request as draft October 31, 2024 08:59

joeloskarsson mentioned this pull request Oct 31, 2024

Add graph creation functionality using weather-model-graphs #83

Open

3 tasks

joeloskarsson mentioned this pull request Nov 11, 2024

Add a decoding mask option to only include subset of grid nodes in m2g mllam/weather-model-graphs#34

Open

20 tasks

joeloskarsson self-assigned this Nov 14, 2024

joeloskarsson requested review from leifdenby and sadamov November 14, 2024 08:20

joeloskarsson marked this pull request as ready for review November 14, 2024 08:29

joeloskarsson commented Nov 14, 2024 on neural_lam/build_rectangular_graph.py (resolved)

joeloskarsson commented Nov 14, 2024 on neural_lam/utils.py (outdated, resolved)

joeloskarsson commented Nov 14, 2024 on neural_lam/utils.py (resolved)

joeloskarsson added this to the v0.4.0 milestone Nov 20, 2024

leifdenby added 5 commits November 25, 2024 16:42

identified issue, cleanup next

5904cbe

use xarray plot only

efe0302

don't reraise

a489c2e

remove debug plot

242d08b

remove extent calc used in diagnosing issue

c1f706c

joeloskarsson marked this pull request as draft November 27, 2024 11:24

joeloskarsson added 12 commits November 28, 2024 09:04

Start changing dataset to return separate boundary forcing

7692361

Propagate separation of state and boundary change through training loop

efb5326

Start building graphs with wmg

8fe54b4

Change forward pass to concat according to enforced node ordering

ec26afb

wip to make tests pass

ede752d

Fix edge index manipulation to make training work again

0f1e073

Work on fixing plotting functionality

dd8a63a

Linting

b004b19

Add optional separate grid embedder for boundary

2688f56

Make new graph creation script main and only one

609717f

Fix some typos and forgot code

2864013

Correct handling of node indices for m2g when using decode_mask

24bb665

joeloskarsson force-pushed the boundary_forcing branch from b89a4e6 to 24bb665 November 28, 2024 09:37

Linting and bugfixes

bc21b73
