Fix comparison plot creation logic #87

nfb2021 · 2024-10-17T15:39:48Z

A preparation for stability metrics & improvement of intra-annual metrics

The logic used to create the clustered boxplots to compare a given metric for all temporal sub-windows present was changed.

Before, each temporal sub-window box in the boxplot could be calculated over potentially different GPIs. This would happen if the test area was masked during certain months or seasons, e.g. for snow or frozen soil. As a result, a direct comparison of these boxes was difficult.

Now, a spatial subset is chosen so that the same GPIs with valid data (based on a predefined threshold for the number of non-NaNs) are used for all temporal sub-windows present. The exception is temporal sub-windows with so many NaNs that they exceed the threshold. These temporal sub-windows are set entirely to NaN and no box is plotted for them, although their names still appear in the final plot.

This ensures a more robust comparability between different temporal sub-windows for a given metric.
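
The selection described above can be sketched roughly as follows. This is a minimal illustration, not the code in `plotter.py`: the function name `common_gpi_subset`, the parameter `max_nan_fraction`, the layout (one row per GPI, one column per temporal sub-window) and the example values are all assumptions made for this sketch.

```python
import numpy as np
import pandas as pd


def common_gpi_subset(df: pd.DataFrame, max_nan_fraction: float = 0.5) -> pd.DataFrame:
    """Hypothetical helper: keep only GPIs valid in every usable sub-window.

    Assumed layout: rows are GPIs, columns are temporal sub-windows.
    """
    # Fraction of NaNs per temporal sub-window (column)
    nan_fraction = df.isna().mean(axis=0)
    # Sub-windows below the (assumed) threshold take part in the comparison
    usable = nan_fraction.index[nan_fraction <= max_nan_fraction]
    # GPIs that have valid data in all usable sub-windows
    common = df[usable].notna().all(axis=1)
    # Reindexing restores the excluded sub-windows as all-NaN columns, so
    # their names are kept but there is nothing to draw a box from
    return df.loc[common, usable].reindex(columns=df.columns)


# Made-up metric values for five GPIs and four seasonal sub-windows
values = pd.DataFrame(
    {
        "DJF": [np.nan, np.nan, np.nan, np.nan, 0.5],  # mostly masked
        "MAM": [0.1, 0.2, np.nan, 0.4, 0.5],
        "JJA": [0.1, 0.2, 0.3, 0.4, 0.5],
        "SON": [0.1, np.nan, 0.3, 0.4, 0.5],
    }
)
subset = common_gpi_subset(values, max_nan_fraction=0.5)
```

In this example only GPIs 0, 3 and 4 remain, and `DJF` is kept as an all-NaN column. Note that the valid `DJF` value at GPI 4 is discarded along with the rest of the window, which illustrates the trade-off of this approach.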

coveralls · 2024-10-17T15:59:20Z

Pull Request Test Coverage Report for Build 11388336978

Details

36 of 37 (97.3%) changed or added relevant lines in 3 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.2%) to 81.405%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
src/qa4sm_reader/plotter.py	32	33	96.97%

Totals
Change from base Build 11274863906:	0.2%
Covered Lines:	2386
Relevant Lines:	2801

💛 - Coveralls

wpreimes · 2024-10-18T11:07:40Z

you can merge this from my side if you think this is the correct approach. Generally, I wouldn't cherry-pick the results for the plots, because it makes it difficult to interpret the results. Plots should just visualize the results as they are in the netcdf file, people should be able to make their own versions of the plots from the results. If the coverage differs over the year, that should also be reflected in the plots (that's why we report the number of points with the boxes usually and have a separate nobs boxplot). For stability I guess it could make sense, in order to compute a meaningful slope, but then it probably should already be done as part of the metric calculation, rather than the visualisation.

nfb2021 · 2024-10-18T11:48:24Z

> you can merge this from my side if you think this is the correct approach. Generally, I wouldn't cherry-pick the results for the plots, because it makes it difficult to interpret the results. Plots should just visualize the results as they are in the netcdf file, people should be able to make their own versions of the plots from the results. If the coverage differs over the year, that should also be reflected in the plots (that's why we report the number of points with the boxes usually and have a separate nobs boxplot). For stability I guess it could make sense, in order to compute a meaningful slope, but then it probably should already be done as part of the metric calculation, rather than the visualisation.

I get your point and partially agree. It does facilitate a direct comparison, though, if the same GPIs are guaranteed to be used throughout, even though good data from other GPIs will be disregarded for some temporal sub-windows.

This approach will be revisited for the stability metrics anyway.

nfb2021 added 8 commits October 17, 2024 15:26

comparison boxplots are now created over the exact same gpis for all tsw

8cf468f

comparison boxplots are now created over the exact same gpis for all tsw

d569339

moved naming of comparison boxplot outdir to globals

9a2fa03

changed pandas.DataFrame.applymap to pandas.DataFrame.map, as the former will be deprecated

b910328

corrected spelling

1b95790

corrected title creation of comparison boxplots and undid DataFrame.applymap to DataFrame.map

45bf703

reverted to original formatting

c9817f2

reverted to DataFrame.applymap

234da86
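
For context on the `applymap`/`map` commits above: pandas deprecated `DataFrame.applymap` in version 2.1.0 in favour of `DataFrame.map`, which does not exist in older releases. One way to support both is a small wrapper like the one below. This is a sketch only; the helper `elementwise` is hypothetical and not part of this PR, which ends up keeping `DataFrame.applymap`.

```python
import pandas as pd


def elementwise(df: pd.DataFrame, func):
    """Hypothetical helper: apply func to every cell on old and new pandas."""
    if hasattr(pd.DataFrame, "map"):
        # pandas >= 2.1.0 provides DataFrame.map
        return df.map(func)
    # Older pandas only provides DataFrame.applymap
    return df.applymap(func)


frame = pd.DataFrame({"a": [1, 2], "b": [3, 4]})
doubled = elementwise(frame, lambda x: x * 2)
```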

nfb2021 requested review from wpreimes and daberer October 17, 2024 15:39

nfb2021 merged commit 650946f into awst-austria:master Oct 18, 2024
5 checks passed
