Create a visualisation approach for the cross validation folds #48

JulianBiesheuvel · 2024-08-09T18:13:49Z

Previously, when we used fixed date ranges for the summer and winter periods, it was easy to show in a bar chart how many seasonal and annual samples were available in each fold. With variable date ranges this is no longer possible, so another method should be developed to show this to the user.

khsjursen · 2024-08-10T09:04:01Z

You mean this figure?

Maybe a solution could be to plot the distribution of time ranges in training and validation in each fold (e.g. as a box plot)? And a count of which months are covered by the data? A measurement with from/to dates of 10 Oct to 20 May would roughly cover Oct, Nov, Dec, Jan, Feb, Mar, Apr and May, and would count towards each of these months. A measurement of annual mb would count towards all months.

The first plot would illustrate how aggregated the data is while the second would illustrate how well winter vs. summer seasons are represented.
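A minimal sketch of the two summaries described above, assuming each measurement comes with a start and an end date. The names `months_covered` and `fold_summary` are hypothetical and not part of the repository, and the example dates are made up for illustration.

```python
from collections import Counter
from datetime import date


def months_covered(start: date, end: date) -> list[int]:
    """Return the calendar months (1-12) touched by the period start..end."""
    if end < start:
        raise ValueError("end date precedes start date")
    months = set()
    year, month = start.year, start.month
    # Walk month by month from the start month to the end month, inclusive.
    while (year, month) <= (end.year, end.month):
        months.add(month)
        month += 1
        if month > 12:
            year, month = year + 1, 1
    # Using a set means a period longer than a year counts each month once.
    return sorted(months)


def fold_summary(periods):
    """Summarise one split (training or validation) of one fold.

    periods: iterable of (start, end) date pairs, one per measurement.
    Returns (month_counts, durations), where month_counts maps a month
    number to the number of measurements touching it and durations is
    the length of each measurement period in days.
    """
    month_counts = Counter()
    durations = []
    for start, end in periods:
        month_counts.update(months_covered(start, end))
        durations.append((end - start).days)
    return month_counts, durations


# Illustrative (made-up) training split with one winter, one summer
# and one annual measurement.
train = [
    (date(2019, 10, 10), date(2020, 5, 20)),  # winter
    (date(2020, 5, 20), date(2020, 9, 25)),   # summer
    (date(2019, 10, 1), date(2020, 9, 30)),   # annual
]
counts, durations = fold_summary(train)
```

Per fold, `durations` could feed the box plot of time ranges and `counts` a bar chart of months covered, drawn separately for training and validation. Note that this sketch counts a month as covered even if the period touches it for a single day, which matches the "roughly cover" idea but may overstate coverage at the edges.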

JulianBiesheuvel added the "enhancement" and "low priority" labels on Aug 9, 2024
