Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a visualisation approach for the cross validation folds #48

Open
JulianBiesheuvel opened this issue Aug 9, 2024 · 1 comment
Labels
enhancement New feature or request low priority

Comments

@JulianBiesheuvel
Copy link
Collaborator

Previously, when we used fixed date ranges for the summer and winter periods, it was easy to visualise with a bar diagram how many samples of each season and annually were available in each fold. Now, with variable date ranges this is not possible anymore. Another method should be developed to show this to the user.

@JulianBiesheuvel JulianBiesheuvel added enhancement New feature or request low priority labels Aug 9, 2024
@khsjursen
Copy link
Collaborator

You mean this figure?
image

Maybe a solution could be to plot the distribution of time ranges in training and validation in each fold (e.g. as box plot)? And a count of which months covered by the data? A measurement with to/from date 10 Oct to 20 May would roughly cover Oct, Nov, Dec, Jan, Feb, Mar, Apr, May, and count towards these months. A measurement of annual mb would count towards all months.

The first plot would illustrate how aggregated the data is while the second would illustrate how well winter vs. summer seasons are represented.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request low priority
Projects
None yet
Development

No branches or pull requests

2 participants