I38 generic var templates #40

QSparks · 2024-10-11T20:55:01Z

Description

This PR introduces new classes and functionality to handle generic scalar and vector climate variables, allowing users to compute statistics for a broader range of climate data.

Generic Scalar and Vector Classes

Added ClimdexGenericScalar and ClimdexGenericVector classes for scalar variables (e.g., humidity, snow depth) and vector variables (e.g., wind speed and direction).
Both classes support raw data input and reading from CSV files.

Scalar Data Handling

compute.stat.scalar() function to compute statistics (mean, max, min, sum, sd, var) for scalar data.
Support for exact date calculations for max and min statistics.
Functions to read scalar data from CSV (climdexGenericScalar.csv) and construct raw scalar objects (climdexGenericScalar.raw).

Vector Data Handling

compute.stat.vector() function for vector data, supporting statistics: max, min, mean, sum, circular_mean, sd, and circular_sd.
Supports formats: "polar", "cartesian", and "cardinal".
Direction-based filtering using direction.range.
Functions to read vector data from CSV (climdexGenericVector.csv) and construct raw vector objects (climdexGenericVector.raw).

Circular Statistics Support

Added compute_circular_mean() and compute_circular_sd() for directional data (e.g., wind direction), using the circular package.

Data Filtering by Direction

Added filter_by_direction_range() to filter vector data based on direction range.

Utility Functions

Conversion functions:
- convert_cartesian_to_polar()
- convert_polar_to_cartesian()
- convert_degrees_to_cardinal()
  for vector data in different coordinate formats.

Tests Added

Unit tests for scalar and vector statistics:
- Scalar statistics: mean, sum, sd, var, max, min, and exact date support.
- Vector statistics for magnitude and direction, including circular_mean and circular_sd.
- Direction-based filtering and support for different vector formats.
- NA handling.

This commit adds the 'as.df' parameter to relevant functions, allowing users to choose whether results should be returned as data frames, including the exact date, or start and end dates of climate extremes. Additionally, utility functions like 'ymd.dates' and 'exact.date' are introduced for date handling.

rod-glover

Great work, Quintin! The code is very clean and clear.

The tests look solid, a lot of obvious cases covered, but I have not dug in to think through all the non-obvious ones. Are there any that occur to you after the fact?

A kind of generic question: Is there a prettifier/formatter for R? It seems possible it might be useful to standardize the code format, as I see a few indentation mistakes/inconsistencies and the like. (For application in a separate PR, I would think.)

R/generic_stats.R

rod-glover · 2024-10-28T18:31:55Z

R/generic_stats.R

+#' If `include.exact.dates = TRUE`, exact dates are returned for max/min statistics.
+#' The function can also filter the data based on a specified degree range of directions (using `direction.range`).
+#'
+#' @seealso \code{\link{compute.stat.scalar}}, \code{\link{compute.gen.stat}}


It's possible something might be a bit wonky with the docs ... this doesn't look as if it corresponds to what's in the doc PDF you sent me, see pp 64-65 of that document.

Would you mind providing a few more details on this? If it pertains to section reordering, that behavior is expected from roxygen.

Everything looks right down to the Details section bottom of p. 64. Then text reading

Value A list containing the computed statistic for magnitude and, if applicable, the computed statistic for direction. For circular statistics (e.g., circular_mean, circular_sd), the result is returned for directions in degrees. If include.exact.dates = TRUE, the function returns exact dates for the max/min statistics. Note This function is designed for vector climate data, where the data includes both a magnitude and direction component. For scalar data, use compute.stat.scalar instead.

appears to be inserted before the See Also section on p. 65, where everything appears to pick up normally again.

Maybe this is roxygen, but it doesn't semantically make sense to me.

Thank you for pointing out this discrepancy. I've adjusted the roxygen comments so that their ordering matches the generated PDFs.

Roxygen automatically reorders the documentation tags to match the base R standard, as mentioned here:

"Note also that the order in which tags appear in your roxygen comments (or even in handwritten .Rd files) does not dictate the order in rendered documentation. The order of presentation is determined by tooling within base R."

I looked for an official documentation order, and the closest I found was this Writing R Extensions manual, but it doesn't encompass the full list of available tags in roxygen.

QSparks · 2024-10-28T21:13:50Z

Great work, Quintin! The code is very clean and clear.

Thanks, Rod!

The tests look solid, a lot of obvious cases covered, but I have not dug in to think through all the non-obvious ones. Are there any that occur to you after the fact?

I will continue to think of test cases while I work on #39. They will likely be focused on input validation.

A kind of generic question: Is there a prettifier/formatter for R? It seems possible it might be useful to standardize the code format, as I see a few indentation mistakes/inconsistencies and the like. (For application in a separate PR, I would think.)

There are two complementary packages, styler and lintr. Styler will prettify and lintr ensures conformity with a specific style (the Tidyverse style) and can be added to our CI workflow. I'll open a new issue for this.

QSparks added 30 commits October 12, 2023 11:52

Add date factor for meteorological seasons

ce18316

Clean up comments

094eff5

Add seasonal frequency to applicable indices and docs.

3b7fabc

Add tests for seasonal date factor

1515a0c

Add rx5day and rx1day to tests

25a454b

Add NA tests for all clim vars & month-season case

439fe42

Document season definitions, increment version.

38d9b17

Undo updates to changelog and description.

e51ae3f

Break seasonal indices tests up and correct comments.

0df8567

Remove redundant quantile validity check

dce1e7b

Refactor seasonal tests and add related list constants

7c4de3b

Add R-CMD-Check workflow

2b4b5e3

Address i29 and R CMD Check warning

99362de

Run roxygenize before R CMD Check

8ed9fc1

Fix indentation in .yaml

61729d3

Build docs in R-CMD-Check job

5331c67

Add lifecycle to extra-packages

445ab55

Run cross-platform checks on PR only

cbd9cfa

Merge branch 'CI-workflow' into feature/output-extreme-event-timing

e7763fe

Add exact date tests for n or x, rxnday and spells

c041a5e

Add exact dates tests for GSL

3fa3744

Update expected.GSL for southern hemisphere and leap years

bf64d0e

Directly access non-exported climdex.pcic functions

d5b10e3

Add 'next' call in rxnday tests when expected.value is NA

e276175

Resolve 'cannot coerce class "PCICt" to a data.frame'

5ac63d3

Add NA checks for values in n or x tests

770e767

Clarify ‘include.exact.dates’ param, fix seasonal na mask

34bf591

End-of-year tests, rename as.df to include.exact.dates

43d5fb2

Use checkIdentical in place of checkEqualsNumeric

f405ef5

QSparks added 10 commits September 23, 2024 13:58

Fix examples, 100-character lines and cross-references

51a6089

Skip running examples, fix namask add generic stats

7d0b0bd

export gen stat helpers, add Roxygen docstrings

caa00b1

WIP: test generic stats

cbdb918

WIP test generic vector stats

cc4a51a

Add additional testing for vector stats

453c326

Add additional testing for generics

0c09443

Add tests for generic variable inputs

101f188

Add jdays to raw climdexGenericVector

420db45

Merge branch 'master' into i38-generic-var-templates

79467f6

QSparks self-assigned this Oct 11, 2024

QSparks linked an issue Oct 11, 2024 that may be closed by this pull request

Add generic templates to support additional scalar and vector climate variables #38

Open

QSparks added 10 commits October 11, 2024 14:00

Clean up docs

49514aa

Error message for improperly formatted max.missing.days

7078bcb

Add formulas to the details of the conversion functions

294925d

Fix warnings when calculating circular stats on NA sets

96860b0

Use season levels instead of names for na mask test

6a44213

Add validation for calendar types and csv data cols

362eb55

Gen Vec. Use case-insensitive format, remove name param

efa037d

Use date.factor levels as names for circular stats

cc8f5b4

Test vector raw-csv equality, bad calendar exception

1f23db8

Fix formatting

18fd16f

QSparks requested review from rod-glover and corviday October 28, 2024 15:12

QSparks marked this pull request as ready for review October 28, 2024 15:29

rod-glover approved these changes Oct 28, 2024

View reviewed changes

Fix typo in doc, add default param for compute gen stat

01cea17

QSparks mentioned this pull request Oct 28, 2024

Prettify #41

Open

Reorder roxygen comments to match docs

0d6f547

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

I38 generic var templates #40

I38 generic var templates #40

QSparks commented Oct 11, 2024

rod-glover left a comment

rod-glover Oct 28, 2024

QSparks Oct 28, 2024

rod-glover Oct 29, 2024

QSparks Oct 29, 2024

QSparks commented Oct 28, 2024

I38 generic var templates #40

Are you sure you want to change the base?

I38 generic var templates #40

Conversation

QSparks commented Oct 11, 2024

Description

Generic Scalar and Vector Classes

Scalar Data Handling

Vector Data Handling

Circular Statistics Support

Data Filtering by Direction

Utility Functions

Tests Added

rod-glover left a comment

Choose a reason for hiding this comment

rod-glover Oct 28, 2024

Choose a reason for hiding this comment

QSparks Oct 28, 2024

Choose a reason for hiding this comment

rod-glover Oct 29, 2024

Choose a reason for hiding this comment

QSparks Oct 29, 2024

Choose a reason for hiding this comment

QSparks commented Oct 28, 2024