Can CF offer better support for the representation of climate anomalies? #305
Replies: 10 comments 13 replies
-
Thanks for the great summary and for starting this discussion, @TomLav! In response to your questions:
I think your summary is an excellent starting point, I agree that we have identified a shortcoming, and I think that we should try to propose a solution for it here in this discussion.
I would like to move those questions forward as part of this discussion, because I fear that if we don't keep them in mind in trying to address the first issue, we may inadvertently cut ourselves off from a solution to the full set of issues. So, to summarize my last two posts: currently, the way we use (And once we start considering how to do that, it starts to look like maybe we shouldn't be thinking of climatology as something different and separate from a cell_method; they're both just aggregations.) |
Beta Was this translation helpful? Give feedback.
-
@davidhassell : you mentioned (here) you had some comments you would to make on this topic? |
Beta Was this translation helpful? Give feedback.
-
Hello. This discussion has not progressed, probably by lack of time before the summer and the summer break. From experience with past CF discussions, what is the best way to move such things forward? Ultimately we want to propose improvements to the CF convention (a new section?) to help better describe anomalies. Should this be done through online meetings? I am tagging here the people who showed interest so far: @davidhassell @sethmcg @JonathanGregory |
Beta Was this translation helpful? Give feedback.
-
Hello @TomLav Tuesday 17 September 16:00 - 17:30 CEST (UTC+2): Metadata for describing statistical processing It would be excellent if you were able to join this session (as well as the whole workshop of course). Moreover, if you have more concrete ideas that you would like to work on hackathon-style (maybe as an outcome of the conversation during the session) do not hesitate to suggest a hackathon theme in this discussion topic. |
Beta Was this translation helpful? Give feedback.
-
I have some interest in this topic. From scanning the discussions so far it seems like this could get pretty complicated! My initial thoughts are that there are three things we need to capture:
|
Beta Was this translation helpful? Give feedback.
-
Here's a concrete (and fairly simple) use case, in case it helps with the discussion. I work with gridded datasets derived from in situ station observations. One example of a parameter we routinely generate is the monthly mean of daily maximum temperature. Our grids for individual months (e.g. July 2024) have A couple of things occured to me while writing this: The fact that these data are computed from daily maximum temperatures is obviously a key part of the description. However it is not actually a processing step that data providers such as myself would ever undertake. It is either a characteristic of the observation itself (in the case of liquid-in-glass thermometers) or it is a function of the data logger (in the case of electrical resistance thermometers). Secondly, in our case both the gridding process and the calculation of the anomaly come after the aggregation steps described in the cell methods have been applied to the station data. However, other approaches are equally valid e.g. creating a separate grid for each day from the station maxima and then creating the monthly grid by averaging the daily grids. In this case 'gridding' comes between |
Beta Was this translation helpful? Give feedback.
-
For the use case of maximum temperature anomaly, what standard name are you using for the anomaly (there is the |
Beta Was this translation helpful? Give feedback.
-
@DanHollis -- Thanks for those comments! Those are very helpful examples and really clarified for me what it is we are trying to accomplish here. The reason why we need to record the sequence of transformation applied to a variable is not to record the provenance in a way that lets you perfectly recreate the workflow that generated the file, but to record the chain of derivation that changes the meaning of the data. So the point is to have something that will let you clearly distinguish between things like "climatological mean of the monthly maximum of tmin" and "climatological minimum of the monthly mean of tmax" that can be confusing or ambiguous under the current representation scheme that uses only That's a very helpful reduction in scope, because it dramatically simplifies what we need to record, and I think makes it far more achievable to come up with something that is both human- and machine-readable. For something like a bias-correction transformation, we don't need to record all the details that would let you reproduce it, just that bias correction has happened. Ancillary information like (type of) algorithm, observational dataset, and reference period are important, but we don't need things like implementation details, library versions, parameters, etc. Great! I think that's doable. |
Beta Was this translation helpful? Give feedback.
-
Dear all Unfortunately I haven't had time to think about this issue in the months since it began. From reading through it just now, I agree with the scope described by @sethmcg and supported by @DanHollis: The reason why we need to record the sequence of transformations applied to a variable is not to record the provenance in a way that lets you perfectly recreate the workflow that generated the file, but to record the chain of derivation that changes the meaning of the data. But this is the aim of Unfortunately in Sect 7.3 we don't state what the purpose of cell methods is! Essentially, it is to record how the value in the cell relates to the variation of data from which it was computed at a different resolution and over other dimensions. As in this discussion, an important feature of Best wishes Jonathan |
Beta Was this translation helpful? Give feedback.
-
Dear @TomLav I hear your pressing need for a solution to a specific problem. Please could you remind me (us) of the use-cases which you need to be able to handle now, for which the current convention is insufficient? Maybe we can devise something limited and adequate. Dear @sethmcg I would be willing to join in with reviewing, and if necessary expanding, generalising or overhauling If it turns out that Best wishes Jonathan |
Beta Was this translation helpful? Give feedback.
-
Topic for discussion
We continue the discussion started as issue #252. [It's now
vocabularies
issue 70, but you can still reference it asdiscuss
issue 252.]A summary of the discussion so far is in this post.
Tagging @JonathanGregory, @davidhassell and @sethmcg.
[Discussion #372 may be relevant. It was opened by @larsbarring in 2019 as issue #197 in the conventions repository, and I have converted it to a Discussion. Jonathan]
Beta Was this translation helpful? Give feedback.
All reactions