-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add quantile maps #30
Comments
To go into further detail on the two methodologies proposed: Old CIE methodology for the maps This is used to calculate the maps in the current CIE. The map show the median value of the indicator per gridpoint in a scenario and year and they are produced by ... This could be generalized to all percentiles by calculating the general percentile of the warming level in the scenario and year from the MAGICC distribution in (1) (instead of taking the median per default) and then calculating the percentile of the pooled maps in (3) (instead of taking the median per default) Advantages: Can be easily preprocessed for the CIE (e.g. by providing median and 95th percentile maps for all available warming levels and loading the map for the correct warming level whenever someone looks at a map) Methodology consistent with the rimeX emulator for time series In this method, we would calculate the quantile for each grid point the same way as for the aggregated time series (a gridpoint-wise weighted quantile weighted by the probability of their warming level in the scenario and time). Advantages: Consistent with the rimeX emulator for time series, uses all available information, mathematically correct grid point quantile |
How useful are these quantile maps? (in my mind they are not very useful...). I guess we could provide an approximation anyway. That's what I'd do for a reasonably accurate calculation:
Now if we use a relatively coarse grid so that we stay under 1e6 points overall, that could be reasonably fast. Though whether that would be fast enough for a server calculation, I don't know -- can you specify some kind of requirement in terms of memory usage and execution time ? A rougher approximation would be to compute the percentile due solely to the temperature forcing (using the median map), and another due solely to the model uncertainty (using a quantile maps like you describe above), and combine them by summing the squares of deviations from the median. So this requries computing the median map as well obviously. Three maps and the result = |
I have added a new The first iteration is on the server: A new iteration is coming using data relative to the projection baseline. (Mia provides several version, "abs", "rel", "diff" ... we provide only the default for each indicator) |
I just reduced the reference warming levels to |
Preliminary tests indicate that the number of warming levels does not really matter for the computation itself. However what mattered was the very inefficent import numpy as np
from scipy.interpolate import RegularGridInterpolator
def fast_quantile(a, quantiles, dim=None):
quantiles = np.asarray(quantiles)
if np.isscalar(quantiles):
a = a.reduce(np.percentile, quantiles*100, dim=dim)
else:
# "percentile" is orders of magnitude faster than "quantile"
a_np = np.percentile(a.values, quantiles*100, axis=a.dims.index(dim))
a = xa.DataArray(a_np,
coords=[quantiles] + [a.coords[c] for c in a.dims if c != dim],
dims=["quantile"] + [c for c in a.dims if c != dim])
return a
def make_prediction(a, gmt, years, samples, seed=42, quantiles=[0.5, .05, .95]):
interp = RegularGridInterpolator((a.warming_level.values, a.coords["quantile"].values), a.values, bounds_error=False)
rng = np.random.default_rng(seed=seed)
igmt = rng.integers(0, gmt.columns.size, size=samples)
gmt_decadal_resampled = gmt.loc[years].iloc[:, igmt]
iquantiles = rng.integers(0, a.coords["quantile"].size, size=gmt_decadal_resampled.shape)
sampled_maps = interp((gmt_decadal_resampled.values, a.coords["quantile"].values[iquantiles]))
sampled_maps = xa.DataArray(sampled_maps, coords=[
gmt_decadal_resampled.index, np.arange(samples), a.lat, a.lon], dims=["year", "sample", "lat", "lon"])
if quantiles is not None:
sampled_maps = fast_quantile(sampled_maps, quantiles, dim="sample")
return sampled_maps
sampled_maps = make_prediction(a, gmt, years=[2080], samples=100) |
I noticed some models have horizontal bands in the upper latitudes (canesm5, ipsl, uk). Probably a result of cdo remapbil interpolation from a non-regular grid (below SSP5-8.5 in 2080) This also shows up in the resulting maps: I guess I'll leave it for now that's something one might want to solve at a later point. |
Quoting @NiklasSchwind here
The text was updated successfully, but these errors were encountered: