Store data in uint16 #66

loliverhennigh · 2023-12-13T20:57:37Z

Hey,

Totally amazing project. Wanted to ask about compression/uint16 storage. When I get lat/lon ERA5 data from the CDS api I see the data is compressed and stored in uint16 along with scale factors to convert back to float32. When I look at the dataset here it seems to be in the full float32. Any reason not to store in the original uint16? Saves tremendous amounts of bandwidth and when using ERA5 in ML workflows this can make a big difference.

shoyer · 2024-06-14T16:13:30Z

This is definitely worth investigating. I know that we didn't do this for some 3D fields because the scale/offset factors that ECMWF uses differ for the same variable at different vertical levels in the atmosphere.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Store data in uint16 #66

Store data in uint16 #66

loliverhennigh commented Dec 13, 2023

shoyer commented Jun 14, 2024

Store data in uint16 #66

Store data in uint16 #66

Comments

loliverhennigh commented Dec 13, 2023

shoyer commented Jun 14, 2024