Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Store data in uint16 #66

Open
loliverhennigh opened this issue Dec 13, 2023 · 1 comment
Open

Store data in uint16 #66

loliverhennigh opened this issue Dec 13, 2023 · 1 comment

Comments

@loliverhennigh
Copy link

Hey,

Totally amazing project. Wanted to ask about compression/uint16 storage. When I get lat/lon ERA5 data from the CDS api I see the data is compressed and stored in uint16 along with scale factors to convert back to float32. When I look at the dataset here it seems to be in the full float32. Any reason not to store in the original uint16? Saves tremendous amounts of bandwidth and when using ERA5 in ML workflows this can make a big difference.

@shoyer
Copy link
Collaborator

shoyer commented Jun 14, 2024

This is definitely worth investigating. I know that we didn't do this for some 3D fields because the scale/offset factors that ECMWF uses differ for the same variable at different vertical levels in the atmosphere.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants