DOC: Include instructions on how to access the datasets used for benchmarks #52

mgrover1 · 2024-03-20T19:53:11Z

To ensure reproducibility, instructions should be added on how to access datasets exclusively on LLNL resources, as with this example

https://github.com/xCDAT/xcdat-validation/blob/main/scripts/performance-benchmarks/perf_benchmark.py

This prevents full reproducibility of the results

Related to openjournals/joss-reviews#6426

tomvothecoder · 2024-03-21T20:27:43Z

Hey @pochedls, are all of these datasets available on ESGF? If they are, I'll check if Globus links are available to make it easier to download the larger datasets.

Also are the XML files publicly available somewhere so that non-LLNL users can open multi-file datasets with cdms2? Otherwise, we need to figure out another way to open multi-file datasets using cdms2.

pochedls · 2024-03-21T23:52:40Z

Hey @pochedls, are all of these datasets available on ESGF? If they are, I'll check if Globus links are available to make it easier to download the larger datasets.

Also are the XML files publicly available somewhere so that non-LLNL users can open multi-file datasets with cdms2? Otherwise, we need to figure out another way to open multi-file datasets using cdms2.

These datasets are all on ESGF.

XML files are usually created locally (to map to your local files on disk). Once the data is in place, you can create xml files with cdscan -x file.xml /path/to/dataset/ (pretty sure that is the syntax).

tomvothecoder · 2024-03-25T16:06:05Z

Thanks @pochedls. I'll need to update the instructions for the script so that non-LLNL users can reproduce the XMLs.

mgrover1 mentioned this issue Mar 20, 2024

[REVIEW]: xCDAT: A Python package for simple climate data analysis on structured grids openjournals/joss-reviews#6426

Closed

tomvothecoder self-assigned this Mar 21, 2024

tomvothecoder added the documentation Improvements or additions to documentation label Mar 21, 2024

tomvothecoder added this to xCDAT Development Mar 21, 2024

github-project-automation bot moved this to Todo in xCDAT Development Mar 21, 2024

tomvothecoder moved this from Todo to In Progress in xCDAT Development Mar 21, 2024

tomvothecoder mentioned this issue Mar 25, 2024

Update performance benchmark for reproducibility #54

Merged

7 tasks

tomvothecoder closed this as completed in #54 Apr 8, 2024

github-project-automation bot moved this from In Progress to Done in xCDAT Development Apr 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC: Include instructions on how to access the datasets used for benchmarks #52

DOC: Include instructions on how to access the datasets used for benchmarks #52

mgrover1 commented Mar 20, 2024 •

edited

Loading

tomvothecoder commented Mar 21, 2024

pochedls commented Mar 21, 2024

tomvothecoder commented Mar 25, 2024

DOC: Include instructions on how to access the datasets used for benchmarks #52

DOC: Include instructions on how to access the datasets used for benchmarks #52

Comments

mgrover1 commented Mar 20, 2024 • edited Loading

tomvothecoder commented Mar 21, 2024

pochedls commented Mar 21, 2024

tomvothecoder commented Mar 25, 2024

mgrover1 commented Mar 20, 2024 •

edited

Loading