directory search order for CMIP data sets #3162
Comments
If the person in charge of administering the CMIP5 data on JASMIN does not have time to remove the corrupted file before the tutorial, I would recommend creating a copy of the recipe specifically for the tutorial and changing it to use a different dataset.
@valeriupredoi - I am posting the update from the CEDA help desk: It seems that the
That version also does not exist on any other ESGF sites, so there is nowhere to get a good copy from. Also, this dataset is marked as Instead, the version called
Looking inside the So the full path to the areacella file you need is:
In that case, you may be able to solve the problem by specifying the version in the recipe. Can you try replacing

    datasets:
      - {dataset: bcc-csm1-1, project: CMIP5, exp: historical, ensemble: r1i1p1}

by

    datasets:
      - {dataset: bcc-csm1-1, version: v1, project: CMIP5, exp: historical, ensemble: r1i1p1}
Hi @bouweandela - This works too. Thanks! However, in the future, if a file on a machine gets corrupted or is empty for whatever reason, should we have the option of choosing our local copy? Can we force ESMValTool to pick the downloaded copy, or is that not a desirable feature/option to have?
The idea of being able to blacklist files that are somehow wrong but cannot be controlled by the user has been discussed before, though I cannot find the issue now. It seems useful; maybe it could be implemented as part of a wider effort to improve how the data finding is configured. See ESMValGroup/ESMValCore#1894 (comment) for a proposal and some previous discussion.
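The blacklist idea above could be sketched roughly as follows. This is only an illustration of the concept, not an existing ESMValCore feature or API: the function `drop_excluded`, the pattern syntax, and all paths are hypothetical.

```python
from fnmatch import fnmatchcase


def drop_excluded(candidates, exclude_patterns):
    """Keep candidate paths that match none of the shell-style exclude patterns."""
    return [
        path
        for path in candidates
        if not any(fnmatchcase(path, pattern) for pattern in exclude_patterns)
    ]


# Hypothetical paths: one copy on a shared archive, one in a personal download directory.
candidates = [
    "/archive/cmip5/bcc-csm1-1/historical/r1i1p1/areacella.nc",
    "/home/user/download/bcc-csm1-1/historical/r1i1p1/areacella.nc",
]

# Hypothetical user-supplied pattern marking the archive copy as unusable.
exclude = ["/archive/*/bcc-csm1-1/*/areacella.nc"]

usable = drop_excluded(candidates, exclude)
print(usable)
```

With a filter like this applied before the first match is chosen, a known-bad file on a shared archive would be skipped and the next copy in the search order would be used instead.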
Hi,
I was testing the recipe recipe_python.yml for the proposed ESMValTool tutorial this week on the JASMIN HPC and noticed that one of the datasets necessary for this recipe is possibly corrupted on CEDA's badc archive. I have reported this to CEDA but have a question on how to work around this error. Normally, if we find issues with the data, we download the data we need and run our recipes. However, in this case, because the file exists on the badc archive, ESMValTool repeatedly finds that file on badc first, instead of the one I downloaded, and quits with an error message.
The error message is:
The following changes to the CMIP5 path on JASMIN in the config-user.yml file will work around the error:
I could not add both my download directory and the badc directory to the list because they have different path formats: ESGF and BADC.
I suspect that this happens because the JASMIN path is searched by default. If the specified file is not found there, esmvaltool will look in the download directory, but if the file exists and is corrupted, it will not look further. Is my understanding correct, and is this the only workaround? The tutorial is on Thursday, and it would be good to know how best to handle this by then if possible.
Tagging both @valeriupredoi and @bouweandela for a response due to the time constraint. Apologies for the rush.
Thanks!
-Ranjini
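
The search behaviour suspected in the post above can be illustrated with a minimal Python sketch. This is not ESMValCore's actual code: `find_first`, the directory names, and the file name are hypothetical, and the sketch only assumes a first-match-wins search over an ordered list of root directories.

```python
import tempfile
from pathlib import Path


def find_first(roots, relative_path):
    """Return the first existing copy of relative_path, searching roots in order."""
    for root in roots:
        candidate = Path(root) / relative_path
        if candidate.is_file():
            # First match wins; the file's contents are never inspected here.
            return candidate
    return None


with tempfile.TemporaryDirectory() as tmp:
    archive = Path(tmp) / "archive"    # stands in for a shared archive
    download = Path(tmp) / "download"  # stands in for a personal download directory
    for root in (archive, download):
        root.mkdir()

    (archive / "areacella.nc").write_bytes(b"")            # empty, unusable copy
    (download / "areacella.nc").write_bytes(b"good copy")  # placeholder content

    # The archive is searched first, so its unusable copy shadows the good one.
    found = find_first([archive, download], "areacella.nc")
    found_in = found.parent.name
    print(found_in)

    # Reversing the search order selects the downloaded copy instead.
    swapped_in = find_first([download, archive], "areacella.nc").parent.name
    print(swapped_in)
```

Under that assumption, a file that exists but is corrupted still counts as found, which would explain why the downloaded copy is never reached.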