-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sporadic (file size?)-related 404 errors from ERDDAP datasets when running IOOS 1.2 test #804
Comments
You should be able to use the previously mentioned .ncCF URL at the end here as it returns |
@benjwadams For ease-of-use, I suspect users will want to provide the root ERDDAP dataset URLs without extensions/query parameters in most cases (I know I will). If it works with user-provided extensions/query params, that's great and a good workaround, but I'd still like to find a solution for the simplest use case that works. What would the best choice for filtering with a root dataset URL? Good point about not depending on Options:
Some of the datasets I tested with were over 100 MB in size when downloaded in full, and I'm sure there are much larger. We need a sane default subset approach to make this option viable. |
I would personally probably go for option 2) possibly combined with some of the axis finding logic already present in compliance-checker. |
Ok, works for me. Please go ahead with that approach at your earliest convenience. |
Related: #807 |
After several profiling runs for the WQB-04 dataset, I'm still perplexed as to why sporadic 404's occur. It may be due to a deeper interaction between the |
With recent changes to download
.ncCF
file format from ERDDAP URLs, I get occasional HTTP 404 responses, in particular for large datasets. Here's a PacIOOS example:Happens for other large(r) ERDDAP datasets as well.
Can we run all of our IOOS 1.2 checks if we don't request the full dataset worth of data, and instead filter by a recent slices of the time dimension?
ERDDAP includes the
max()
server-side function for this purpose. Request most recent 3 days of data.:Request the most recent time slice:
or, the same for this particular PacIOOS dataset:
This is something we'll probably need to fix before next RC is put out. Not sure whether it's best to grab the most recent slice or use something like
max(time)-7days
, or which of these will be least brittle. Thoughts?@daltonkell @benjwadams
The text was updated successfully, but these errors were encountered: