Truncated download results #645

tucotuco · 2017-12-17T12:09:21Z

From Scott Chamberlain...

"The user is using rvertnet::bigsearch - the interface to your download service. He was getting only 33K records (exactly that many, which makes it sound especially like a hard limit) for a search on class="Aves", while they are getting 210K records for class="Aves" + inst="UMMZ" . It seems that the first query should surely be a larger set of data than the second. So we're wondering if there's some kind of limit that is sometimes imposed, sometimes not. Because if it was always imposed, he would only get 33K for both of those queries."

tucotuco · 2017-12-17T12:09:29Z

There is a hard limit, but it is based on a Google Cloud Storage concatenation limit, which is 1024 files. We make files of 1000 records each and join them to make the final download file, so the effective limit with our current approach is 1,024,000 records. Our reasoning is that, for anything bigger, people should be using the snapshots to avoid excessive costs to us. We'd have to look back through the logs and Google Cloud Storage to see if we can figure out why the Aves query (which WOULD exceed that limit and so fail to give all desired records) stops at 33K records instead.
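For reference, the arithmetic behind that limit can be sketched as below. This is only an illustration of the numbers quoted in this comment (1024 files, 1000 records per file), not the actual download code; the function and constant names are hypothetical.

```python
import math

# Figures taken from the comment above; names are hypothetical.
MAX_COMPOSE_FILES = 1024   # concatenation limit cited above
RECORDS_PER_FILE = 1000    # records written to each intermediate file


def max_download_records(max_files=MAX_COMPOSE_FILES, per_file=RECORDS_PER_FILE):
    """Largest record count that fits in one concatenated download."""
    return max_files * per_file


def files_needed(n_records, per_file=RECORDS_PER_FILE):
    """Number of intermediate files a result set of n_records would need."""
    return math.ceil(n_records / per_file)


def exceeds_limit(n_records, max_files=MAX_COMPOSE_FILES, per_file=RECORDS_PER_FILE):
    """True if the result set cannot be joined into a single download."""
    return files_needed(n_records, per_file) > max_files


if __name__ == "__main__":
    print(max_download_records())   # upper bound on records per download
    print(files_needed(33_000))     # files for the truncated Aves result
    print(exceeds_limit(210_000))   # the Aves + UMMZ result size reported
```

By this arithmetic, 33K records would need only 33 files and 210K only 210, both far below 1024, so the concatenation limit alone does not account for a cutoff at exactly 33K.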

tucotuco added the API, bug, and Download labels on Dec 17, 2017

tucotuco self-assigned this Dec 17, 2017
