
Update batch regional download example #2

Merged: 3 commits from update-batch-download into master on Apr 18, 2024
Conversation

ansoncfit (Member) commented:

To use the new /all grid endpoint.
@ansoncfit ansoncfit requested a review from abyrd April 17, 2024 18:23
batch-result-download.ipynb (outdated):
"\n",
" # Save result to a local .zip file\n",
" with open('results/' + analysisId + '.zip', 'wb') as f:\n",
" for chunk in zipRequest.iter_content(chunk_size=128):\n",
abyrd (Member) commented on Apr 18, 2024:

Although the library's syntax implies this is operating on a stream of chunks (and the documentation at https://fr.python-requests.org/en/latest/api.html also seems to imply this is the point of chunk_size), I wonder if this may actually buffer the entire response in memory (by default, without stream=True, Requests downloads the whole body before iter_content ever runs). While that wouldn't be much of an issue when working with one geotiff at a time, it's probably not ideal to buffer an entire zip containing an unknown number of geotiffs. Commentary on Stack Overflow seems to echo this concern: https://stackoverflow.com/q/16694907/778449
But other commentary specifically says iter_content exists to solve the buffering problem:
https://stackoverflow.com/questions/46205586/why-to-use-iter-content-and-chunk-size-in-python-requests

There are also several edge cases that could occur when downloading and saving a large file, which might be better handled by a purpose-built "save a file" method instead of a simple loop over the content.

Side note: chunk_size is in bytes according to the Requests docs. 128 bytes is a very small chunk size for file IO. It is probably more efficient to work with larger sizes. But ideally a detail like this will be encapsulated in whatever method we use to save the file.

I will research better ways to handle this.
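
For illustration, here is a minimal sketch of one way to address these points: a small streaming-save helper. This is an assumption about a reasonable approach, not the fix ultimately adopted in the corresponding PR, and resultsUrl is a hypothetical placeholder for the download URL:

    import requests

    def save_zip(url, path, chunk_size=64 * 1024):
        # stream=True defers downloading the body, so iter_content yields
        # chunks as they arrive from the network instead of iterating over
        # a response that was already buffered in memory.
        with requests.get(url, stream=True) as response:
            response.raise_for_status()  # fail fast on an error status
            with open(path, 'wb') as f:
                # chunk_size is in bytes; 64 KiB is a more typical file-IO
                # granularity than 128 bytes.
                for chunk in response.iter_content(chunk_size=chunk_size):
                    f.write(chunk)

    save_zip(resultsUrl, 'results/' + analysisId + '.zip')  # hypothetical resultsUrl

Encapsulating the loop in one helper also gives a single place to handle the edge cases noted above (error statuses, interrupted transfers) and to tune the chunk size.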

A member replied:

Corresponding PR at #3.

@ansoncfit ansoncfit merged commit 01bfd61 into master Apr 18, 2024
@ansoncfit ansoncfit deleted the update-batch-download branch April 18, 2024 19:37