Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ENH: Allow Natural Earth version to be specified #2351

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

lgolston
Copy link
Contributor

Rationale

As requested in #2293, it would be useful for reproducibility to be able to specify what Natural Earth version to use, in addition to current default behavior of downloading the latest version.

Implications

This enhancement is added. If a version is specified, it will be downloaded to subfolders within cultural or physical corresponding to the version requested and then used.

Checklist

Testing: Since Natural Earth is already in several places in the test suite, I modified one of them to now use a specific version.

@rcomer
Copy link
Member

rcomer commented Mar 15, 2024

Do we also need something to expose the version option on NaturalEarthFeature, similar to how scale is handled?

@lgolston
Copy link
Contributor Author

Do we also need something to expose the version option on NaturalEarthFeature, similar to how scale is handled?

Yes, thanks. Version should definitely be an option there too but I wasn't tracking that feature - have added it in now.

@lgolston
Copy link
Contributor Author

Related, I have linked there to https://github.com/nvkelso/natural-earth-vector/releases as a good source for version information, because it was confusing that most of the versions shown on https://www.naturalearthdata.com/ (e.g. 4.0.0, 2.0.0, etc.) are not available on the AWS service to download, yet there are many newer versioned releases that are available which are not shown on https://www.naturalearthdata.com/.

@lgolston lgolston force-pushed the nat-earth-version branch from b5d5a72 to 1654f0a Compare March 17, 2024 02:05
@lgolston
Copy link
Contributor Author

CI helped pick up an omission of version in one place in NaturalEarthFeature - a clean version is now pushed with that edit.

@rcomer rcomer requested a review from kaedonkers March 17, 2024 09:16
@rcomer rcomer linked an issue Mar 17, 2024 that may be closed by this pull request
Copy link
Member

@kaedonkers kaedonkers left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

An elegant solution, thank you for implementing this!
I have not yet tested it myself, but it looks like a viable solution to me

@@ -37,7 +37,7 @@ def test_natural_earth():
@pytest.mark.mpl_image_compare(filename='natural_earth_custom.png')
def test_natural_earth_custom():
ax = plt.axes(projection=ccrs.PlateCarree())
feature = cfeature.NaturalEarthFeature('physical', 'coastline', '50m',
feature = cfeature.NaturalEarthFeature('physical', 'coastline', '50m', '5.1.0',
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did you find that version 5.1.0 successfully passed this test?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks! Yes, no diff between the baseline image here of Iceland and when using 5.1.0

@@ -83,7 +83,8 @@ class TestRivers:
def setup_class(self):
RIVERS_PATH = shp.natural_earth(resolution='110m',
category='physical',
name='rivers_lake_centerlines')
name='rivers_lake_centerlines',
version='5.0.0')
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similarly, did this specific version (5.0.0) of rivers pass the image comparison tests?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, it was passing. Note there is no image comparison here, the test just looks at a few very specific features and there was no change.

@rcomer rcomer self-requested a review April 26, 2024 16:22
Copy link
Contributor

@greglucas greglucas left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I didn't test this out yet, but would you be able to add some description to the docstrings about where the versions are cached/stored? I think they are stored separately looking at the code, but just want to make sure we aren't getting a mixture of versions in the default namespace. If the latest is 5.1.2 and I explicitly ask for 5.1.2 do we have to download/store both?

Comment on lines 410 to 393
_NE_URL_TEMPLATE = ('https://naturalearth.s3.amazonaws.com/'
'{version}/{resolution}_{category}/'
'ne_{resolution}_{name}.zip')

default_spec = ('shapefiles', 'natural_earth', '{category}', '{version}',
'ne_{resolution}_{name}.shp')
Copy link
Contributor

@greglucas greglucas Jun 8, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would you be able to inject this into the current NE template? It looks like the only difference is {version}/, and I'm wondering if you could put that in and always have a version with something like _version_string = "" if version is None else f"{version}/".

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I couldn't figure out how to inject version without making the second default downloader, but I think this solution works! Only problem is it obscures the template slightly but worth it to avoid duplicating the other code. For the other question, yes, if latest is 5.1.2 and then you specifically ask for 5.1.2 it will download it twice, since version-specified data will go in its own subfolder (within the relevant physical or cultural folder). Unfortunately, the .VERSION file is currently filtered out so it is hard to tell what version you have downloaded in the base folder.

@greglucas
Copy link
Contributor

ping @lgolston, I had a few questions on this, but it looks like it may be very close to ready to go?

@lgolston
Copy link
Contributor Author

ping @lgolston, I had a few questions on this, but it looks like it may be very close to ready to go?

Sorry for the long delay. I have made an update based on your review, however there is a test error I need to look into.

@@ -251,6 +253,8 @@ def __init__(self, category, name, scale, **kwargs):
The dataset scale, i.e. one of '10m', '50m', or '110m',
or Scaler object. Dataset scales correspond to 1:10,000,000,
1:50,000,000, and 1:110,000,000 respectively.
version: optional
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you also update the above cases to follow numpydoc formatting by adding : str to all of them.

Suggested change
version: optional
version : str, optional

@@ -299,10 +301,11 @@ def natural_earth(resolution='110m', category='physical', name='coastline'):
# get hold of the Downloader (typically a NEShpDownloader instance)
# which we can then simply call its path method to get the appropriate
# shapefile (it will download if necessary)
_version_string = "" if version is None else f"{version}/"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This doesn't need to be private if it is just used within the method.

Suggested change
_version_string = "" if version is None else f"{version}/"
version_string = "" if version is None else f"{version}/"

@@ -386,7 +389,7 @@ def default_downloader():
ne_{resolution}_{name}.shp

"""
default_spec = ('shapefiles', 'natural_earth', '{category}',
default_spec = ('shapefiles', 'natural_earth', '{category}', '{version}',
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like the doctest failure is related to the docstring right above this. Where it got a /{version}/ added in now. I actually don't think that should be included there now because it is path joining the version, but we don't want that if version is None, we want to leave off the /. I guess the question is does this need to be udpated since it is the default case (no version) or can we just leave it alone for now?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Natural Earth: Include version in URL
4 participants