Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

astrodata #181

Open
16 of 32 tasks
teald opened this issue May 13, 2024 · 10 comments
Open
16 of 32 tasks

astrodata #181

teald opened this issue May 13, 2024 · 10 comments
Assignees
Labels
4/reviews-in-awaiting-changes astropy An astropy community affiliated package review

Comments

@teald
Copy link

teald commented May 13, 2024

Submitting Author: D.J. Teal (@teald)
All current maintainers: @teald, @chris-simpson, @jehturner
Package Name: astrodata
One-Line Description of Package: Common interface for astronomical data products.
Repository Link: https://github.com/GeminiDRSoftware/astrodata
Version submitted: 2.9.2
Editor: @hamogu
Reviewer 1: @aaryapatil
Reviewer 2: @mwcraig
Archive: TBD
JOSS DOI: TBD
Version accepted: TBD
Date accepted (month/day/year): TBD


Code of Conduct & Commitment to Maintain Package

Description

astrodata is a package meant to facilitate developing common interfaces for astronomical data formats. Often, specific instruments and models will have different ways of storing their data, including metadata. astrodata offers a single interface that uses metadata to resolve these disparate file formats, while enabling common operations and values to share the same interface. The abstraction is meant to be conceptually simple and meaningful to scientists.

Previously, astrodata was a core module within the DRAGONS package, and has been used for the various instruments that exist at the Gemini Observatory. It has proved useful in consolidating differences in metadata and data formatting between instruments that produce FITS files, which is a common pattern in astronomical data handling. Alongside automating interface selection based on these differences, it also comes with helpful operators and methods out-of-the-box, by extending astropy's NDData class.

Scope

  • Please indicate which category or categories.
    Check out our package scope page to learn more about our
    scope. (If you are unsure of which category you fit, we suggest you make a pre-submission inquiry):

    • Data retrieval
    • Data extraction
    • Data processing/munging
    • Data deposition
    • Data validation and testing
    • Data visualization1
    • Workflow automation
    • Citation management and bibliometrics
    • Scientific software wrappers
    • Database interoperability

Domain Specific

  • Geospatial
  • Education

Community Partnerships

If your package is associated with an
existing community please check below:

  • For all submissions, explain how the and why the package falls under the categories you indicated above. In your explanation, please address the following points (briefly, 1-2 sentences for each):

    • Who is the target audience and what are scientific applications of this package?

      Astronomers & astronomical software developers

    • Are there other Python packages that accomplish the same thing? If so, how does yours differ?

      There are no specific packages we are aware of. There is some overlap with the gwcs package and our wcs module, but we are planning to collaborate with that package for future development and to reconsolidate those overlaps. Astrodata offers an interface based on astropy.nddata.NDData, but allowing more than one instance to be mapped to the same file (e.g., to multiple sets of FITS extensions).

    • If you made a pre-submission enquiry, please paste the link to the corresponding issue, forum post, or other discussion, or @tag the editor you contacted:

      N/A

Technical checks

For details about the pyOpenSci packaging requirements, see our packaging guide. Confirm each of the following by checking the box. This package:

  • does not violate the Terms of Service of any service it interacts with.
  • uses an OSI approved license.
  • contains a README with instructions for installing the development version.
  • includes documentation with examples for all functions.
  • contains a tutorial with examples of its essential functions and uses.
  • has a test suite.
  • has continuous integration setup, such as GitHub Actions CircleCI, and/or others.

Publication Options

JOSS Checks
  • The package has an obvious research application according to JOSS's definition in their submission requirements. Be aware that completing the pyOpenSci review process does not guarantee acceptance to JOSS. Be sure to read their submission requirements (linked above) if you are interested in submitting to JOSS.
  • The package is not a "minor utility" as defined by JOSS's submission requirements: "Minor ‘utility’ packages, including ‘thin’ API clients, are not acceptable." pyOpenSci welcomes these packages under "Data Retrieval", but JOSS has slightly different criteria.
  • The package contains a paper.md matching JOSS's requirements with a high-level description in the package root or in inst/.
  • The package is deposited in a long-term repository with the DOI:

Note: JOSS accepts our review as theirs. You will NOT need to go through another full review. JOSS will only review your paper.md file. Be sure to link to this pyOpenSci issue when a JOSS issue is opened for your package. Also be sure to tell the JOSS editor that this is a pyOpenSci reviewed package once you reach this step.

Are you OK with Reviewers Submitting Issues and/or pull requests to your Repo Directly?

This option will allow reviewers to open smaller issues that can then be linked to PR's rather than submitting a more dense text based review. It will also allow you to demonstrate addressing the issue via PR links.

  • Yes I am OK with reviewers submitting requested changes as issues to my repo. Reviewers will then link to the issues in their submitted review.

Confirm each of the following by checking the box.

  • I have read the author guide.
  • I expect to maintain this package for at least 2 years and can help find a replacement for the maintainer (team) if needed.

Please fill out our survey

P.S. Have feedback/comments about our review process? Leave a comment here

Editor and Review Templates

The editor template can be found here.

The review template can be found here.

Footnotes

  1. Please fill out a pre-submission inquiry before submitting a data visualization package.

@Batalex
Copy link
Contributor

Batalex commented May 26, 2024

Editor in Chief checks

Hi there! Thank you for submitting your package for pyOpenSci review. Below are the basic checks that your package needs to pass to begin our review. If some of these are missing, we will ask you to work on them before the review process begins.

Please check our Python packaging guide for more information on the elements below.

  • Installation The package can be installed from a community repository such as PyPI (preferred), and/or a community channel on conda (e.g. conda-forge, bioconda).
    • The package imports properly into a standard Python environment import package.
      The package installation does not install the dependencies
  • Fit The package meets criteria for fit and overlap.
  • Documentation The package has sufficient online documentation to allow us to evaluate package function and scope without installing the package. This includes:
    • User-facing documentation that overviews how to install and start using the package.
    • Short tutorials that help a user understand how to use the package and what it can do for them.
    • API documentation (documentation for your code's functions, classes, methods and attributes): this includes clearly written docstrings with variables defined using a standard docstring format.
  • Core GitHub repository Files
    • README The package has a README.md file with clear explanation of what the package does, instructions on how to install it, and a link to development instructions.
    • Contributing File The package has a CONTRIBUTING.md file that details how to install and contribute to the package.
    • Code of Conduct The package has a CODE_OF_CONDUCT.md file.
    • License The package has an OSI approved license.
      Seems like a BSD3 derivative is ok.

NOTE: We prefer that you have development instructions in your documentation too.

  • Issue Submission Documentation All of the information is filled out in the YAML header of the issue (located at the top of the issue template).
  • Automated tests Package has a testing suite and is tested via a Continuous Integration service.
  • Repository The repository link resolves correctly.
  • Package overlap The package doesn't entirely overlap with the functionality of other packages that have already been submitted to pyOpenSci.
  • Archive (JOSS only, may be post-review): The repository DOI resolves correctly.
  • Version (JOSS only, may be post-review): Does the release version given match the GitHub release (v1.0.0)?

  • Initial onboarding survey was filled out
    We appreciate each maintainer of the package filling out this survey individually. 🙌
    Thank you authors in advance for setting aside five to ten minutes to do this. It truly helps our organization. 🙌


Editor comments

Nothing to say from the EiC initial perspective, it's a strong submission!

@Batalex Batalex added astropy An astropy community affiliated package review and removed New Submission! labels May 26, 2024
@Batalex
Copy link
Contributor

Batalex commented Jun 4, 2024

Hey there @teald,

@hamogu will take over this submission as the editor 🫶 You are in good hands, happy reviewing y'all!

@hamogu
Copy link

hamogu commented Jun 5, 2024

Editor response to review:


Editor comments

👋 Hi @aaryapatil and @mwcraig! Thank you for volunteering to review
for pyOpenSci!

Please fill out our pre-review survey

Before beginning your review, please fill out our pre-review survey. This helps us improve all aspects of our review and better understand our community. No personal data will be shared from this survey - it will only be used in an aggregated format by our Executive Director to improve our processes and programs.

  • reviewer 1 survey completed.
  • reviewer 2 survey completed.
  • reviewer 3 (if applicable)

The following resources will help you complete your review:

  1. Here is the reviewers guide. This guide contains all of the steps and information needed to complete your review.
  2. Here is the review template that you will need to fill out and submit
    here as a comment, once your review is complete.

Please get in touch with any questions or concerns! Your review is due:

Reviewers: @aaryapatil @mwcraig
Due date: end of June

@hamogu
Copy link

hamogu commented Jun 6, 2024

@teald : @aaryapatil @mwcraig have agreed to review your package! Thanks! Links to the review guide and the template to get them started are in the post above; and of course they might open issues (or PRs!) in your repro if they have suggestions. In the meantime, feel free to reach out to me with any questions you might have.

@hamogu
Copy link

hamogu commented Jul 2, 2024

@aaryapatil @mwcraig : I remember you both told me that you would need a little longer than the "three weeks" that we usually allocate to the first round of review and with the holiday coming up in the US, I understand that you are probably busy with other things this week. However, if you can make a realistic estimate when you can find time to review, it would be great if you could post here, so that the package authors know where we are in the process. Thank you so much for agreeing to review this package!

@aaryapatil
Copy link

aaryapatil commented Jul 3, 2024

@hamogu Thank you for the ping. I am working on the review now but won't have much time this week. Realistically, I should have my review done by next week. Hope that is okay!

UPDATE: The rest of the review will be completed by July 16.

@mwcraig
Copy link

mwcraig commented Jul 3, 2024

@hamogu -- thanks for the reminder; I should be able to get it done by the end of the day on Monday, July 15.

@mwcraig
Copy link

mwcraig commented Jul 15, 2024

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

  • As the reviewer I confirm that there are no conflicts of interest for me to review this work (If you are unsure whether you are in conflict, please speak to your editor before starting your review).

Documentation

The package includes all the following forms of documentation:

  • A statement of need clearly stating problems the software is designed to solve and its target audience in README.
  • Installation instructions: for the development version of the package and any non-standard dependencies in README.
  • Vignette(s) demonstrating major functionality that runs successfully locally.
  • Function Documentation: for all user-facing functions.
  • 👉 Examples for all user-facing functions. 👈
    • I looked at the functions in the Common API for Users. None have a function in the docstring, but all but add_header_to_table are well described elsewhere in the documentation. The only mention of add_header_to_table was in the MEF documentation and doesn't include any explanation. I guess the main thing I'm confused about is whether add_header_to_table adds an empty header that the user then needs to fill or whether it adds already-existing data.
    • To be clear, I think the only case where this really needs to be addressed is add_header_to_table and it could be addressed by saying a little more in the documentation about it.
    • However, the current Examples documentation have no actual examples. That is an issue.
  • Community guidelines including contribution guidelines in the README or CONTRIBUTING. 🎉 really love the flowchart! \tada
  • 👉 Metadata including author(s), author e-mail(s), a url, and any other relevant metadata e.g., in a pyproject.toml file or elsewhere. 👈
    • Adding urls to pyproject.toml that points to the project home on github and to the documentation would be great. That would add a "Project links" section in pypi. See the astropy pyproject.toml for how to add the URLs

Readme file requirements
The package meets the readme requirements below:

  • Package has a README.md file in the root directory.

The README should include, from top to bottom:

  • The package name
  • Badges for:
    • Continuous integration and test coverage,
    • Docs building (if you have a documentation website),
    • 👉 A repostatus.org badge, 👈
    • 👉 Python versions supported, 👈
    • Current package version (on PyPI / Conda).

NOTE: If the README has many more badges, you might want to consider using a table for badges: see this example. Such a table should be more wide than high. (Note that the a badge for pyOpenSci peer-review will be provided upon acceptance.)

  • Short description of package goals.
  • Package installation instructions
  • Any additional setup required to use the package (authentication tokens, etc.)
  • Descriptive links to all vignettes. If the package is small, there may only be a need for one vignette which could be placed in the README.md file.
    • Brief demonstration of package usage (as it makes sense - links to vignettes could also suffice here if package description is clear)
  • Link to your documentation website.
  • If applicable, how the package compares to other similar packages and/or how it relates to other packages in the scientific ecosystem.
  • 👉 Citation information 👈 -- there is a CITATION.md, but consider adding a link to it in the README.md

Usability

Reviewers are encouraged to submit suggestions (or pull requests) that will improve the usability of the package as a whole.
Package structure should follow general community best-practices. In general please consider whether:

  • Package documentation is clear and easy to find and use.
  • The need for the package is clear
  • All functions have documentation and associated examples for use
  • The package is easy to install

Functionality

  • Installation: Installation succeeds as documented.
  • Functionality: Any functional claims of the software been confirmed.
  • Performance: Any performance claims of the software been confirmed.
  • Automated tests:
    • 👉 All tests pass on the reviewer's local machine for the package version submitted by the author. Ideally this should be a tagged version making it easy for reviewers to install. 👈
      • Three tests failed, but that may have eben because one of the downloads failed (I know gemini is having network issues today), even in a run where there were no download errors that I saw looks like these were due to connection failure, see traceback below:
        • tests/test_object_construction.py::test_append_array_to_root_no_name ERROR
        • tests/test_object_construction.py::test_append_array_to_root_with_name_sci ERROR
        • tests/test_object_construction.py::test_append_array_to_root_with_arbitrary_name ERROR
      • Clear instructions for how to run the tests would be great -- I did a copy/paste from the github action workflow and tox started doing its thing, but I would have liked to have restricted the tests to just python 3.11. tox tried to do several versions.
    • Tests cover essential functions of the package and a reasonable range of inputs and conditions.
  • Continuous Integration: Has continuous integration setup (We suggest using Github actions but any CI platform is acceptable for review)
  • Packaging guidelines: The package conforms to the pyOpenSci packaging guidelines.
    A few notable highlights to look at:
    • Package supports modern versions of Python and not End of life versions.
    • Code format is standard throughout package and follows PEP 8 guidelines (CI tests for linting pass)
      • 👉 I don't see a lint test on the CI. I gather from some code comments that linting is done with pylint. Running pylint astrodata on the source generates a number of errors, but these are almost all complaints about code complexity rather than format. More explicit instructions for running the lint test would be helpful, but the formatting is fine.

For packages also submitting to JOSS

Note: Be sure to check this carefully, as JOSS's submission requirements and scope differ from pyOpenSci's in terms of what types of packages are accepted.

The package contains a paper.md matching JOSS's requirements with:

  • A short summary describing the high-level functionality of the software
  • Authors: A list of authors with their affiliations
  • A statement of need clearly stating problems the software is designed to solve and its target audience.
  • References: With DOIs for all those that have one (e.g. papers, datasets, software).

Final approval (post-review)

  • The author has responded to my review and made changes to my satisfaction. I recommend approving this package.

Estimated hours spent reviewing:


Review Comments

@teald this is ready for a look

**Thanks for submitting this! 🎉 ** I am really excited about the potential of this package to fill some holes in the ecosystem. There are some details below in the "folded" sections that should be straightforward to address. Here are the high-level comments (background: I'm one of the maintainers for astropy.nddata and the sole maintainer for ccdproc):

  1. What is your conception of how this should interact with the rest of the ecosystem (thinking mostly about nddata and ccdproc here)?
  2. Does it make sense to upstream any of this (like the arithmentic handling or allowing for any WCS, not just an astropy.wcs) to astropy.nddata? Or to ccdproc?
  3. Does it make sense for ccdproc to depend on astrodata or try to integrate usage of astrodata into it? ccdproc has never had a good way of handling MEF files, which is faintly ridiculous (I'm the maintainer of ccdproc so I'm looking in the mirror rather throwing stones here).
  4. My take is that astrodata provides a way to abstract images and metadata from the underlying way they are stored, which is something that none of the current tools that I'm aware of provide. It may very well not make sense to upstream any of this.
  5. Would it be possible to provide a small example of how to develop a processing tool with astrodata that goes beyond just adding properties and tags? In otherwords, once I have done those things what does astrodata do for me? I'm not suggesting a full reduction pipeline here (DRAGONS does that) but something that shows a step or two of processing files using would be helpful.
Issues/PRs opened as part of review:
test error traceback
_____________________________ ERROR at setup of test_append_array_to_root_with_arbitrary_name ______________________________

filename = 'N20160524S0119.fits', path = '/Users/mattcraig/development/astronomy/astrodata/.tox/py311/_test_cache'
sub_path = 'raw_files', env_var = 'ASTRODATA_TEST', cache = True, fail_on_error = True

    def download_from_archive(
        filename,
        path=None,
        sub_path="raw_files",
        env_var="ASTRODATA_TEST",
        cache=True,
        fail_on_error=True,
    ):
        """Download file from the archive and store it in the local cache.

        Parameters
        ----------
        filename : str
            The filename, e.g. N20160524S0119.fits

        path : str or os.PathLike or None
            Path to the cache directory. If None, the environment variable
            ASTRODATA_TEST is used. otherwise, the file is saved to:
            os.path.join(path, sub_path, filename)

        sub_path : str
            By default the file is stored at the root of the cache directory, but
            using ``path`` allows to specify a sub-directory.

        env_var: str
            Environment variable containing the path to the cache directory.

        cache : bool
            If False, the file is downloaded and replaced in the cache directory.

        fail_on_error : bool
            If True, raise an error if the download fails. If False, return None.

        Returns
        -------
        str
            Name of the cached file with the path added to it.
        """
        # Handle None sub_path
        if sub_path is None:
            sub_path = ""
            warnings.warn(
                "sub_path is None, so the file will be saved to the root of the "
                "cache directory. To suppress this warning, set sub_path to a "
                "valid path (e.g., empty string)."
            )

        # Check that the environment variable is a valid name.
        if not isinstance(env_var, str) or not env_var.isidentifier():
            raise ValueError(f"Environment variable name is not valid: {env_var}")

        # Find cache path and make sure it exists
        root_cache_path = os.getenv(env_var)

        if root_cache_path is None:
            if path is not None:
                root_cache_path = os.path.expanduser(path)

            else:
                root_cache_path = os.path.join(os.getcwd(), "_test_cache")
                warnings.warn(
                    f"Environment variable not set: {env_var}, writing "
                    f"to {root_cache_path}. To suppress this warning, set "
                    f"the environment variable {env_var} to the desired path "
                    f"for the testing cache."
                )

                # This is cleaned up once the program finishes.
                os.environ[env_var] = str(root_cache_path)

        root_cache_path = os.path.expanduser(root_cache_path)

        if path is None:
            path = root_cache_path

        cache_path = os.path.join(os.path.expanduser(path), sub_path)

        if not os.path.exists(cache_path):
            os.makedirs(cache_path)

        # Now check if the local file exists and download if not
        try:
            local_path = os.path.join(cache_path, filename)
            url = GEMINI_ARCHIVE_URL + filename

            if cache and os.path.exists(local_path):
                # Use the cached file
                return local_path

>           tmp_path = download_file(url, cache=False)

astrodata/testing.py:479:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
.tox/py311/lib/python3.11/site-packages/astropy/utils/data.py:1529: in download_file
    f_name = _download_file_from_source(
.tox/py311/lib/python3.11/site-packages/astropy/utils/data.py:1313: in _download_file_from_source
    with _try_url_open(
.tox/py311/lib/python3.11/site-packages/astropy/utils/data.py:1227: in _try_url_open
    return urlopener.open(req, timeout=timeout)
../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/urllib/request.py:519: in open
    response = self._open(req, data)
../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/urllib/request.py:536: in _open
    result = self._call_chain(self.handle_open, protocol, protocol +
../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/urllib/request.py:496: in _call_chain
    result = func(*args)
../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/urllib/request.py:1391: in https_open
    return self.do_open(http.client.HTTPSConnection, req,
../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/urllib/request.py:1352: in do_open
    r = h.getresponse()
../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/http/client.py:1395: in getresponse
    response.begin()
../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/http/client.py:325: in begin
    version, status, reason = self._read_status()
../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/http/client.py:286: in _read_status
    line = str(self.fp.readline(_MAXLINE + 1), "iso-8859-1")
../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/socket.py:706: in readinto
    return self._sock.recv_into(b)
../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/ssl.py:1314: in recv_into
    return self.read(nbytes, buffer)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

self = <ssl.SSLSocket [closed] fd=-1, family=2, type=1, proto=0>, len = 8192, buffer = <memory at 0x139cdcdc0>

    def read(self, len=1024, buffer=None):
        """Read up to LEN bytes and return them.
        Return zero-length string on EOF."""

        self._checkClosed()
        if self._sslobj is None:
            raise ValueError("Read on closed or unwrapped SSL socket.")
        try:
            if buffer is not None:
>               return self._sslobj.read(len, buffer)
E               TimeoutError: The read operation timed out

../../../mambaforge/envs/astrodata-review-dev/lib/python3.11/ssl.py:1166: TimeoutError

The above exception was the direct cause of the following exception:

    @pytest.fixture
    def testfile2():
        """
        Pixels Extensions
        Index  Content                  Type              Dimensions     Format
        [ 0]   science                  NDAstroData       (4608, 1056)   uint16
        [ 1]   science                  NDAstroData       (4608, 1056)   uint16
        [ 2]   science                  NDAstroData       (4608, 1056)   uint16
        [ 3]   science                  NDAstroData       (4608, 1056)   uint16
        [ 4]   science                  NDAstroData       (4608, 1056)   uint16
        [ 5]   science                  NDAstroData       (4608, 1056)   uint16
        """
>       return download_from_archive("N20160524S0119.fits")

tests/test_object_construction.py:42:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

filename = 'N20160524S0119.fits', path = '/Users/mattcraig/development/astronomy/astrodata/.tox/py311/_test_cache'
sub_path = 'raw_files', env_var = 'ASTRODATA_TEST', cache = True, fail_on_error = True

    def download_from_archive(
        filename,
        path=None,
        sub_path="raw_files",
        env_var="ASTRODATA_TEST",
        cache=True,
        fail_on_error=True,
    ):
        """Download file from the archive and store it in the local cache.

        Parameters
        ----------
        filename : str
            The filename, e.g. N20160524S0119.fits

        path : str or os.PathLike or None
            Path to the cache directory. If None, the environment variable
            ASTRODATA_TEST is used. otherwise, the file is saved to:
            os.path.join(path, sub_path, filename)

        sub_path : str
            By default the file is stored at the root of the cache directory, but
            using ``path`` allows to specify a sub-directory.

        env_var: str
            Environment variable containing the path to the cache directory.

        cache : bool
            If False, the file is downloaded and replaced in the cache directory.

        fail_on_error : bool
            If True, raise an error if the download fails. If False, return None.

        Returns
        -------
        str
            Name of the cached file with the path added to it.
        """
        # Handle None sub_path
        if sub_path is None:
            sub_path = ""
            warnings.warn(
                "sub_path is None, so the file will be saved to the root of the "
                "cache directory. To suppress this warning, set sub_path to a "
                "valid path (e.g., empty string)."
            )

        # Check that the environment variable is a valid name.
        if not isinstance(env_var, str) or not env_var.isidentifier():
            raise ValueError(f"Environment variable name is not valid: {env_var}")

        # Find cache path and make sure it exists
        root_cache_path = os.getenv(env_var)

        if root_cache_path is None:
            if path is not None:
                root_cache_path = os.path.expanduser(path)

            else:
                root_cache_path = os.path.join(os.getcwd(), "_test_cache")
                warnings.warn(
                    f"Environment variable not set: {env_var}, writing "
                    f"to {root_cache_path}. To suppress this warning, set "
                    f"the environment variable {env_var} to the desired path "
                    f"for the testing cache."
                )

                # This is cleaned up once the program finishes.
                os.environ[env_var] = str(root_cache_path)

        root_cache_path = os.path.expanduser(root_cache_path)

        if path is None:
            path = root_cache_path

        cache_path = os.path.join(os.path.expanduser(path), sub_path)

        if not os.path.exists(cache_path):
            os.makedirs(cache_path)

        # Now check if the local file exists and download if not
        try:
            local_path = os.path.join(cache_path, filename)
            url = GEMINI_ARCHIVE_URL + filename

            if cache and os.path.exists(local_path):
                # Use the cached file
                return local_path

            tmp_path = download_file(url, cache=False)

            shutil.move(tmp_path, local_path)

            # `download_file` ignores Access Control List - fixing it
            os.chmod(local_path, 0o664)

        except Exception as err:
            if not fail_on_error:
                log.debug(f"Failed to download {filename} from the archive")
                log.debug(f" - Error: {err}")
                return None

>           raise IOError(
                f"Failed to download {filename} from the archive ({url})"
            ) from err
E           OSError: Failed to download N20160524S0119.fits from the archive (https://archive.gemini.edu/file/N20160524S0119.fits)

astrodata/testing.py:492: OSError

@aaryapatil
Copy link

aaryapatil commented Jul 15, 2024

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

  • As the reviewer I confirm that there are no conflicts of interest for me to review this work (If you are unsure whether you are in conflict, please speak to your editor before starting your review).

Documentation

The package includes all the following forms of documentation:

  • A statement of need clearly stating problems the software is designed to solve and its target audience in README.
  • Installation instructions: for the development version of the package and any non-standard dependencies in README.
  • Vignette(s) demonstrating major functionality that runs successfully locally.
  • Function Documentation: for all user-facing functions.
  • Examples for all user-facing functions.
    • ☝️The only available examples are those using Gemini, which requires installing DRAGONS and its dependencies. Filling in the User and/or Developer examples would be useful, especially with more examples of classes that inherit from astrodata. You could consider expanding the manual example here. Also refer to issue 33.
  • Community guidelines including contribution guidelines in the README or CONTRIBUTING.
  • Metadata including author(s), author e-mail(s), a url, and any other relevant metadata e.g., in a pyproject.toml file or elsewhere.
    • ☝️ It would be good to include url(s) for the project in pyproject.toml. Refer to the packaging guide here.

Readme file requirements
The package meets the readme requirements below:

  • Package has a README.md file in the root directory.

The README should include, from top to bottom:

  • The package name
  • Badges for:
    • Continuous integration and test coverage,
    • Docs building (if you have a documentation website),
    • A repostatus.org badge,
      • ☝️ It would be useful to include this badge
    • Python versions supported,
      • ☝️ It would be useful to include this badge, especially for quick installations in environments.
    • Current package version (on PyPI / Conda).

NOTE: If the README has many more badges, you might want to consider using a table for badges: see this example. Such a table should be more wide than high. (Note that the a badge for pyOpenSci peer-review will be provided upon acceptance.)

  • Short description of package goals.
  • Package installation instructions
  • Any additional setup required to use the package (authentication tokens, etc.)
  • Descriptive links to all vignettes. If the package is small, there may only be a need for one vignette which could be placed in the README.md file.
    • Brief demonstration of package usage (as it makes sense - links to vignettes could also suffice here if package description is clear)
  • Link to your documentation website.
  • If applicable, how the package compares to other similar packages and/or how it relates to other packages in the scientific ecosystem.
    • ☝️ For a first time reader going through the README, it might be worthwhile to expand upon the utility of astrodata beyond using astropy.io.fits alone. The current text only briefly mentions the extendability of astrodata.
  • Citation information
    • ☝️ CITATION.md is present but hasn't been referenced in the README.md. The README.md is the first point of entry into the project GitHub, so adding a citation there is important.

Usability

Reviewers are encouraged to submit suggestions (or pull requests) that will improve the usability of the package as a whole.
Package structure should follow general community best-practices. In general please consider whether:

  • Package documentation is clear and easy to find and use.
  • The need for the package is clear
  • All functions have documentation and associated examples for use
  • The package is easy to install

Functionality

  • Installation: Installation succeeds as documented.
    • Remove "." following poetry install in quickstart (fixed)
  • Functionality: Any functional claims of the software been confirmed.
  • Performance: Any performance claims of the software been confirmed.
  • Automated tests:
    • All tests pass on the reviewer's local machine for the package version submitted by the author. Ideally this should be a tagged version making it easy for reviewers to install.
      • ☝️Running tests with poetry run tox -rp (as suggested in the quickstart) works well, but skips the coverage, probably because a specific version of python wasn't installed. Traceback is available below. It would be useful to add more information for testing a specific version of python.
    • Tests cover essential functions of the package and a reasonable range of inputs and conditions.
  • Continuous Integration: Has continuous integration setup (We suggest using Github actions but any CI platform is acceptable for review)
  • Packaging guidelines: The package conforms to the pyOpenSci packaging guidelines.
    A few notable highlights to look at:
    • Package supports modern versions of Python and not End of life versions.
    • Code format is standard throughout package and follows PEP 8 guidelines (CI tests for linting pass)

For packages also submitting to JOSS

Note: Be sure to check this carefully, as JOSS's submission requirements and scope differ from pyOpenSci's in terms of what types of packages are accepted.

The package contains a paper.md matching JOSS's requirements with:

  • A short summary describing the high-level functionality of the software
  • Authors: A list of authors with their affiliations
  • A statement of need clearly stating problems the software is designed to solve and its target audience.
  • References: With DOIs for all those that have one (e.g. papers, datasets, software).

Final approval (post-review)

  • The author has responded to my review and made changes to my satisfaction. I recommend approving this package.

Estimated hours spent reviewing: 10


Review Comments

Thank you for submitting this package @teald. I have completed my first pass, so you can go through it now. The package is already in great shape! If there are any questions, let me know. I will also be in touch in case something else comes up. I am looking forward to further using astrodata.

Pull requests opened as part of review:

Poetry test run
py312: SKIP ⚠ in 0.02 seconds
py310: SKIP ⚠ in 0.02 seconds
py311: OK ✔ in 2 minutes 36.38 seconds
py310: SKIP (0.02 seconds)
py311: OK (156.38=setup[12.41]+cmd[0.00,4.36,139.60] seconds)
py312: SKIP (0.02 seconds)
coverage: SKIP (0.02 seconds)
congratulations :) (156.44 seconds)

@teald
Copy link
Author

teald commented Jul 22, 2024

Thank you both for your reviews! This feedback has been incredibly useful. I will continue working on these points and will respond to more general comments later this week or next week.

Update August 6: things have been taking a bit longer than anticipated. I'm hoping to have this wrapped up & a response ~this upcoming Monday (Aug. 12)

Update August 15: there have been some setbacks, I will just respond when this is finished. Hopefully soon 🤞 without further issues. Thanks for your patience!

Update September 6: We are waiting on feedback from a collaborator/contributor on the reviewer response, and should have it ready next week :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
4/reviews-in-awaiting-changes astropy An astropy community affiliated package review
Projects
Status: under-review
Development

No branches or pull requests

5 participants