Improvements to the HST/STIS data loaders #1233

sean-lockwood · 2025-05-02T14:19:01Z

Echelle datasets

HST/STIS can have multivalued wavelengths where echelle orders overlap. Simply sorting by wavelength will interleave high- and low-SNR data near the edges of these orders. Rather than solving the complex task of combining orders into a single 1D spectrum, we created a second loader, "HST/STIS multi", to handle loading individual orders into rows of a SpectrumCollection object.

Orders (each with independent wavelength solutions) overlap, so simply sorting by wavelength and combining all orders leads to non-physical Δλ. Here's an example showing the overlaps:

The edges of orders have especially variable throughput and SNR (mainly due to the blaze function shape), so interleaving in the overlap regions would be non-physical:

Multi-read datasets

Additionally, some STIS datasets contain multiple reads (i.e. when REPEATOBS > 1, or CRSPLIT > 1 and CRCORR="OMIT"). These reads are stored as additional FITS extensions. Any multi-read dataset also produces a SpectrumCollection object via the "HST/STIS multi" reader. The old reader used only the first extension.

Metadata

We supplemented the meta information with the relevant SCI extension header, as well as the scalar table values describing the 1D extraction parameters for each order.

Data masking

We used the STIS data quality (DQ) array, masked via bitwise-and with an SDQFLAGS (serious DQ flags) scalar value to produce a boolean mask. STIS DQ values are integers, allowing the user to choose which DQ flags are relevant to their analysis via the selection of the SDQFLAGS value. By default, this value is read from the relevant SCI extension header. We also provided the capability for users to override this value by specifying "sdqflags=" to the STIS data readers.

Caveat about echelle wavelength ordering

Note that while the wavelengths of any individual STIS echelle order are in ascending ordered, the order of the wavelengths across echelle orders is descending (corresponding to increasing SPORDER number). This was chosen to remain consistent with the source data.

Flux units

While "Unit('erg/cm**2 Angstrom s')" appears to evaluate as intended, we used a more explicit formulation.

Identifiers

We modified the identifiers to not raise an exception upon the non-detection of a FITS header keyword.

Testing

Updated HST/STIS test_loaders tests.
New test data covering different file dimensionality are available at:
https://zenodo.org/records/15320389

Updated `CHANGES.rst`

Echelle datasets: STIS can have multivalued wavelengths where echelle orders overlap. Simply sorting by wavelength will interleave high- and low-SNR data near the edges of these orders. Rather than solving the complex task of combining orders into a single 1D spectrum, we created a second loader, "HST/STIS multi", to handle loading individual orders into rows of a SpectrumCollection object. Multi-read datasets: Additionally, some STIS datasets contain multiple reads (i.e. when REPEATOBS > 1 or OCRREJECT="OMIT"). These reads are stored as additional FITS extensions. Any multi-read dataset also produces a SpectrumCollection object via the "HST/STIS multi" reader. Metadata: We supplemented the meta information with the relevant SCI extension header, as well as the scalar table values describing the 1D extraction parameters for each order. Data masking: We used the STIS data quality (DQ) array, masked via bitwise-and with an SDQFLAGS (serious DQ flags) scalar value to produce a boolean mask. STIS DQ values are integers, allowing the user to choose which DQ flags are relevant to their analysis via the selection of the SDQFLAGS value. By default, this value is read from the relevant SCI extension header. We also provided the capability for users to override this value by specifying "sdqflags=<int>" to the STIS data readers. Caveat about echelle wavelength ordering: Note that while the wavelengths of any individual STIS echelle order are in ascending ordered, the order of the wavelengths across echelle orders is descending (corresponding to increasing SPORDER number). This was chosen to remain consistent with the source data. Flux units: While "Unit('erg/cm**2 Angstrom s')" appears to evaluate as intended, we used a more explicit formulation. Identifiers: We modified the identifiers to not raise an exception upon the non-detection of a FITS header keyword.

New test data covering different file dimensionality are available at: https://zenodo.org/records/15320389

sean-lockwood · 2025-05-02T14:30:37Z

Docs

We should probably also update the docs for "Working With SpectrumCollections" page
to include this list of readers as done with Spectrum1D (SpectrumCollection.read.list_formats()).

specutils/io/default_loaders/hst_stis.py

…ader'.

sean-lockwood and others added 4 commits May 1, 2025 18:08

Updated HST/STIS test_loaders tests.

5b37c01

New test data covering different file dimensionality are available at: https://zenodo.org/records/15320389

Merge branch 'astropy:main' into stis-loader-update

cbf20c4

Added "HST/STIS" loaders update to CHANGES.rst

7317d59

sean-lockwood requested review from eteq, rosteen and keflavich as code owners May 2, 2025 14:19

sean-lockwood commented May 2, 2025

View reviewed changes

specutils/io/default_loaders/hst_stis.py Outdated Show resolved Hide resolved

sean-lockwood and others added 3 commits May 7, 2025 13:32

Merge branch 'main' into stis-loader-update

7117586

Added PR astropy#1233 reference to changelog.

dbe5967

Changed hst_stis loader to rename primary FITS header in metadata 'he…

4b16a51

…ader'.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Improvements to the HST/STIS data loaders #1233

Improvements to the HST/STIS data loaders #1233

sean-lockwood commented May 2, 2025 •

edited

Loading

Uh oh!

sean-lockwood commented May 2, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Improvements to the HST/STIS data loaders #1233

Are you sure you want to change the base?

Improvements to the HST/STIS data loaders #1233

Conversation

sean-lockwood commented May 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Echelle datasets

Multi-read datasets

Metadata

Data masking

Caveat about echelle wavelength ordering

Flux units

Identifiers

Testing

Updated CHANGES.rst

Uh oh!

sean-lockwood commented May 2, 2025

Docs

Uh oh!

Uh oh!

Uh oh!

sean-lockwood commented May 2, 2025 •

edited

Loading

Updated `CHANGES.rst`