Output JSON sidecar files after processing a la BIDS #3394

jcohenadad · 2021-05-18T15:06:14Z

In the context processing pipeline and databases, in order to keep track of the software and version used to generate derivatives, it would be useful if SCT would output a JSON sidecar (following BIDS convention). E.g.:

sct_deepseg_sc -i -FILE.nii -c t2

Would output:

FILE_seg.nii
FILE_seg.json

And the content of the JSON could be the following (inspired by this link):

{
  "Name": "FMRIPREP Outputs",
  "BIDSVersion": "1.4.0",
  "DatasetType": "derivative",
  "GeneratedBy": [
    {
      "Name": "sct",
      "Version": "5.3.0",
    },
    {
      "Author": "John Doe",
      "Description": "Manual correction of spinal cord segmentation",
      "Date": "2021-05-10 10:10:28",
    }
  ],
  "SourceDatasets": [
    {
      "DOI": "XXX",
      "URL": "data.neuro.polymtl.ca:datasets/uk-biobank-processed",
      "Version": "1.0.1"
    }
  ]
}

Another useful link: https://hackmd.io/@effigies/bids-derivatives-readme

The text was updated successfully, but these errors were encountered:

valosekj · 2023-06-27T22:39:53Z

We came across this point again with @tzebre and @jcohenadad when discussing if segmentations produced by the nnU-Net framework should be accompanied by JSON sidecar (see discussion here).

Since this point is periodically repeated from time to time (#2807, #4023), I am increasing its priority.

joshuacwnewton · 2023-07-07T15:28:11Z

At first I thought that this might be possible to do using our compat/launcher.py file, since this file contains a set of steps that are run for every available script. I was envisioning some very generic code -- a barebones json file containing the SCT command, SCT version, and date, which launcher.py would definitely have access to. This would let us avoid repeating ourselves in every script.

But, there are two main issues with this:

The output folder varies from script to script, and also varies at runtime via the -o/-ofolder options. (Related: #4070). This is an issue because launcher.py can't access the resulting output folder from outside the script.
We have SCT CLI scripts that we don't want to generate sidecar files for, namely ones that don't generate an output image (e.g. sct_version, sct_compute_hausdorff_distance, etc.).

In theory, we could fix both of these problems by propagating the output files/output directory back to launcher.py after the script completes. (And, for scripts that we don't want to generate a sidecar for, maybe we could return None?)

Aside: Specifying return values for main functions would have added benefits for our test suite, since we could be explicit with which files are generated by each script, and use that to more easily validate (and clean up) output files.

However, fetching a return value would require us to change how launcher.py is written (since currently we're using a subprocess):

spinalcordtoolbox/spinalcordtoolbox/compat/launcher.py

Line 47 in e740edf

return subprocess.run(cmd, env=env).returncode

But, maybe this would be a welcome change? Does using a subprocess incur a performance hit right now? ((Changing the subprocess behavior might also let us move the init_sct() function call inside the wrapper, too.))

Otherwise, if these changes are too broad in scope, then we could just as easily make a simple function and call it at the end of each script, too. I guess it just depends on how badly we want a within-process wrapper for executing code before and after each CLI script? 🤔

mguaypaq · 2023-07-07T15:40:06Z

My first thoughts are that:

Indeed it's not every script that should generate a sidecar. (And possibly some scripts produce multiple images, in which case there should be a sidecar for each?) So, the mechanism has to be conditional.
Yes, the launcher is missing some of the information (most importantly, output location), so maybe it shouldn't have the responsibility for generating the file. But, maybe it's a good place to generate some of the contents of the sidecar?
On the other hand, I think the main upside for generating the sidecar in the launcher is that it's such a strong nudge for script writers.
Some other ideas to explore:
- Is this sidecar functionality similar to the QC report functionality?
- Or maybe the code for generating a sidecar should live close to the image saving code?

jcohenadad · 2023-07-07T15:56:41Z

On the other hand, I think the main upside for generating the sidecar in the launcher is that it's such a strong nudge for script writers.

@mguaypaq I'm not sure I understand that argument. Are you referring to users who write SHELL scripts that consist of a sequence of SCT commands to process their data (eg: batch_processing.sh)? If so, I don't see how generating the sidecar in the launcher (vs. in individual SCT's CLI scripts) will encourage people to use scripts (because for them, it will be opaque where the sidecars are actually generated).

Overall, this discussion is related to the long "-o/-ofolder" debate. I am wondering if we could think of a -obids flag, that would output things in the derivatives according to BIDS. And if it is on, it would generate the sidecar. Then, we could implement the -obids flag in the appropriate CLIs, and so we would not worry about the launcher generating sidecars for scripts like sct_version (on which -obids would not be implemented).

mguaypaq · 2023-07-07T18:06:08Z

@mguaypaq I'm not sure I understand that argument. Are you referring to users who write SHELL scripts that consist of a sequence of SCT commands to process their data (eg: batch_processing.sh)?

Sorry for being unclear, I meant writers of new sct_* commands (most likely, students or postdocs at NeuroPoly working on SCT itself). So, if the "template" for new sct_* commands asked for sidecar information by default (for example, returned by main() as @joshuacwnewton suggested), whoever programs it would necessarily have to consider sidecars.

joshuacwnewton · 2023-07-18T17:55:46Z

Small update: I'm going to start experimenting with a launcher.py prototype today. :)

joshuacwnewton · 2024-03-11T16:05:51Z

In our 2024-02-21 SCT Meeting, we discussed JSON sidecars and how they should be implemented.

We came to the following conclusions:

The main motivation for sidecar files comes from the manual-correction pipeline. Meaning, sidecar generation is most useful for tracking the provenance of segmentation files that may be manually corrected in the future. (More context in this comment.)
So, this feature doesn't necessarily need to be implemented for every SCT script that generates output. Instead, we can start with sct_propseg/sct_deepseg_sc/sct_deepseg, then expand the functionality if need be.

This should limit the scope of this feature and make it much more straightforward to implement, as we only need to ensure compatibility with manual-correction for now. :)

jcohenadad mentioned this issue May 18, 2021

Start tagging releases of dataset so we can include that information in the derivatives sct-pipeline/ukbiobank-spinalcord-csa#70

Open

joshuacwnewton mentioned this issue Feb 1, 2023

Feature Request: sct_deepseg_lesion - output optional json file to allow for bids-compliant processing? #4023

Closed

joshuacwnewton changed the title ~~Output JSON file after processing~~ Output JSON sidecar files after processing a la BIDS Feb 1, 2023

joshuacwnewton mentioned this issue Feb 1, 2023

Output json sidecars a la BIDS from various functions #2807

Closed

valosekj added the priority:HIGH label Jun 27, 2023

valosekj mentioned this issue Jun 27, 2023

json creation when converting from nnUNetV2 to BIDS ivadomed/utilities#20

Open

joshuacwnewton assigned mguaypaq, joshuacwnewton and valosekj Jul 7, 2023

joshuacwnewton added this to the 6.1 milestone Jul 7, 2023

jcohenadad mentioned this issue Jul 10, 2023

Allow other suffixes than -manual spinalcordtoolbox/manual-correction#48

Closed

valosekj mentioned this issue Jul 14, 2023

Discuss JSON sidecar usage for correcting/verifying segmentations spinalcordtoolbox/manual-correction#34

Closed

joshuacwnewton mentioned this issue Jul 19, 2023

Add prototype for generating JSON sidecar files using launcher.py #4164

Closed

mguaypaq modified the milestones: 6.1, 6.2 Sep 7, 2023

valosekj mentioned this issue Sep 15, 2023

New dataset marseille-3T-mp2rage neuropoly/data-management#260

Closed

valosekj mentioned this issue Jan 25, 2024

Track algorithm/model used to correct a label spinalcordtoolbox/manual-correction#75

Open

mguaypaq added priority:MEDIUM and removed priority:HIGH labels Feb 12, 2024

mguaypaq modified the milestones: 6.2, 6.3, 6.4 Feb 12, 2024

valosekj mentioned this issue Feb 21, 2024

Allow to specify custom metadata to be included in JSON sidecars spinalcordtoolbox/manual-correction#80

Merged

jcohenadad mentioned this issue May 1, 2024

Indicate versions of sct_deepseg models #4465

Closed

joshuacwnewton mentioned this issue May 2, 2024

Track sct_deepseg model provenance with source.json (in model folder) and JSON sidecar (in output) #4466

Merged

joshuacwnewton unassigned mguaypaq and valosekj May 3, 2024

mguaypaq closed this as completed in #4466 May 14, 2024

mguaypaq closed this as completed in 3b5a746 May 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Output JSON sidecar files after processing a la BIDS #3394

Output JSON sidecar files after processing a la BIDS #3394

jcohenadad commented May 18, 2021 •

edited

Loading

valosekj commented Jun 27, 2023

joshuacwnewton commented Jul 7, 2023

mguaypaq commented Jul 7, 2023

jcohenadad commented Jul 7, 2023 •

edited

Loading

mguaypaq commented Jul 7, 2023

joshuacwnewton commented Jul 18, 2023

joshuacwnewton commented Mar 11, 2024

Output JSON sidecar files after processing a la BIDS #3394

Output JSON sidecar files after processing a la BIDS #3394

Comments

jcohenadad commented May 18, 2021 • edited Loading

valosekj commented Jun 27, 2023

joshuacwnewton commented Jul 7, 2023

mguaypaq commented Jul 7, 2023

jcohenadad commented Jul 7, 2023 • edited Loading

mguaypaq commented Jul 7, 2023

joshuacwnewton commented Jul 18, 2023

joshuacwnewton commented Mar 11, 2024

jcohenadad commented May 18, 2021 •

edited

Loading

jcohenadad commented Jul 7, 2023 •

edited

Loading