DM-40392: Add methods to update run collection name in a quantum #882

andy-slac · 2023-08-17T23:03:14Z

QuantumGraph.updateRun() method needs a better implementation and these new methods in DatasetRef and Quantum classes are for supporting that implementation.

Checklist

ran Jenkins
added a release note for user-visible changes to doc/changes

timj · 2023-08-17T23:10:18Z

The ruff problem comes from ruff 0.285 with astral-sh/ruff#6465

timj · 2023-08-17T23:13:50Z

python/lsst/daf/butler/core/datasets/ref.py

@@ -706,11 +706,48 @@ def overrideStorageClass(self, storageClass: str | StorageClass) -> DatasetRef:
            A new dataset reference that is the same as the current one but
            with a different storage class in the `DatasetType`.
        """
+        return self.update(storage_class=storageClass)
+
+    def update(


Maybe replace() to match the name used in dataclasses? update doesn't immediately suggest that a new version is being returned. I called the equivalent ResourcePath.replace for that reason.

timj · 2023-08-17T23:30:17Z

python/lsst/daf/butler/core/dimensions/_records.py

@@ -190,7 +190,7 @@ def direct(
        # This method requires tuples as values of the mapping, but JSON
        # readers will read things in as lists. Be kind and transparently
        # transform to tuples
-        _recItems = {k: v if type(v) != list else tuple(v) for k, v in record.items()}  # type: ignore
+        _recItems = {k: v if not isinstance(v, list) else tuple(v) for k, v in record.items()}  # type: ignore


I think we deliberately used type() here for speed since we know it can only be a list coming from JSON and we need this to be fast. We don't want to worry about subclasses of list here.

Latest ruff was complaining about it, should I just suppress this?

And why don't we always use tuple(v)? tuple(v) will return v if it's already a tuple.

Hmm. That might be a better answer. Let the tuple() constructor deal with it.

Ah, not all items are supposed to be tuples there, sorry for the noise.

Right, we only want lists to migrate. I think adding a noqa will work but you'll also have to adjust pyproject.toml to stop ruff complaining about unused noqa -- the rubin-env ruff used in Jenkins will not trigger an error and will instead get upset by the unused noqa.

codecov · 2023-08-18T05:08:24Z

Codecov Report

Patch coverage: 100.00% and no project coverage change.

Comparison is base (60b7293) 87.70% compared to head (cda19fe) 87.71%.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #882   +/-   ##
=======================================
  Coverage   87.70%   87.71%           
=======================================
  Files         274      274           
  Lines       36195    36219   +24     
  Branches     7575     7578    +3     
=======================================
+ Hits        31746    31770   +24     
  Misses       3272     3272           
  Partials     1177     1177

Files Changed	Coverage Δ
python/lsst/daf/butler/core/datasets/ref.py	`80.86% <100.00%> (+0.77%)`	⬆️
tests/test_datasets.py	`100.00% <100.00%> (ø)`

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

timj

Please add trivial test for update_output_run and update_input_run.

If these two methods have to be called in exactly the right way and solely from QuantumGraph.updateRun maybe they should be private methods in the sense that middleware sometimes knows it can call private methods in different packages but we don't want them public? Else it begs the question why isn't there a quantum.update_run that combines these two methods?

timj · 2023-08-18T17:58:50Z

python/lsst/daf/butler/core/datasets/ref.py

+        Returns
+        -------
+        modified : `DatasetRef`
+            A new dataset reference with updated attributes.


If all 3 parameters are None do you want to return self instead?

I could but I do not want an additional test for all None, I don't think it's going to used with no parameters.

timj · 2023-08-18T18:07:46Z

python/lsst/daf/butler/core/quantum.py

+            refs: Iterable[DatasetRef], run: str, dataset_id_map: Mapping[DatasetId, DatasetId]
+        ) -> Iterator[DatasetRef]:
+            for ref in refs:
+                if dataset_id := dataset_id_map.get(ref.id):


Add comment that this is for intermediates only and if it's not already in dataset_id_map the ref won't be modified?

andy-slac · 2023-08-18T19:08:44Z

I agree that those two new methods in Quantum should be considered private. Maybe I need to move both of them to Quantumgraph as that is the only place that needs them, let me try that.

andy-slac · 2023-08-18T20:38:54Z

I moved that private code to QuantumGraph.updateRun, does not look too terrible.

QuantumGraph.updateRun() method needs a better implementation and these new methods in DatasetRef and Quantum classes are for supporting that implementation.

timj reviewed Aug 17, 2023

View reviewed changes

andy-slac force-pushed the tickets/DM-40392 branch from ccf6e2c to 763189d Compare August 17, 2023 23:25

timj reviewed Aug 17, 2023

View reviewed changes

andy-slac force-pushed the tickets/DM-40392 branch from 763189d to a99425a Compare August 17, 2023 23:50

timj mentioned this pull request Aug 17, 2023

DM-39514: Fix misleading doc for Registry.queryDatasetAssociations. #883

Merged

andy-slac force-pushed the tickets/DM-40392 branch 2 times, most recently from 7be0a4c to 4cf9cb8 Compare August 18, 2023 04:55

andy-slac force-pushed the tickets/DM-40392 branch from 4cf9cb8 to 6d29504 Compare August 18, 2023 16:07

This was referenced Aug 18, 2023

DM-40392: Update unit test after fixing QuantumGraph.updateRun lsst/ctrl_mpexec#261

Merged

DM-40392: Re-implement QuantumGraph.updateRun method lsst/pipe_base#369

Merged

timj approved these changes Aug 18, 2023

View reviewed changes

andy-slac force-pushed the tickets/DM-40392 branch 2 times, most recently from 89362da to a900423 Compare August 18, 2023 20:37

andy-slac force-pushed the tickets/DM-40392 branch from a900423 to 4424314 Compare August 18, 2023 21:08

Add methods to update run collection name in a quantum (DM-40392)

cda19fe

QuantumGraph.updateRun() method needs a better implementation and these new methods in DatasetRef and Quantum classes are for supporting that implementation.

andy-slac force-pushed the tickets/DM-40392 branch from 4424314 to cda19fe Compare August 18, 2023 23:40

andy-slac merged commit c732061 into main Aug 19, 2023
16 checks passed

andy-slac deleted the tickets/DM-40392 branch August 19, 2023 02:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DM-40392: Add methods to update run collection name in a quantum #882

DM-40392: Add methods to update run collection name in a quantum #882

andy-slac commented Aug 17, 2023 •

edited

Loading

timj commented Aug 17, 2023

timj Aug 17, 2023

timj Aug 17, 2023

andy-slac Aug 17, 2023

andy-slac Aug 17, 2023

timj Aug 17, 2023

andy-slac Aug 17, 2023

timj Aug 17, 2023

codecov bot commented Aug 18, 2023 •

edited

Loading

timj left a comment

timj Aug 18, 2023

andy-slac Aug 18, 2023

timj Aug 18, 2023

andy-slac commented Aug 18, 2023

andy-slac commented Aug 18, 2023

DM-40392: Add methods to update run collection name in a quantum #882

DM-40392: Add methods to update run collection name in a quantum #882

Conversation

andy-slac commented Aug 17, 2023 • edited Loading

Checklist

timj commented Aug 17, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented Aug 18, 2023 • edited Loading

Codecov Report

timj left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

andy-slac commented Aug 18, 2023

andy-slac commented Aug 18, 2023

andy-slac commented Aug 17, 2023 •

edited

Loading

codecov bot commented Aug 18, 2023 •

edited

Loading