Refactor pipelines into classes #51

talmo · 2023-08-17T04:40:54Z

This PR constitutes a massive refactor started in #45 and #50 to fix how we compute our traits in a pipeline form.

While there are many changes, the key differences are that we now have the trait computation graph fully defined and structured as a set of classes in sleap_roots.trait_pipelines. They are organized into:

TraitDef: This defines the concept of a trait, including which other traits are used to compute it, its name, the function that is used to compute it, and additional metadata such as whether it's scalar (i.e., needs to be summarized) and whether it should be included in output CSVs. This allows us to be flexible in defining intermediate traits that may not be included in final analyses but that are necessary to compute other traits. Fully defining the inputs and outputs also allows us to compute all of the traits in the appropriate order by enforcing a topological ordering of the computation graph.
Pipeline: This is a base class which implements the generic trait computation steps, including frame-level, plant-level, and batch-level steps. Subclasses of Pipeline (such as DicotPipeline) can be defined by just inheriting from this class and defining two functions:
- define_traits: Returns a list of TraitDef that defines the set of traits computed by the pipeline.
- get_initial_frame_traits: Returns a dictionary of initial traits derived from the raw keypoint data. This is necessary because different pipeline types will have different initial traits depending on which root types are tracked (e.g., primary + lateral, primary + main, only main, etc.).

An example of how these are used:

from sleap_roots import Series, DicotPipeline
plant = Series.load(r"tests\data\soy_6do\6PR6AA22JK.h5", primary_name="primary_multi_day", lateral_name="lateral__nodes")
pipeline = DicotPipeline()

# Plant-level traits
traits = pipeline.compute_plant_traits(plant)
assert traits.shape == (72, 115)

# Batch level traits
all_traits = pipeline.compute_batch_traits([plant])
assert all_traits.shape == (1, 1018)

…s first)

…se graph and add comments. Delete duplicate `primary_depth`.

…ptional arguments

codecov · 2023-08-17T05:07:09Z

Codecov Report

Merging #51 (e71e8d9) into main (1d30dbd) will decrease coverage by 4.51%.
The diff coverage is 86.15%.

@@            Coverage Diff             @@
##             main      #51      +/-   ##
==========================================
- Coverage   81.40%   76.90%   -4.51%     
==========================================
  Files          13       13              
  Lines         726      762      +36     
==========================================
- Hits          591      586       -5     
- Misses        135      176      +41

Files Changed	Coverage Δ
sleap_roots/convhull.py	`76.11% <73.33%> (-6.74%)`	⬇️
sleap_roots/points.py	`78.57% <75.00%> (-18.49%)`	⬇️
sleap_roots/lengths.py	`76.92% <76.92%> (ø)`
sleap_roots/networklength.py	`81.48% <79.66%> (-11.06%)`	⬇️
sleap_roots/tips.py	`85.18% <81.81%> (-8.57%)`	⬇️
sleap_roots/bases.py	`88.46% <89.15%> (-3.13%)`	⬇️
sleap_roots/angle.py	`91.66% <89.74%> (-8.34%)`	⬇️
sleap_roots/scanline.py	`88.23% <92.30%> (-6.51%)`	⬇️
sleap_roots/trait_pipelines.py	`93.33% <93.33%> (ø)`
sleap_roots/__init__.py	`100.00% <100.00%> (ø)`
... and 2 more

... and 1 file with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

linwang9926 · 2023-08-17T16:03:29Z

sleap_roots/networklength.py

-    pts_all_array: np.ndarray,
+    primary_length: float,
+    lateral_lengths: Union[float, np.ndarray],
+    network_length_lower: float,
    fraction: float = 2 / 3,


Delete fraction as not used in this function

linwang9926 · 2023-08-17T16:04:24Z

sleap_roots/networklength.py

+        lateral_lengths: Lateral root lengths. Can be a single float (for one root)
+            or an array of floats (for multiple roots).
+        network_length_lower: The root length in the lower network.
+        fraction: The fraction of the network considered as 'lower'. Defaults to 2/3.


Delete the fraction

* Reorganizing the trait_map and modify ellipse, network functions. * Update ellipse argument default setting. * Reorganize ellipse and network functions by reducing arguments. * Map convex hull traits * Change primary to lateral when monocots is True * Get tips x and y coordinates uses network map * Change "stem" to "root" * Fix tip y map * Change root width back to take lateral_pts * Changing the order of positional arguments to match others (primary is first) * Fix plotting for using sleap-io API * Make positional arguments consistent * Refactor `get_base_xs` to use graph * Map `scanline_intersection_counts` and use keyword arguments * Refactor `get_base_ys`, `get_base_length`, and `base_ct_density` to use graph and add comments. Delete duplicate `primary_depth`. * Clean up dependencies. Fix tip_ys. * Refactor `get_root_lengths_max` for use with graph * Refactor `get_base_tip_dist` to make base and tip pts or all points optional arguments * Delete `primary_depth` * Delete traitsgraph * Delete traitsgraph dependencies * Refactor base-related traits to use graph optionally * Delete traits graph dependency * Use `get_primary_pts` from series class * Delete `get_primary_depth` tests * Fix trait map for base traits * Delete test for traitsgraph.py * Standardize trait definition in trait map * Change "graph" to "trait" * Fix docstrings in `get_bases` * Use `TraitDef` class * Fix docstrings * Add argument to class `TraitDef` whether to include in csv or if scalar * Change `attr` to `attrs` * Add `lengths.py` for length-related traits. * Add `primary_max_length_pts` to trait definitions * Add `pts_all_array` and `convex_hull` trait definitions * Fix docstring * Import base-related trait to `lengths.py` * Make sure arrays of points are 2-dimensional * Streamline point-related functions * Vectorize `get_node_ind` * Add trait definitions until `lateral_lengths` * Delete unnecessary code * Use node_ind for `get_root_angle` function. * Modify base functions by assuming primary_pts as the primary_length_max. * Modify argument pts as Optional in `get_base_tip_dist` function * Modify argument pts as Optional in `get_grav_index` function * Draft the trait_definitions using the defined TraitDef class. * Uppercase the `get_root_angle` function arg description. * Add test_lengths module for lengths-related functions. * Remove lengths-related functions from test_bases. * Set pts as Optional argument for `get_grav_index` function. * Change the module name for importing lengths-related functions. * Remove importing the points functions, only keep `get_all_pts_array`. * Test ellipse-related functions. * Redo the function `get_node_ind`. * Test function `get_node_ind`. * Angle function reset node_ind to array if only one value. * Angle function return nan if all Nan node, return value if single array. * Test angle functions. * Add network_width_depth_ratio in trait_definitions. * Reorganize arguments of `get_network_distribution_ratio` function. * Add `network_length` trait before calculating `network_solidity`. * Update `primary_root_length` function with calculated lengths. * Update `get_network_solidity` function with calculated network_length. * Test network-related functions. * Test points function (`get_all_pts_array`). * Update and test scanline functions using calculated scanline counts. * Refactor `get_root_pair_widths_projections` to take in `primary_max_length_pts` * Cleanup trait map * Fix tests for base-related traits * Add test for `get_max_lengths_pts` * Refactored `get_base_ct_density` to take `primary_length_max` and `lateral_base_pts` as arguments * Fixed multi-line strings * Refactor base-related traits * Refactor base-related traits and tests * Test root-length-related traits * Test tip-related traits * Refactor convex-hull-related traits * Test convex-hull functions * Lint * Lint * Lint * Lint * Lint * Fix kwargs involving `get_tips` in trait map * Fix input for pipeline tests * Refactor network related functions * Test pipeline * Refactor scanline function * Start refactoring pipeline into classes * Finish refactoring trait pipelines into classes * Runtime fixes * More refactoring to minimize redundant code across pipeline types * Rename module and fix tests * Add missing renamed modules * Fix summary tests * Fix Series to load video directly to bypass path resolution issues * Lint * Lint --------- Co-authored-by: Lin Wang <[email protected]> Co-authored-by: Elizabeth Berrigan <[email protected]>

Lin Wang and others added 30 commits July 20, 2023 15:31

Reorganizing the trait_map and modify ellipse, network functions.

c896369

Update ellipse argument default setting.

f874be3

Reorganize ellipse and network functions by reducing arguments.

c9f4931

Map convex hull traits

98ce00c

Change primary to lateral when monocots is True

d6bd4bb

Get tips x and y coordinates uses network map

4d7bc4d

Change "stem" to "root"

51d60ab

Fix tip y map

a73ebcd

Change root width back to take lateral_pts

fc155f8

Changing the order of positional arguments to match others (primary i…

ff9530d

…s first)

Fix plotting for using sleap-io API

c6e58a3

Merge branch 'elizabeth/fix_plotting' into elizabeth/pipeline_cache

1bd4ddd

Make positional arguments consistent

25e1047

Refactor get_base_xs to use graph

eb973bd

Map scanline_intersection_counts and use keyword arguments

fd30b78

Refactor get_base_ys, get_base_length, and base_ct_density to u…

4b529ae

…se graph and add comments. Delete duplicate `primary_depth`.

Clean up dependencies. Fix tip_ys.

296fcc7

Refactor get_root_lengths_max for use with graph

8f8a39a

Refactor get_base_tip_dist to make base and tip pts or all points o…

a25e8b9

…ptional arguments

Delete primary_depth

01ea8bc

Delete traitsgraph

617f379

Delete traitsgraph dependencies

383d454

Refactor base-related traits to use graph optionally

132ac7e

Delete traits graph dependency

3f3967f

Use get_primary_pts from series class

2f9465f

Delete get_primary_depth tests

7cf6817

Fix trait map for base traits

2f64aaf

Delete test for traitsgraph.py

a88dafc

Standardize trait definition in trait map

2294d14

Change "graph" to "trait"

7d4c612

eberrigan and others added 15 commits August 10, 2023 12:05

Lint

502b154

Lint

6ada71b

Lint

61a879a

Fix kwargs involving get_tips in trait map

ae2ff5a

Fix input for pipeline tests

3d2e5b7

Refactor network related functions

8b88a02

Test pipeline

0db548f

Refactor scanline function

12049d4

Start refactoring pipeline into classes

d4de693

Finish refactoring trait pipelines into classes

48d9247

Runtime fixes

31ef1f2

More refactoring to minimize redundant code across pipeline types

f71999c

Rename module and fix tests

4267a43

Add missing renamed modules

7caf85c

Fix summary tests

cf2a999

talmo requested review from eberrigan and linwang9926 August 17, 2023 04:42

talmo added 2 commits August 16, 2023 22:03

Fix Series to load video directly to bypass path resolution issues

d50b2cd

Lint

fa56d71

Lint

e71e8d9

talmo linked an issue Aug 17, 2023 that may be closed by this pull request

Potential speedups in pipeline runtime #35

Closed

eberrigan approved these changes Aug 17, 2023

View reviewed changes

This was referenced Aug 17, 2023

Add monocot pipelines #52

Closed

Implement high level trait computation methods #53

Open

linwang9926 reviewed Aug 17, 2023

View reviewed changes

eberrigan merged commit d4015f4 into main Aug 17, 2023
5 checks passed

eberrigan deleted the talmo/pipeline_class branch August 17, 2023 16:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor pipelines into classes #51

Refactor pipelines into classes #51

talmo commented Aug 17, 2023

codecov bot commented Aug 17, 2023 •

edited

Loading

linwang9926 Aug 17, 2023

linwang9926 Aug 17, 2023

Refactor pipelines into classes #51

Refactor pipelines into classes #51

Conversation

talmo commented Aug 17, 2023

codecov bot commented Aug 17, 2023 • edited Loading

Codecov Report

linwang9926 Aug 17, 2023

Choose a reason for hiding this comment

linwang9926 Aug 17, 2023

Choose a reason for hiding this comment

codecov bot commented Aug 17, 2023 •

edited

Loading