ENH: add subgraph method to Graph to get subsets #640

martinfleis · 2023-11-10T14:52:24Z

I have found myself in a need to create subsets of a large graph covering only specific portions of my geometries. Hence I have developed a method to create a subgraph since making a subset of adjacency and passing that to a constructor discards isolates.

martinfleis · 2023-11-10T14:52:58Z

libpysal/graph/_utils.py

-    islands = np.setdiff1d(ids, heads)
+    islands = pd.Index(ids).difference(pd.Index(heads))


This turned out to be several orders of magnitude faster for large string indices.

codecov · 2023-11-10T15:13:37Z

Codecov Report

Merging #640 (77bd0ff) into main (79c4f82) will increase coverage by 0.0%.
Report is 1 commits behind head on main.
The diff coverage is 100.0%.

@@          Coverage Diff          @@
##            main    #640   +/-   ##
=====================================
  Coverage   83.9%   83.9%           
=====================================
  Files        139     139           
  Lines      14970   14976    +6     
=====================================
+ Hits       12562   12569    +7     
+ Misses      2408    2407    -1

Files	Coverage Δ
libpysal/graph/_utils.py	`89.3% <100.0%> (ø)`
libpysal/graph/base.py	`97.7% <100.0%> (+<0.1%)`	⬆️
libpysal/graph/tests/test_base.py	`100.0% <100.0%> (ø)`

... and 6 files with indirect coverage changes

knaaptime · 2023-11-10T17:10:42Z

cool. I wonder if we can use this to enhance pysal/esda#259 as discussed over there (e.g by computing the largest range first, then successively cutting down the graph). Seems like you'd still need a tree query to get the indices though?

martinfleis · 2023-11-10T18:45:22Z

I wonder if we can use this to enhance pysal/esda#259 as discussed over there (e.g by computing the largest range first, then successively cutting down the graph). Seems like you'd still need a tree query to get the indices though?

I don't think so. This aims to create a subgraph based on a subset of focals. In correlogram, you need to keep the same focals but cut their neighbors.

What you could doe once #635 is in is something like this:

for i in distances:
    adj = graph_w_distance.adjacency.copy()
    adj[adj > i] = 0
    smaller_graph = graph.Graph(adj, is_sorted=True).transform("r")
    # compute stats using smaller_graph

This assumes that graph_w_distance has weight == distance. Right now, there is the sorting bottleneck in this code but after #635 that will be gone.

edit: you could eventually call eliminate_zeros() from #634 on that as well before transform to get a cleaner graph but I am less sure about perf benefits of that.

ENH: add subgraph method to Graph to get subsets

7672131

martinfleis added enhancement graph labels Nov 10, 2023

martinfleis requested review from sjsrey, ljwolf, knaaptime and jGaboardi November 10, 2023 14:52

martinfleis self-assigned this Nov 10, 2023

martinfleis commented Nov 10, 2023

View reviewed changes

no dtype check to make windows green

77bd0ff

jGaboardi approved these changes Nov 10, 2023

View reviewed changes

ljwolf approved these changes Nov 13, 2023

View reviewed changes

martinfleis merged commit 85e6d5f into pysal:main Nov 19, 2023

martinfleis deleted the subset branch November 19, 2023 09:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: add subgraph method to Graph to get subsets #640

ENH: add subgraph method to Graph to get subsets #640

martinfleis commented Nov 10, 2023

martinfleis Nov 10, 2023

codecov bot commented Nov 10, 2023 •

edited

Loading

knaaptime commented Nov 10, 2023

martinfleis commented Nov 10, 2023 •

edited

Loading

		islands = np.setdiff1d(ids, heads)
		islands = pd.Index(ids).difference(pd.Index(heads))

ENH: add subgraph method to Graph to get subsets #640

ENH: add subgraph method to Graph to get subsets #640

Conversation

martinfleis commented Nov 10, 2023

martinfleis Nov 10, 2023

Choose a reason for hiding this comment

codecov bot commented Nov 10, 2023 • edited Loading

Codecov Report

knaaptime commented Nov 10, 2023

martinfleis commented Nov 10, 2023 • edited Loading

codecov bot commented Nov 10, 2023 •

edited

Loading

martinfleis commented Nov 10, 2023 •

edited

Loading