Added batching in transductive setting #128

Coerulatus · 2024-12-18T16:36:25Z

Hello everyone,

I have added the possibility of batching the data in the transductive setting.
When working with large graphs, selecting a subset of the graph while keeping the model's performance unchanged for the desired nodes can drastically reduce the memory requirements during training and inference.
In torch_geometric, the NeighborLoader performs neighbor sampling to achieve this. This can be done because, in the normal message-passing framework, the information propagates only as far as the number of message-passing steps performed.
The newly added NeighborCellsLoader works similarly but it also selects the relevant higher-order cells, by sequentially reducing all the incidences.
In the loader, you can also specify the rank to consider, meaning that you can perform batching over the nodes, edges, or any higher-order cell.

I have also added a tutorial that shows the basic functionality of NeighborCellsLoader. It also tests that the approach works as expected by comparing the model's outputs working with the full graph or with the batched one. Interestingly the number of hops needed is not necessarily equal to the number of layers in the higher-order networks. Information, at each layer, can in general travel further than the 1-neighborhood when working with these models.

…Benchmark into batching

review-notebook-app · 2024-12-18T16:36:30Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

codecov · 2024-12-18T17:38:57Z

Codecov Report

Attention: Patch coverage is 92.72727% with 16 lines in your changes missing coverage. Please review.

Project coverage is 90.16%. Comparing base (9271ec4) to head (3f44880).
Report is 9 commits behind head on main.

Files with missing lines	Patch %	Lines
topobenchmark/data/batching/cell_loader.py	85.18%	8 Missing ⚠️
topobenchmark/data/batching/utils.py	94.95%	6 Missing ⚠️
...pobenchmark/data/batching/neighbor_cells_loader.py	96.15%	1 Missing ⚠️
topobenchmark/nn/readouts/propagate_signal_down.py	50.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #128      +/-   ##
==========================================
+ Coverage   89.51%   90.16%   +0.65%     
==========================================
  Files         126      130       +4     
  Lines        3518     3732     +214     
==========================================
+ Hits         3149     3365     +216     
+ Misses        369      367       -2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

levtelyatnikov and others added 26 commits October 31, 2024 19:35

some random error

fe2d5e7

run

82b3ede

resolver_error

c225cc6

Merge branch 'main' of https://github.com/geometric-intelligence/Topo…

8a3714b

…Benchmark into batching

start the developments of node level batching

9cb5def

Marco - added batching functions

c988b2d

Marco - get_sampled_neighborhood reworked

877fce1

added proper plot function

09bab78

Marco - batching done

72f92ec

added some comments

76e1d03

get rid of random files

fb392ce

added just sampling over the graph

f623d24

Marco - defined NeighborCellsLoader

8519fef

merged changes

9488259

Marco - fixed conflict

5ec69b8

fixed __repr__ of readout

64c2c9f

support for multiple hops

e154231

changed DataloadDataset call

7bddf5d

test batching with multiple hops

69a0e94

Merge remote-tracking branch 'origin/main' into batching

54d428b

hydra already initialized

6a0e4ec

added test

78f5ed2

added batching in transductive setting

9758ff4

test mse when batching

4d80241

formatting

503512f

changed batch size for new TBDataloader

f92e378

Coerulatus added 2 commits December 18, 2024 17:05

ruff fixes

4a53d8f

fix temp folder

3f44880

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added batching in transductive setting #128

Added batching in transductive setting #128

Coerulatus commented Dec 18, 2024

review-notebook-app bot commented Dec 18, 2024

codecov bot commented Dec 18, 2024

Added batching in transductive setting #128

Are you sure you want to change the base?

Added batching in transductive setting #128

Conversation

Coerulatus commented Dec 18, 2024

review-notebook-app bot commented Dec 18, 2024

codecov bot commented Dec 18, 2024

Codecov Report