Improve the accuracy on ogbn-papers100M with GAT model, add undirected graph on ogbn-products #9467

Open · wants to merge 90 commits into master

Conversation

@guanxingithub (Contributor) commented Jun 27, 2024:

Improve the test accuracy to >60% on ogbn-papers100M; consolidate repetitive examples.

guanxingithub and others added 27 commits June 3, 2024 19:51
2. Add undirected graph on ogbn-products with Sage model
3. Add undirected graph on ogbn-products with GAT model
@puririshi98 (Contributor):

@akihironitta @rusty1s anything else needed to merge?

@akihironitta (Member) left a comment:

This is great! 🚀


inference_start = time.time()
val_acc = test(val_loader)
test_acc = test(test_loader)
Member:

I don't understand why we need to evaluate the model on the test set every epoch.

Contributor:

Agreed. @guanxingithub, please make the test evaluation happen only at the end of all training epochs.

Contributor Author:

All evaluations are used for data analysis after training is done. With all of this statistical data collected during training, we can do further performance and accuracy analysis.

Contributor Author:

If you agree, I can put all of this extra testing and evaluation under a verbose mode. From the user's point of view, they wouldn't need to run these extra evaluations, but anyone who wants to do performance and accuracy analysis can turn on verbose mode to collect raw performance and accuracy data for further statistical analysis.

Contributor:

No, you should simply not be running the test set during training; all other metrics are fine. Please adjust accordingly.
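
A rough sketch of the requested structure, for illustration only (train, train_loader, and num_epochs are assumed names; test, val_loader, and test_loader are taken from the excerpt above, so this is not the PR's exact code):

best_val = 0.0
for epoch in range(1, num_epochs + 1):
    loss = train(train_loader)
    val_acc = test(val_loader)        # per-epoch validation is fine
    best_val = max(best_val, val_acc)
    print(f'Epoch {epoch:02d}: loss={loss:.4f}, val_acc={val_acc:.4f}')

test_acc = test(test_loader)          # the test set is evaluated exactly once, after training
print(f'Best val acc: {best_val:.4f}, final test acc: {test_acc:.4f}')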

Contributor Author:

If we keep the evaluation results on the test set for each epoch, that will be very helpful for further analysis. I did the same evaluation on the test set with other tools; if we can do the same evaluation in PyG, it will be very helpful for comparing accuracy between PyG and other tools apples to apples. Thanks.

Member:

"that will be very helpful for further analysis"

What kind of analysis are you referring to specifically?

Contributor Author:

Performance and accuracy analysis across platforms, across different tools, and across different versions of those tools.

Contributor Author:

For example, for platform 1 and platform 2, we collect all of this performance and accuracy data and then analyze the data from both platforms to test hypotheses such as whether the performance is the same or different and whether the accuracy differs, from a statistical point of view. If there is a significant difference, we investigate what could cause it. This builds the foundation for further investigation and improvement. We have done a lot of this kind of analysis not only in PyG but also in other tools.

Comment on lines +239 to +242
if val_acc > best_val:
    best_val = val_acc
if test_acc > best_test:
    best_test = test_acc
Member:

Related to my above comment: I don't understand why it's ok to pick the best metric evaluated on the test set across all epochs.

Contributor:

Well, this was just for tracking, it seems. There is no checkpointing being done, but the test set should only be run once at the end anyway. @guanxingithub, please remove the test accuracy tracking; the test set should only be run once at the end and reported then.
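
One common pattern for making the single final test number correspond to the best validation epoch is to checkpoint the weights at the best validation accuracy and run the test set once from that checkpoint. This is only a sketch with assumed names (model, train, test, num_epochs, and the loaders), not what the PR currently does:

import copy

best_val, best_state = 0.0, None
for epoch in range(1, num_epochs + 1):
    train(train_loader)
    val_acc = test(val_loader)
    if val_acc > best_val:
        best_val = val_acc
        best_state = copy.deepcopy(model.state_dict())   # keep the best-val weights

model.load_state_dict(best_state)     # restore the best-val weights
test_acc = test(test_loader)          # single test-set evaluation at the very end
print(f'Best val acc: {best_val:.4f}, test acc at best val: {test_acc:.4f}')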

Contributor Author:

The best metric evaluated on the test set is one of the indices used for statistical accuracy analysis. With all of the information collected during training, we can compare the performance and accuracy of a particular model across different platforms and across different versions of the tools used to run the training.

Contributor:

@guanxingithub, in general in machine learning it is not proper to run the test set before training/validation has finished. Tracking the best validation accuracy across multiple epochs for perflab is fine, but there should only be one final test accuracy for them to track; running the test set during training is sort of "cheating". Please address my and Akihiro's concerns in order to get this merged.

Contributor Author:

We don't train on the test set; we just evaluate accuracy on the test set after the model has been trained on the training set.

Member:

Hmmm, still not sure why it makes sense to evaluate the model on the test set during training. Just to make sure, I'm suggesting deleting L233 and L241-L242.

best_test = test_acc
times.append(time.time() - train_start)

if verbose:
Member:

nit: The concern I previously had was having too many redundant/meaningless print statements in the example, but now the script looks pretty minimal. I think it's ok to just remove this verbose flag.

Contributor Author:

The redundant/meaningless prints were intended to collect raw performance and accuracy data so that we can do further performance and accuracy analysis after running the scripts on different platforms with different versions of the tools.

Contributor:

It's fine now. Just remove the verbose flag and let all prints come out (except the test accuracy), and please only run the test set once at the end.

@akihironitta (Member) left a comment:

Let's also make sure the filenames are updated in the entire repo, e.g.,

$ git grep ogbn_products_sage.py
README.md:- **[SAGEConv](https://pytorch-geometric.readthedocs.io/en/latest/generated/torch_geometric.nn.conv.SAGEConv.html)** from Hamilton *et al.*: [Inductive Representation Learning on Large Graphs](https://arxiv.org/abs/1706.02216) (NIPS 2017) \[[**Example1**](https://github.com/pyg-team/pytorch_geometric/blob/master/examples/reddit.py), [**Example2**](https://github.com/pyg-team/pytorch_geometric/blob/master/examples/ogbn_products_sage.py), [**Example3**](https://github.com/pyg-team/pytorch_geometric/blob/master/examples/graph_sage_unsup.py), [**Example4**](https://github.com/pyg-team/pytorch_geometric/blob/master/examples/graph_sage_unsup_ppi.py)\]
README.md:- **[NeighborLoader](https://pytorch-geometric.readthedocs.io/en/latest/modules/loader.html#torch_geometric.loader.NeighborLoader)** from Hamilton *et al.*: [Inductive Representation Learning on Large Graphs](https://arxiv.org/abs/1706.02216) (NIPS 2017) \[[**Example1**](https://github.com/pyg-team/pytorch_geometric/blob/master/examples/reddit.py), [**Example2**](https://github.com/pyg-team/pytorch_geometric/blob/master/examples/ogbn_products_sage.py), [**Example3**](https://github.com/pyg-team/pytorch_geometric/blob/master/examples/ogbn_products_gat.py), [**Example4**](https://github.com/pyg-team/pytorch_geometric/blob/master/examples/hetero/to_hetero_mag.py)\]
torch_geometric/loader/neighbor_sampler.py:        `examples/ogbn_products_sage.py
torch_geometric/loader/neighbor_sampler.py:        ogbn_products_sage.py>`_.

examples/ogbn_train.py (review comment outdated, resolved)
Co-authored-by: Akihiro Nitta <[email protected]>