
edge_dim added as argument to GATv2Conv #310

Merged
allaffa merged 13 commits into ORNL:main from gatv2conv_edge_features on Nov 27, 2024

Conversation

allaffa
Collaborator

@allaffa allaffa commented Nov 26, 2024

This PR applies two major changes:

  • allows passing edge_dim as an argument to GATStack, since the original PyG implementation of GATv2Conv supports edge attributes
  • parametrizes the examples qm9, md17, and LennardJones over all the message passing layers

These changes are crucial to support the future integration of graph transformer architectures, and they allow robust testing of capabilities that must flexibly switch across all message passing layers.
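As a minimal sketch of what the first change enables (assuming a recent PyG version; GATStack and its create method are HydraGNN internals, so only the underlying PyG call is shown, with placeholder sizes):

```python
import torch
from torch_geometric.nn import GATv2Conv

# PyG's GATv2Conv accepts an edge_dim argument so that edge attributes
# can participate in the attention computation. All sizes here are
# illustrative placeholders, not HydraGNN defaults.
conv = GATv2Conv(in_channels=16, out_channels=32, heads=2, edge_dim=4)

x = torch.randn(10, 16)                     # 10 nodes, 16 features each
edge_index = torch.randint(0, 10, (2, 40))  # 40 random edges
edge_attr = torch.randn(40, 4)              # 4 features per edge

out = conv(x, edge_index, edge_attr)        # shape: [10, 64] (2 heads * 32)
```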

@allaffa allaffa added the enhancement New feature or request label Nov 26, 2024
@allaffa allaffa requested a review from ArCho48 November 26, 2024 03:51
@allaffa allaffa self-assigned this Nov 26, 2024
ArCho48

This comment was marked as resolved.

@RylieWeaver
Collaborator

RylieWeaver commented Nov 27, 2024

@allaffa @ArCho48

Max, the fixes you put in place look good to me. For brevity, here's my understanding:

It looks like you identified the following errors and fixes:
(1) Some MP layers may not inherently support positional gradients, so you excluded them from the LJ example (see the force-from-energy sketch after this list). This fixed the errors with PNA, SAGE, GIN, GAT, and MFC.
(2) The examples did not allow choosing the MP layer via a command-line argument, so you added that option. This fixed the errors with PNAPlus and PNAEq.
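For context, a minimal sketch of the positional-gradient pattern the LJ example relies on (the quadratic "energy" below is a stand-in for a real model's energy head, not HydraGNN code):

```python
import torch

# Forces are computed as -dE/dpos via autograd, so the model's energy
# output must depend differentiably on the atomic positions. MP layers
# that never touch `pos` cannot produce this gradient.
pos = torch.randn(8, 3, requires_grad=True)    # 8 atoms in 3D
energy = pos.pow(2).sum()                      # toy differentiable energy
forces = -torch.autograd.grad(energy, pos)[0]  # shape: [8, 3]
```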

Additionally, I added the following in my commits:

(1) Fixes for object device placement during degree aggregation in distributed data processing and message passing (see the device-placement sketch after this list). This was not causing errors in CPU-only environments like these tests, but it would cause errors in GPU environments and was identified when testing on the DGX.
(2) Squeezed the energy predictions in Base to avoid the shape-mismatch warning.
(3) Added a comment explaining which types of MP layers are allowed in the grad_energy example in test_examples.py.
(4) Added MACE to the grad_energy example because it will propagate a positional gradient.
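A hypothetical sketch of the device-placement pitfall in (1) (illustrative only, not the actual HydraGNN code path):

```python
import torch

# A degree tensor accumulated on the CPU will not match edge indices
# living on the GPU; this passes CPU-only CI but fails on a GPU node,
# which is why the bug only surfaced on the DGX.
device = "cuda" if torch.cuda.is_available() else "cpu"
edge_index = torch.randint(0, 100, (2, 500), device=device)

deg = torch.zeros(100)           # implicitly created on the CPU
deg = deg.to(edge_index.device)  # explicit placement before aggregation
deg.scatter_add_(
    0, edge_index[1], torch.ones(edge_index.size(1), device=edge_index.device)
)
```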

Lastly, I wasn't able to reproduce the following warning in order to fix it. Do we still want to tackle it?

  • /home/runner/work/HydraGNN/HydraGNN/hydragnn/utils/model/model.py:178: UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.clone().detach() or sourceTensor.clone().detach().requires_grad_(True), rather than torch.tensor(sourceTensor).
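For reference, the replacement the warning itself recommends (a generic PyTorch sketch, not tied to the specific code at model.py:178):

```python
import torch

src = torch.randn(4)
# copied = torch.tensor(src)  # emits the UserWarning above
copied = src.clone().detach()                          # recommended
trainable = src.clone().detach().requires_grad_(True)  # if grads are needed
```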

@allaffa
Collaborator Author

allaffa commented Nov 27, 2024


@RylieWeaver
Thank you for helping me out in this PR.

@allaffa allaffa merged commit b935c88 into ORNL:main Nov 27, 2024
2 checks passed
@allaffa allaffa deleted the gatv2conv_edge_features branch November 27, 2024 17:26
@ArCho48
Collaborator

ArCho48 commented Nov 27, 2024 via email

RylieWeaver added a commit to RylieWeaver/HydraGNN that referenced this pull request Dec 28, 2024
* edge_dim added as argument to GATv2Conv

* GAT added in edge_models

* added model_type as optional input argument to qm9 and md17 examples

* edge_dim passed into GATv2Conv stack inside create method

* architectural arguments added to vectoroutput CI test

* num_samples variable moved outside main function scope in qm9 example

* SAGE, GIN, and MFC removed from examples where there are edge features

* Correct management of node degree on GPUs

* split examples test based on whether the model needs to use data.pos or not

* model_type overwrite in config moved to right location in the code

* comment for allowed stacks in LJ force_grad

* Add MACE to test

* black formatting

---------

Co-authored-by: Rylie Weaver <[email protected]>