Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
RFC: Fix Edge Feature Processing in InteractionNetwork
Summary
The current implementation of
InteractionNetwork
does not properly propagate processed edge features through the message passing flow, causing the residual connection to simply double the original edge features instead of adding processed features to the original ones.Background
In the current implementation, the
message()
method processes edge features but these processed features are not properly passed through to theupdate()
method. This causes theupdate()
method to use the original edge features instead of the processed ones, resulting in incorrect residual connections.Problem
When running the network:
produces:
This indicates the output is simply doubling the input due to the residual connection, rather than adding processed features to the original ones.
Proposed Solution
Add a temporary instance variable to store processed edge features between message and update steps:
Rationale
Testing
New tests verify:
Alternatives Considered
__init__
Backward Compatibility
This change maintains the same interface and output shapes, but will produce different numerical results. Models trained with the old implementation will need to be retrained.