Works around internal PyTorch bug related to shared weights #96

daniel-j-h · 2018-08-01T13:29:09Z

The PyTorch tracing model exporter has an issue with shared weights / refs to layers, see

The workaround is to not store any refs to the resnet layers in the model itself. Instead we have to directly pull the input tensors through the resnet layers in the forward pass.

This will change the .pth file format. We could work around that by loading the serialized state dict in non-strict fashion in the export tool. But then we are opening up the door to letting actual errors going through.

Note: we need to fix the FPN pull request in a similar way #75

cc @bkowshik @maning

daniel-j-h · 2018-08-02T14:26:34Z

This needs a release with major version bump since we are breaking the .pth format.

Works around internal PyTorch bug related to shared weights

ca3bc00

daniel-j-h merged commit ca3bc00 into master Aug 2, 2018

daniel-j-h deleted the shared-weights branch August 2, 2018 14:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Works around internal PyTorch bug related to shared weights #96

Works around internal PyTorch bug related to shared weights #96

daniel-j-h commented Aug 1, 2018

daniel-j-h commented Aug 2, 2018

Works around internal PyTorch bug related to shared weights #96

Works around internal PyTorch bug related to shared weights #96

Conversation

daniel-j-h commented Aug 1, 2018

daniel-j-h commented Aug 2, 2018