torchsummary() extensions by input_initializer, dtype #89
Background
I needed to summarize the OpenAIGPTDoubleHeadsModel in huggingface/pytorch-transformers, which takes multiple torch.zeros() tensors with dtype=torch.int64 as (dummy) input. This is currently not supported in the pytorch-summary tool, so I extended it.
Extensions:
- dtype added to the torchsummary() input variables
- input_initializer added to the torchsummary() input variables
Bugfix:
- batch_size default value changed from -1 to 2 so that it is actually used and returns a correct total_input_size
- TypeError:
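To illustrate why the batch_size default matters for total_input_size, here is a minimal sketch of an input-size estimate in which the batch size enters as a plain factor. The helper name estimate_input_size_mb and its bytes_per_element parameter are hypothetical and not part of the tool; 4 bytes per element is an assumption matching float32, while torch.int64 inputs would need 8.

```python
from math import prod


def estimate_input_size_mb(input_sizes, batch_size=2, bytes_per_element=4):
    """Estimate the memory taken by the dummy inputs, in MiB.

    input_sizes is a list of per-sample shapes (without the batch dimension),
    one entry per model input. This is an illustrative helper, not the
    tool's own code.
    """
    if batch_size < 1:
        # A placeholder such as -1 would yield a negative, meaningless size.
        raise ValueError("batch_size must be a positive integer")
    elements_per_sample = sum(prod(size) for size in input_sizes)
    total_bytes = elements_per_sample * batch_size * bytes_per_element
    return total_bytes / (1024 ** 2)
```

With a positive default such as 2, the estimate scales with the batch that is actually fed through the model, and several inputs are simply summed.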
Testing:
- run_openai_gpt.py script after modification
- run_openai_gpt.py script by adding the following lines after the model was loaded:
To analyze the input tensors you can use this code snippet:
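A minimal sketch of such an inspection, not necessarily the snippet used in this PR: describe_inputs is a hypothetical helper that reports the shape and dtype of each input. It assumes only that every input exposes .shape and .dtype attributes, which torch.Tensor does.

```python
def describe_inputs(inputs):
    """Return one summary line per input: its index, shape, and dtype."""
    lines = []
    for index, tensor in enumerate(inputs):
        # tuple() turns a torch.Size (or any sequence) into a plain tuple.
        shape = tuple(tensor.shape)
        lines.append(f"input {index}: shape={shape}, dtype={tensor.dtype}")
    return lines
```

Calling print("\n".join(describe_inputs(model_inputs))) on the list of tensors passed to the model (model_inputs is a placeholder name) shows at a glance which shapes and dtypes the dummy inputs for the summary need to match.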
P.S.:
Thanks for the tool 👍, guess I'll be using it quite often... it's nice & simple with great overview!