cleanup function and ConvertQONNXtoFINN transformation issue #892
Comments
@auphelia Could you help with this issue, please?
Seems to be related to, or even the same as, one of the issues I am currently investigating: #878 (the third one from the list). However, I am not really making any progress on this particular one besides tracking it down to the `FoldTransposeIntoQuantInit()` transformation. At least in my case, I probably just do not want to apply the transformation to the transpose at all (as it is part of the attention pattern). But for this, conditions for telling the various uses of the transpose apart will be needed; one possible condition is sketched below. Did you make any progress by now @williamliao28? Otherwise, some input from you FINN people would be appreciated @auphelia @maltanar.
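A hypothetical helper (not existing qonnx API; the name and the NCHW channel-axis assumption are mine) illustrating one way such a condition could look, i.e. only fold transposes that leave the channel axis in place:

```python
from qonnx.core.modelwrapper import ModelWrapper


def transpose_keeps_channel_dim(model: ModelWrapper, transpose_node) -> bool:
    """Return True if a Transpose node leaves axis 1 (NCHW channels) fixed.

    Hypothetical sketch: assumes the input tensor shape is known.
    """
    perm = None
    for attr in transpose_node.attribute:
        if attr.name == "perm":
            perm = list(attr.ints)
    if perm is None:
        # An ONNX Transpose without an explicit perm reverses all axes.
        shape = model.get_tensor_shape(transpose_node.input[0])
        perm = list(range(len(shape)))[::-1]
    return perm[1] == 1
```

A transformation like `FoldTransposeIntoQuantInit()` could then skip nodes for which this returns `False`, leaving attention-style transposes untouched.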
For me it seems like currently all occurrences of the …
Closing this as the issue was most likely fixed in fastmachinelearning/qonnx#78.
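For anyone who wants to verify the fix on their own model, one can run the folding transformation in isolation; a minimal sketch, assuming the `qonnx.transformation.quant_constant_folding` import path used by recent qonnx releases:

```python
from qonnx.core.modelwrapper import ModelWrapper
from qonnx.transformation.quant_constant_folding import FoldTransposeIntoQuantInit

model = ModelWrapper("simple_test_w1a1_cifar10_2epoch.onnx")
# With the fix from fastmachinelearning/qonnx#78 this should pass shape
# inference even when the transpose touches the channel dimension.
model = model.transform(FoldTransposeIntoQuantInit())
model.save("folded.onnx")
```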
Quick summary
FINN cannot successfully convert a QONNX model that has a transpose before a concatenation layer.
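For concreteness, a minimal sketch of the kind of pattern that triggers the failure (toy layer sizes and bit widths of my own choosing, not the author's actual model from `simple_test.py`):

```python
import torch
import torch.nn as nn
from brevitas.nn import QuantConv2d, QuantIdentity


class TransposeConcatModel(nn.Module):
    """Toy model: quantized conv, transpose on the channel dim, then concat."""

    def __init__(self):
        super().__init__()
        self.conv = QuantConv2d(3, 28, kernel_size=3, padding=1, weight_bit_width=1)
        self.act = QuantIdentity(bit_width=1, return_quant_tensor=False)

    def forward(self, x):
        y = self.act(self.conv(x))
        # The transpose swaps channels (dim 1) with width (dim 3), so the
        # tensor that reaches the concat no longer has channels on axis 1.
        y_t = y.transpose(1, 3)
        return torch.cat([y_t, y_t], dim=1)
```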
Details
Any suggestions are welcome. I have tried the following experiments for this issue:

1. Applying `cleanup()` and then running `ConvertQONNXtoFINN()` (a minimal sketch of this flow is shown after this list). However, I get the following error message: `onnx.onnx_cpp2py_export.shape_inference.InferenceError: [ShapeInferenceError] (op_type: Cast, node name: Quant_4): [ShapeInferenceError] Inferred shape and existing shape differ in dimension 1: (64) vs (28)`. The traceback message is included in the screenshot posted below.
2. Reading the `cleanup()` function (in `qonnx/util/cleanup.py`) and the `ConvertQONNXtoFINN()` transformation (in `finn/transformation/qonnx/convert_qonnx_to_finn.py`). I found that they both call the transformation `FoldTransposeIntoQuantInit()` (line 40 in `cleanup.py` and line 81 in `convert_qonnx_to_finn.py`). This transformation attempts to fuse a `Transpose` node into the preceding `Quant` or `BipolarQuant` node, and performs shape inference before returning the transformed model. It seems to assume that the transpose is 2D or does not involve the channel dimension: when shape inference runs, `Quant` or `BipolarQuant` nodes corresponding to quantized activation layers are expected to have `in_channel == out_channel`. My test model has a transpose involving the channel dimension, so it fails.
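As referenced above, a minimal sketch of that conversion flow, assuming qonnx's model-level `cleanup_model()` entry point (the `cleanup()` mentioned in the issue is the file-level wrapper around it); the actual script is `simple_test_qonnx2finn.py`, listed below:

```python
from qonnx.core.modelwrapper import ModelWrapper
from qonnx.util.cleanup import cleanup_model
from finn.transformation.qonnx.convert_qonnx_to_finn import ConvertQONNXtoFINN

model = ModelWrapper("simple_test_w1a1_cifar10_2epoch.onnx")
# cleanup_model() calls FoldTransposeIntoQuantInit() internally (line 40 of
# cleanup.py), and ConvertQONNXtoFINN() calls it again (line 81 of
# convert_qonnx_to_finn.py), so the ShapeInferenceError can surface in
# either of these two steps.
model = cleanup_model(model)
model = model.transform(ConvertQONNXtoFINN())
model.save("simple_test_finn.onnx")
```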
Steps to Reproduce

I include example Python code in the links below, which may be useful for reproducing the error. The file `simple_test.py` contains the PyTorch module defining the model with Brevitas quantization. The file `simple_test_qonnx2finn.py` is the code I run inside the FINN Docker container, and it produces the error message described above. The file `simple_test_w1a1_cifar10_2epoch.onnx` is a pretrained model which can be used directly as the input for `simple_test_qonnx2finn.py`.

- `simple_test.py`
- `simple_test_qonnx2finn.py`
- `simple_test_w1a1_cifar10_2epoch.onnx`
1. Start the FINN Docker container with `bash ./run_docker.sh`.
2. Put `simple_test_w1a1_cifar10_2epoch.onnx` under the directory `build/quant_model/simple_test_cifar10/lr0.02`.
3. Run `python simple_test_qonnx2finn.py`.
Expected behavior

The code `simple_test_qonnx2finn.py` should run without any errors and produce the converted model.

Actual behavior
Running `python simple_test_qonnx2finn.py` generates the error described above.