[OVEP] Add a check for type mismatches in QDQ stripping #834
base: ovep-develop
Conversation
The casting node is for the QuantizeLinear node, correct? Could you add the before and after graphs for this change? Will it introduce any performance penalty?
Please use the output type of the DQ node directly and fix the C++ Lint warnings (C-style casts).
onnxruntime/core/providers/openvino/qdq_transformations/qdq_scales_fix.cpp
The issue that causes the use of a C-style cast for the input arg is that it is provided as const by
Let's roll back this unordered_map, which is mostly unused. As for the C-style cast, the warning can be suppressed by using const_cast<T&>().
type_str_to_tensor_data_type_["tensor(uint64)"] = ONNX_NAMESPACE::TensorProto_DataType_UINT64;
type_str_to_tensor_data_type_["tensor(complex64)"] = ONNX_NAMESPACE::TensorProto_DataType_COMPLEX64;
type_str_to_tensor_data_type_["tensor(complex128)"] = ONNX_NAMESPACE::TensorProto_DataType_COMPLEX128;
type_str_to_tensor_data_type_["tensor(string)"] = ONNX_NAMESPACE::TensorProto_DataType_STRING;
Does it really make sense to add the possibility of converting fp32->string or to complex types, especially in the context of QDQ stripping?
In any case, since QuantizeLinear and DequantizeLinear aren't operations per se, but rather meta-operations, the fact that their input and output types differ seems more like an export bug to me. Let's cover that case only, but with an additional assertion to be on the safe side.
Done.
Looks good to me, thanks!
Could you open the PR to get a review from reviewers with write access? Please also add the Jira ticket.
Description
When rewiring the graph after eliminating QDQ pairs, the runtime now checks whether the type matches before and after the eliminated nodes and inserts a Cast node if there is a mismatch.
Motivation and Context
At present, QDQ elimination assumes the floating-point type is the same before the QuantizeLinear node and after the matching DequantizeLinear node, producing errors when the types mismatch.
Jira Ticket: CVS-175447