Comments (5)
Do you have a model where this causes an issue?
Sure, the model is downloaded from the ONNX Model Zoo: https://github.com/onnx/models/blob/main/validated/vision/classification/densenet-121/model/densenet-12-int8.onnx.
The unit test is at onnxruntime/onnxruntime/test/contrib_ops/qlinear_concat_test.cc (https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/test/contrib_ops/qlinear_concat_test.cc).
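For reference, a minimal Python sketch (not from the thread) to confirm the downloaded model actually contains QLinearConcat nodes; the local filename is an assumption, use whatever path you saved the model to:

```python
import onnx

# Load the model downloaded from the Model Zoo link above
# (the local filename here is assumed).
model = onnx.load("densenet-12-int8.onnx")

# QLinearConcat is an onnxruntime contrib op, so it lives in the
# com.microsoft domain rather than the standard ONNX opset.
for node in model.graph.node:
    if node.op_type == "QLinearConcat":
        print(node.name, node.domain, len(node.input), "inputs")
```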
I don't think the lack of support makes a difference in this case.
QLinearConcat is an internal operator, so we would expect it to be handled by our internal execution providers. In that scenario, the fact that the zp/scale values aren't nicely provided by the NodeUnit doesn't change anything: nothing (AFAIK) is trying to read the zp and scale from the NodeUnit, since we have static kernels in the CPU/CUDA/DML execution providers to handle QLinearConcat.
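To illustrate that path, a minimal sketch (again not from the thread; it assumes the Model Zoo file above with its usual float32 image input) that runs the model on the CPU execution provider, where the static QLinearConcat kernel handles the op without anything reading zp/scale from a NodeUnit:

```python
import numpy as np
import onnxruntime as ort

# The CPU EP has a static kernel for com.microsoft.QLinearConcat, so the
# model runs without any NodeUnit-based zp/scale lookup.
sess = ort.InferenceSession("densenet-12-int8.onnx",
                            providers=["CPUExecutionProvider"])

inp = sess.get_inputs()[0]
# Assumes the usual zoo layout: a float32 input such as [1, 3, 224, 224];
# symbolic dims are replaced with 1 for this smoke test.
shape = [d if isinstance(d, int) else 1 for d in inp.shape]
out = sess.run(None, {inp.name: np.random.rand(*shape).astype(np.float32)})
print(out[0].shape)
```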
Really for the model to be in the ONNX repo it should be in QDQ format (DequantizeLinear on each input with a Concat node and a QuantizeLinear on the output) instead of having been saved using internal onnxruntime operators. With that model format any execution provider could convert the set of nodes into a quantized concat operation, and it would want the zp/scale for the inputs to be conveniently available via the NodeUnit. That's a different NodeUnit constructor though and not the path with the TODO comment.
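For concreteness, here is a minimal sketch of that QDQ layout built with the onnx helper API; the shapes, scales, and zero points are made up purely for illustration:

```python
import numpy as np
from onnx import TensorProto, helper, numpy_helper

# Illustrative per-tensor scale/zero-point initializers (values are made up).
inits = [
    numpy_helper.from_array(np.array(0.02, np.float32), "x0_scale"),
    numpy_helper.from_array(np.array(128, np.uint8), "x0_zp"),
    numpy_helper.from_array(np.array(0.03, np.float32), "x1_scale"),
    numpy_helper.from_array(np.array(120, np.uint8), "x1_zp"),
    numpy_helper.from_array(np.array(0.025, np.float32), "y_scale"),
    numpy_helper.from_array(np.array(125, np.uint8), "y_zp"),
]

nodes = [
    # DequantizeLinear on each input ...
    helper.make_node("DequantizeLinear", ["x0", "x0_scale", "x0_zp"], ["x0_f"]),
    helper.make_node("DequantizeLinear", ["x1", "x1_scale", "x1_zp"], ["x1_f"]),
    # ... a plain float Concat in the middle ...
    helper.make_node("Concat", ["x0_f", "x1_f"], ["y_f"], axis=1),
    # ... and QuantizeLinear on the output.
    helper.make_node("QuantizeLinear", ["y_f", "y_scale", "y_zp"], ["y"]),
]

graph = helper.make_graph(
    nodes, "qdq_concat",
    [helper.make_tensor_value_info("x0", TensorProto.UINT8, [1, 8]),
     helper.make_tensor_value_info("x1", TensorProto.UINT8, [1, 8])],
    [helper.make_tensor_value_info("y", TensorProto.UINT8, [1, 16])],
    inits,
)
model = helper.make_model(graph)
```

In this form every quantized input's scale and zero point hang directly off its DequantizeLinear node, which is what lets a NodeUnit surface them uniformly to any execution provider.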
Sure, no error actually occurs, but in node_unit.cc I saw that the QLinearMatMul and QLinearConv ops have already been handled like that: QDQMatmul/QDQConv bind the node_arg and quant_param together as a single input on the NodeUnit. I made this request because our execution provider treats quantized operations and normal operations equally. If QLinearConcat's inputs could be retrieved as easily as QLinearConv's, it would make our provider work more efficiently.
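To make the request concrete, a small sketch of the grouping being asked for, based on the com.microsoft QLinearConcat signature (Y_scale and Y_zero_point first, then repeating (X, X_scale, X_zero_point) triples); the helper function is hypothetical, not part of any onnxruntime API:

```python
def qlinear_concat_inputs(node):
    """Group a com.microsoft QLinearConcat node's flat input list into
    (tensor, scale, zero_point) triples, the way a NodeUnit could
    expose them to an execution provider.

    Per the contrib-op signature, the inputs are ordered:
      Y_scale, Y_zero_point, X0, X0_scale, X0_zero_point, X1, ...
    """
    y_scale, y_zp = node.input[0], node.input[1]
    rest = node.input[2:]
    triples = [tuple(rest[i:i + 3]) for i in range(0, len(rest), 3)]
    return (y_scale, y_zp), triples
```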
I have added an item to the backlog to look at adding the quantization info to a NodeUnit for the QLinearX operators. No guarantee when we'll get to it though.