Describe the issue fp32 onnx model is works well. But when I c

A possible cause: <a class="issue-link js-issue-link" data-error-text="Failed to load

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

okay Thank you reply <a class="user-mention notranslate" data-hovercard-type="user" da

invalid converting fp32 to fp16 about onnxruntime HOT 4 CLOSED

Macsim2 commented on June 25, 2024

invalid converting fp32 to fp16

from onnxruntime.

Comments (4)

tianleiwu commented on June 25, 2024

A possible cause: microsoft/onnxconverter-common#271

If you find two Cast nodes linked like:

A walkaround is to remove extra Cast nodes like

import onnx
from onnxruntime.transformers.onnx_model import OnnxModel
onnx_model=OnnxModel(onnx.load("path/to/model_fp16.onnx"))
onnx_model.remove_cascaded_cast_nodes()
onnx_model.save_model_to_file("path/to/model_fp16_v2.onnx", use_external_data_format=False, all_tensors_to_one_file=True)

from onnxruntime.

Macsim2 commented on June 25, 2024

@tianleiwu Thank you that comments me
I checked the cast nodes through netron app

This is "model_fp16.onnx"

and then I just executed what you said
However, It doesn't works for me sadly
this is "model_fp16_v2.onnx"

For reference, I think that onnxruntime or onnxconverter_common are not invalid.
Because other models are works well. except for the probelm models (the problem model is bigger than other models)
Do you have any hint can I do by anychance?

from onnxruntime.

tianleiwu commented on June 25, 2024

Not every model can run in fp16. Sometime, some nodes need to upcast to fp32 to avoid overflow.

You can add --cmake_extra_defines onnxruntime_DEBUG_NODE_INPUTS_OUTPUTS=1 in build command line to build a package from source code. Set environment variable ORT_DEBUG_NODE_IO_DUMP_OUTPUT_DATA to be 1 before running your application. In this way, you can see of output of each node. Compare the stdout of two runs (fp32 model vs fp16 model) can find out the root cause. See https://onnxruntime.ai/docs/build/eps.html#cuda for more information.

from onnxruntime.

Macsim2 commented on June 25, 2024

okay Thank you reply @tianleiwu
I'll keep in track this problem and I will share with you if I found the reason why the model had didn't work

from onnxruntime.

Recommend Projects

invalid converting fp32 to fp16 about onnxruntime HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent