ONNXRuntime Optimization Causes Output Discrepancy in Specific Model Structure (Output Y) #23209
Labels
model:transformer
issues related to a transformer model: BERT, GPT2, Hugging Face, Longformer, T5, etc.
Describe the issue
The optimization of an ONNX model using ONNXRuntime results in discrepancies between the original and optimized outputs, particularly for output Y. The issue occurs when running the optimized model and is not dependent on the optimization level (opt_level), but instead appears to be related to the specific structure of the model.
To reproduce
Urgency
No response
Platform
Linux
OS Version
Ubuntu 20.04
ONNX Runtime Installation
Built from Source
ONNX Runtime Version or Commit ID
5c1b7cc
ONNX Runtime API
Python
Architecture
X64
Execution Provider
CUDA
Execution Provider Library Version
No response
The text was updated successfully, but these errors were encountered: