Inference Output CPU vs CUDA not the same [1.19.2]

microsoft / onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

MIT License

14.77k stars 2.94k forks source link

Describe the issue

Hey everyone,

I was testing a model for face occlusion and I am getting different results between GPU and CPU. Happy to help if anyone can give me into the right direction? (e.g. debug) to help fixing this issue.

Cheers

To reproduce

Load the model on the same hardware -> different results.

Urgency

Very important

Platform

Linux

OS Version

20.05

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

1.19.2

ONNX Runtime API

Python

Architecture

X64

Execution Provider

CUDA

Execution Provider Library Version

CUDA 12.5

microsoft / onnxruntime