Open 1049451037 opened 1 year ago
The model seems big and can't run in 16GB GPU. What GPU and memory size are you using?
I run on a 24GB RTX 3090.
any update on this , i am also facing same error . i can run the onnx model on gpu=0 but can't run on gpu=1.
[E:onnxruntime:log, cuda_call.cc:119 CudaCall] CUDA failure 1: invalid argument ; GPU=1 ; hostname=90944f8d90dc ; expr=cudaMemcpyAsync(output.MutableDataRaw(), input.DataRaw(), input.Shape().Size() * input.DataType()->Size(), cudaMemcpyDeviceToDevice, stream); 2023-07-20 09:23:03.925983174 [E:onnxruntime:, sequential_executor.cc:494 ExecuteKernel] Non-zero status code returned while running Einsum node. Name:'/model/layer.0/rel_attn/Einsum_8' Status Message: /workspace/onnxruntime/onnxruntime/core/providers/cpu/math/einsum_utils/einsum_auxiliary_ops.cc:298 std::unique_ptr<onnxruntime::Tensor> onnxruntime::EinsumOp::Transpose(const onnxruntime::Tensor&, const onnxruntime::TensorShape&, const gsl::span<const long unsigned int>&, onnxruntime::AllocatorPtr, void*, const Transpose&) 21Einsum op: Transpose failed: CUDA failure 1: invalid argument ; GPU=1 ; hostname=90944f8d90dc ; expr=cudaMemcpyAsync(output.MutableDataRaw(), input.DataRaw(), input.Shape().Size() * input.DataType()->Size(), cudaMemcpyDeviceToDevice, stream);
Describe the issue
I have a onnx file that can run normally on cuda device 0, but raises this error when I run it on device 1:
I also tried to run on device 2, same error happens.
To reproduce
Download the following onnx: https://cloud.tsinghua.edu.cn/f/4f0a921584564e45be6d/?dl=1
Run it with python:
Urgency
No response
Platform
Linux
OS Version
20.04
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.14.0
ONNX Runtime API
Python
Architecture
X64
Execution Provider
CUDA
Execution Provider Library Version
No response