Conformer CTC converted with nemo2riva 2.13.1 deployed on Riva 2.13.1 fails to load

nvidia-riva / nemo2riva

NeMo -> Riva Conversion Tool

MIT License

I have a Conformer CTC model built with the NeMo framework (https://github.com/NVIDIA/NeMo) that converts and deploys normally with Riva 2.11.0. However, if I convert the same NeMo file with nemo2riva 2.13.1 and deploy it on Riva 2.13.1, Riva (Triton server) fails to start with the error:

UNAVAILABLE: Internal: onnx runtime error 1: Load model from /data/models/streaming/1/model.onnx failed :/workspace/onnxruntime/onnxruntime/core/graph/model.cc:146 onnxruntime::Model::Model(onnx::ModelProto&&, const PathString&, const IOnnxRuntimeOpSchemaRegistryList*, const onnxruntime::logging::Logger&, const onnxruntime::ModelOptions&) Unsupported model IR version: 9, max supported IR version: 8

I have tried converting with --onnx_opset=15 and with --onnx_opset=17, as suggested in https://github.com/NVIDIA/NeMo/discussions/7278, but neither helps.
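
For anyone trying to reproduce this: the error is about the ONNX IR version, which is a separate field from the opset version, so changing --onnx_opset may not affect it. Below is a minimal diagnostic sketch in plain Python (no onnx package needed) that reads the IR version from a model file such as the model.onnx named in the error. The function names are hypothetical, and it assumes the file is a standard serialized ONNX ModelProto, in which ir_version is field 1 stored as a varint.

```python
import sys


def read_varint(buf, pos):
    """Decode one protobuf base-128 varint at pos; return (value, next_pos)."""
    value = 0
    shift = 0
    while True:
        byte = buf[pos]
        pos += 1
        value |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return value, pos
        shift += 7


def onnx_ir_version(data):
    """Return ModelProto.ir_version (field 1, varint), or None if absent.

    Only top-level fields are scanned; nested messages are skipped
    as opaque length-delimited blobs.
    """
    pos = 0
    while pos < len(data):
        tag, pos = read_varint(data, pos)
        field, wire_type = tag >> 3, tag & 0x7
        if wire_type == 0:  # varint
            value, pos = read_varint(data, pos)
            if field == 1:
                return value
        elif wire_type == 1:  # fixed 64-bit
            pos += 8
        elif wire_type == 2:  # length-delimited (strings, sub-messages)
            length, pos = read_varint(data, pos)
            pos += length
        elif wire_type == 5:  # fixed 32-bit
            pos += 4
        else:
            raise ValueError(f"unsupported protobuf wire type {wire_type}")
    return None


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (script name is hypothetical): python check_ir.py /path/to/model.onnx
    with open(sys.argv[1], "rb") as f:
        print("IR version:", onnx_ir_version(f.read()))
```

Running this on the model produced by each nemo2riva version would show whether the conversion step is what writes IR version 9. It reads the whole file into memory, which is simple but not ideal for very large models.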

Conformer CTC converted with nemo2riva 2.13.1 deployed on Riva 2.13.1 fails to load #36