nvidia-riva / nemo2riva

NeMo -> Riva Conversion Tool
MIT License
8 stars 9 forks source link

nemo2riva export doesn't work with FastPitch #42

Open fr0zenshard opened 4 months ago

fr0zenshard commented 4 months ago

Description

When I try to export a model to Riva (using nemo2riva==2.14.0), the FastPitch model export doesn't work (meanwhile, the HiFiGAN export works wonderfully). I've tried various combinations of onnx and onnxruntime, combinations of NeMo and nemo2riva versions, but in the end, nothing works and the following error always pops up:

onnxruntime.capi.onnxruntime_pybind11_state.InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Failed to load model with error: Invalid tensor data type 0.

Steps/Code to reproduce bug

Using NeMo image (1.21.0, 1.22.0, 1.23.0) + installed nemo2riva==2.14.0:

nemo2riva --out $(OUT_RIVA).riva $(NEMO_MODEL) --key tlt_encode --runtime-check

Additional info

The failed results mentioned by the authors of the issue above can be obtained by:

  1. Using exported .riva FastPitch from nemo2riva without a runtime_check
  2. Downgrading onnx_graphsurgeon to version 0.3.27

In such an environment, the build and deploy of rmir will occur, but at the start of the Riva server, there will be either an:

fr0zenshard commented 4 months ago

I've tried trained models from previous version of NeMo and newly trained within version 1.23.0 and etc.

fr0zenshard commented 4 months ago

hey?

davesarmoury commented 2 months ago

I've been fighting this for days. Using the latest nemo, riva, etc in a fresh venv, I hit the same problem, but adding "--onnx-opset=14" to the nemo2riva command for fastpitch seems to be working