Open CaptainRui1000 opened 2 months ago
Pass high_precision_dtype="fp32"
and --op_types_to_quantize=["Conv"]
to quantize()
function. Then compilation with trtexec
should work.
But I see that trtexec --onnx=LightStereo-S-KITTI.preprocessed.onnx --best
gives the best runtime of 4.1084 ms
, while modelopt quantized output ONNX with above parameters gives 5.04126 ms
. We are working on this gap.
error info:
model structure:
model onnx: https://drive.google.com/file/d/1gP568tWTZXISpwbB7r76xXwudR61_z0k/view?usp=sharing
I used the recommended function and default parameters to ptq