Closed ohadjerci closed 6 months ago
What did you mean of this? 【The FP16 engine is working but with less performance 】
You mean the latency is higher than int8? So that you want to use int8?
> What did you mean of this? 【The FP16 engine is working but with less performance 】
The engine is built but with higher FN
> You mean the latency is higher than int8? So that you want to use int8?
The error "Assertion status == kSTATUS_SCUESS failed" does not allow engine building
The engine is built but with higher FN
If the fp16 accuracy is not as expected, then it's meaningless to try int8.
Can you try fp32? And also try lower version of tensorrt. i.e. 8.4
> Can you try fp32? And also try lower version of tensorrt. i.e. 8.4
Yes the calibration int8 for RetinaFace work for previous TR versions, but i would like to know the reason of the error "Assertion scales.size() == 1 failed" with tr 8.6.1. I tried yolov7 and v9 with the same tr version and the calibration int8 work
Another information, we can no longer use the retinaface code from ubuntu 22.04 because only tensorRT version 8.6.1 is compatible.
To reproduce the error, just use docker "nvcr.io/nvidia/tensorrt:24.01-py3", install opencv and launch calib int8 from "https://github.com/wang-xinyu/tensorrtx/tree/master/retinaface"
Hi @ohadjerci The code was developed on TRT 7.x. I guess there are some operations/layers which are deprecated on TRT 8.6. As of now, we don't have plan to upgrade the code to support TRT 8.6. It would be great if you can debug and solve the issue.
No, the issue is not related to deprecated operations.
The solution is to retrain with precision bf16 or fp16 and replace the plugin decoder by the a cpu code.
Also I recommend to use the onnx parser with TRT10
Env
About this repo
repo wang-xinyu/tensorrtx/retinaface model retinaface
Hello,
The FP16 engine is working but with less performance and with some warning for TR 8.6.1.6
In the other hand, calibration int8 with tr8.6.1.6 leads to errors :
Firstly, i tried to generate an engine with a trained model (half precision). Secondly, i reorganize the code like yolov9/v7.. but without sucess.
I’m hoping someone can tell me more about this error msg, or point me to documents that can explain it. Does it mean that the building process failed due while processing a plugin or the scale of the interpolation ?
Any suggestion is highly appreciated. Thanks in advance.