PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
BSD 3-Clause "New" or "Revised" License
2.58k
stars
350
forks
source link
🐛 [Bug] Torch-TRT QDQ nodes affect perf vs PTQ, native TRT they do not #1323
Closed
ncomly-nvidia closed 1 year ago
Bug Description
When using the PyT-QAT toolkit, QAT perf is slower than PTQ, for TRT this is not the case.
Torch-TRT: