-
Here we keep track of what part of `quantize` in `ptq_common.py` are tested and what are still missing.
-
-
Hello,
Thank you for this library. I had a couple of questions regarding the code uploaded:
1. Your paper mentions that both QAT and PTQ is possible and also shows results. However, your code does n…
-
Hi,
Thanks for the great work!
Have your team tried QAT/PTQ int8 quantization on star operations? After all, the networks are usually quantized before deploying in real production.
Thanks for…
-
### 🚀 Feature request
The task is to
* Map following types to a metatypes: index, linalg_vector_norm, clamp_min, clamp
* Add nncf graph tests to cover nncf graph building for target models scope: …
-
### Discussed in https://github.com/openvinotoolkit/nncf/discussions/2547
Originally posted by **MinGiSa** March 5, 2024
I've been working on converting Torch models into OpenVINO models rece…
-
I'm having issues to verify that a simulated quantized onnx file offers decent performance
Issue: After doing PTQ. I cannot use the quantized model in onnx-runtime! (preferably GPU)
-
Thanks for your nice work!
I've reproduced BEVFusion PTQ performace with the model(Resnet50) you provided and the script **tools/test-mAP-for-cuda.py** following [https://github.com/NVIDIA-AI-IOT…
-
按照您的步骤,测试了yolov5和yolov8的fp32、fp16的精度都算正常,都降了一点,但是int8的精度非常低,这可能是什么原因呢?
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.012
Average Precision (AP) @[ IoU=0.50 | area…
-
### Search before asking
- [X] I have searched the YOLOv6 [issues](https://github.com/meituan/YOLOv6/issues) and found no similar feature requests.
### Description
Hi YOLOv6 Team,
I am currentl…