-
您好,我看教程里面只有使用rknn自带的工具做ptq量化,如果使用外部工具做好qat或者ptq量化,然后得到量化参数,请问rknn怎么加载这个onnx网络和对应的量化参数?
-
I used mtq.INT8_default_CFG as recommended for CNN networks (mtq.quantize(model, config, forward_loop). My initial model ran at 80FPS after quantization it dropped to 40FPS? I checked the model struct…
-
I want to use qat method for my model, but i can only find ptq quantizer in executorch, are there some examples of how to implement Quantization Aware Training (QAT) for qnn backend?
-
### 🐛 Describe the bug
In `https://github.com/pytorch/pytorch/blob/master/torch/ao/quantization/observer.py#L208`, the epsilon value used to determine the uniform quantization scale is defined as
…
-
Hello, is it possible to use the camera directly in the model to generate results in real time? Like yolo, you can input the camera data and get the result in real time.
-
## Description
what is the right way to calibrate a hybrid quantization model ?
i built my tensorrt engine from ONNX model by the sub code, i selected the ``` class Calibrator(trt.IInt8EntropyCa…
-
### 请提出你的问题 Please ask your question
运行环境为:
Kylinv10 OS
Paddle 2.6.0
PaddleSlim 2.6.1
FT2000+ CPU
昆仑芯R200 XPU
原始模型为Pytorch导出的Resnet50转Paddle模型
PTQ代码如下:
```python
paddleslim.quant.quant_…
czp97 updated
2 weeks ago
-
https://github.com/pytorch/ao/tree/main/torchao/prototype/quantized_training
Currently INT8 training recipes only support **row-wise scaling** for weight. This should be strictly better than (or at…
-
### Your current environment
running via k8s (EKS) v0.6.3 on g6e.12xlarge instances (aws GPU AMI) with a llama-based model (72B params, FP8 weights+activation quantized)
### Model Input Dumps
…
-
I encountered an issue while trying to quantize the YOLOv8s model using the Ryzen AI quantizer. Below are the details of the error:
### Error Message:
```
No CUDA runtime is found, using CUDA_HOM…