ptq Search Results - Githubissues

1000+ results
for ptq

rockchip-linux/rknn-toolkit #147

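How do I load an ONNX model and quantize it using known quantization parameters?

Hello, I see that the tutorial only covers PTQ quantization with RKNN's built-in tools. If I do QAT or PTQ quantization with an external tool and obtain the quantization parameters, how can RKNN load this ONNX network and the corresponding quantization parameters?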

helloyongyang updated 2 years ago
1
NVIDIA/TensorRT-Model-Optimizer #14

Tried to apply PTQ to a basic CV CNN network and got slower …

I used mtq.INT8_DEFAULT_CFG as recommended for CNN networks (mtq.quantize(model, config, forward_loop)). My initial model ran at 80 FPS, but after quantization it dropped to 40 FPS. I checked the model struct…

tmagcaya updated 1 month ago
13
pytorch/executorch #2169

Does ExecuTorch support QAT quantization for the Qualcomm QNN backend?

I want to use QAT for my model, but I can only find a PTQ quantizer in ExecuTorch. Are there any examples of how to implement Quantization Aware Training (QAT) for the QNN backend?

Novelfor updated 6 months ago
1
pytorch/pytorch #90288

quantization observers: can we relax the default epsilon val…

### 🐛 Describe the bug In `https://github.com/pytorch/pytorch/blob/master/torch/ao/quantization/observer.py#L208`, the epsilon value used to determine the uniform quantization scale is defined as …

vkuzo updated 1 year ago
4
fundamentalvision/BEVFormer #191

Questions about real-time

Hello, is it possible to use the camera directly in the model to generate results in real time? Like yolo, you can input the camera data and get the result in real time.

PlutoXN updated 6 months ago
1
NVIDIA/TensorRT #3978

How to make PTQ calibration for a Hybrid Quantization model …

## Description What is the right way to calibrate a hybrid quantization model? I built my TensorRT engine from an ONNX model with the code below. I selected the ``` class Calibrator(trt.IInt8EntropyCa…

renshujiajia updated 4 months ago
3
PaddlePaddle/Paddle #69224

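Using PaddleSlim for offline static quantization and exporting the model, using PaddleLite to further convert the quantized model to an int8 model, using Pa…

### Please ask your question Runtime environment: Kylinv10 OS, Paddle 2.6.0, PaddleSlim 2.6.1, FT2000+ CPU, Kunlunxin R200 XPU. The original model is a ResNet50 exported from PyTorch and converted to a Paddle model. The PTQ code is as follows: ```python paddleslim.quant.quant_…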

czp97 updated 2 weeks ago
2
pytorch/ao #1010

Add weight tensor-wise scaling for INT8 quantized and mixed-…

https://github.com/pytorch/ao/tree/main/torchao/prototype/quantized_training Currently INT8 training recipes only support **row-wise scaling** for weight. This should be strictly better than (or at…

gau-nernst updated 1 month ago
1
vllm-project/vllm #10002

[Bug]: RuntimeError: Engine loop has died with larger contex…

### Your current environment running via k8s (EKS) v0.6.3 on g6e.12xlarge instances (aws GPU AMI) with a llama-based model (72B params, FP8 weights+activation quantized) ### Model Input Dumps …

sam-huang1223 updated 1 week ago
4
amd/RyzenAI-SW #122

Error during YOLOv8s quantization with Ryzen AI quantizer (R…

I encountered an issue while trying to quantize the YOLOv8s model using the Ryzen AI quantizer. Below are the details of the error: ### Error Message: ``` No CUDA runtime is found, using CUDA_HOM…

Siva50005 updated 2 months ago
11
