-
What config file do I need to pass to the PTQ or QAT tool? Is it the model config file, like the one in mmdetection? Can you give me an example of how to quantize YOLOX to the ONNX bac…
-
Hello! After reading your paper carefully, I find the proposed method effective for PTQ, and I am very interested in it. As for the module capacity calculation method mentioned in th…
-
I use AIMET PTQ to quantize the CLIP text model.
But I encounter this error: [KeyError: 'Graph has no buffer /text_model/encoder/layers.0/layer_norm1/Constant_output_0, referred to as input for …
-
### Checklist
- I have searched related issues but cannot get the expected help.
- I have read related documents and don't know what to do.
### Describe the question you meet
Does mmrazor su…
-
Hi,
I am looking at quantization of a SlowFast model. According to Netron, the input is 1x1x3x32x256x256. May I know if it is possible to perform PTQ on this model?
Do I just implement the Calibr…
MrOCW updated
2 years ago
-
## Description
I generated a calibration cache for a Vision Transformer ONNX model using the EntropyCalibration2 method. When trying to generate the engine file from the cache file for INT8 precision using trte…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
I am running inference on a T4, which supports INT8 and INT4.
With an input length of 1000, INT4 takes 32 s, while FP16 takes only 12 s.
### Expected Behavior
_No respon…
-
There's something wrong with my verification; could you give me some reference data?
My experiment is as follows:
1. PTQ → 2. QAT → 3. BN re-estimation → 4. fold_all_batch_norms_to_scale
afte…
-
## 📝 Description
While going through your YouTube video explanation of quantisation, I came across a doubt when validating the formulas for `scales` and `zero_point` for asymmetric and symmet…
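For reference when checking these formulas, here is a minimal NumPy sketch of the standard affine (asymmetric) and symmetric quantization parameter calculations. The function names and the uint8/int8 ranges are illustrative assumptions, not taken from the video; the formulas themselves are the commonly used ones (asymmetric: `scale = (max - min) / (qmax - qmin)` with a zero point that maps real 0.0 exactly; symmetric: `scale = max(|x|) / qmax` with `zero_point = 0`).

```python
import numpy as np

def asymmetric_qparams(x, qmin=0, qmax=255):
    """Affine (asymmetric) quantization parameters, assuming a uint8 target range.
    The range is widened to include 0 so that real 0.0 is exactly representable."""
    x_min = min(float(x.min()), 0.0)
    x_max = max(float(x.max()), 0.0)
    scale = (x_max - x_min) / (qmax - qmin)
    # zero_point is the quantized integer that corresponds to real value 0.0
    zero_point = int(round(qmin - x_min / scale))
    return scale, zero_point

def symmetric_qparams(x, qmax=127):
    """Symmetric quantization parameters, assuming an int8 target range.
    The scale covers the largest absolute value; zero_point is fixed at 0."""
    scale = float(np.abs(x).max()) / qmax
    return scale, 0

# Example: a tensor spanning [-1.0, 2.0]
x = np.array([-1.0, 0.0, 2.0])
print(asymmetric_qparams(x))  # scale = 3/255, zero_point = 85
print(symmetric_qparams(x))   # scale = 2/127, zero_point = 0
```

Note the design difference this exposes: the asymmetric scheme uses the full integer range for a skewed distribution, while the symmetric scheme wastes part of the range when the data is not centered on zero but keeps `zero_point = 0`, which simplifies the integer matmul.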
-
Does Minicpmv2.6 currently support int8/fp8 quantization?
thanks~