-
Thanks for sharing this great research.
I tried to reproduce the results in the paper and ran into the following problem.
I tested Llama2-7b 4-16-16 (RTN) with `10_optimize_rotation.sh` and got a wikitext-2 ppl of 5.5, which…
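For anyone comparing against the RTN baseline: RTN simply rounds each weight to the nearest grid point, with no calibration. A minimal sketch of symmetric 4-bit round-to-nearest; the per-tensor scale choice here is my assumption for illustration, not necessarily what this repo does:

```python
def rtn_quantize(weights, n_bits=4):
    """Symmetric per-tensor RTN: pick a scale from the max magnitude,
    round each weight to the nearest integer level, clamp to range."""
    qmax = 2 ** (n_bits - 1) - 1          # e.g. 7 for signed 4-bit
    scale = max(abs(w) for w in weights) / qmax
    q = [max(-qmax - 1, min(qmax, round(w / scale))) for w in weights]
    # Dequantized weights actually used in the forward pass:
    return [v * scale for v in q], scale
```

Everything else (activations, KV cache) stays in 16-bit in the 4-16-16 setting, so only the weight tensor goes through this rounding.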
-
Post-training quantization (PTQ) without finetuning and quantization-aware training (QAT) both work fine, but
I get an error in PTQ with fast finetuning:
activation = layer.layer.acti…
-
**What**
- We propose supporting the GPTQ algorithm, a state-of-the-art post-training quantization (PTQ) method that has demonstrated robust performance,
effectively compressing weights. Notably, G…
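The gap GPTQ closes over plain RTN comes from quantizing weights sequentially and folding each rounding error into the weights not yet quantized, guided by inverse-Hessian information from calibration data. A toy sketch of just the error-feedback idea; the Hessian weighting is omitted here, so this illustrates the spirit of the method, not the actual algorithm:

```python
def quantize_with_feedback(weights, scale):
    """Quantize weights left to right; after each rounding, push the
    rounding error onto the next (still unquantized) weight. Real GPTQ
    distributes the error across all remaining weights, scaled by
    inverse-Hessian terms computed from calibration activations."""
    w = list(weights)
    q = []
    for j in range(len(w)):
        qj = round(w[j] / scale) * scale
        err = w[j] - qj
        q.append(qj)
        if j + 1 < len(w):
            w[j + 1] += err  # compensate later weights for this error
    return q
```

Compared with independent rounding, the compensated result tracks the running sum of the original weights more closely, which is why GPTQ loses less accuracy at low bit widths.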
-
### Search before asking
- [X] I have searched the YOLOv6 [issues](https://github.com/meituan/YOLOv6/issues) and found no similar feature requests.
### Description
Post-training quantization using…
-
1. X2bolt -d onnx -m model -i PTQ  # outputs model_ptq_input.bolt
2. ./post_training_quantization -p model_ptq_input.bolt -i INT8_FP32 -b true -q NOQUANT -c 0 -o false
3. Inference fails with the following error:
[ERROR] thread 121948 fil…
-
Thank you for the amazing work. I was able to set up BEVFusion inference using the model files given in the README.
I want to use this pipeline for BEVFusion trained on my own dataset, so as per the […
-
I want to use the QAT method for my model, but I can only find a PTQ quantizer in ExecuTorch. Are there any examples of how to implement quantization-aware training (QAT) for the QNN backend?
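Not an ExecuTorch/QNN-specific answer, but the core op that QAT adds to training is fake quantization: a quantize-dequantize step in the forward pass, so the network learns to tolerate rounding, while the backward pass treats it as identity (straight-through estimator). A framework-agnostic sketch of that op; the signature and int8 defaults are assumptions for illustration:

```python
def fake_quantize(x, scale, zero_point, qmin=-128, qmax=127):
    """Quantize-dequantize: the value the rest of the network sees
    during QAT. In a real framework the backward pass passes the
    gradient straight through (round() treated as identity)."""
    q = round(x / scale) + zero_point
    q = max(qmin, min(qmax, q))          # clamp to the int8 range
    return (q - zero_point) * scale      # back to float for the next layer
```

In frameworks this op is inserted after weights and activations during training; at export time the quantize-dequantize pairs are replaced by real integer kernels.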
-
When trying to run your code, we find that when running inference with the default fp16, peak memory usage is:
about 9800 MB
But when running inference with W8A8 (after PTQ), peak memory usage is:
…
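As a sanity check, weight storage alone should roughly halve going from fp16 to int8; if peak memory does not drop accordingly, common culprits are activations kept in fp16, a dequantized weight copy held alongside the int8 one, or framework workspace buffers. A back-of-the-envelope sketch (the parameter count below is a placeholder, not this model's):

```python
def weight_bytes(n_params, bits):
    """Bytes needed to store n_params weights at the given bit width."""
    return n_params * bits // 8

n = 7_000_000_000                      # placeholder, e.g. a 7B model
fp16_gb = weight_bytes(n, 16) / 2**30  # fp16 weight footprint in GiB
int8_gb = weight_bytes(n, 8) / 2**30   # int8 weight footprint: half
```

If measured peak memory stays flat after W8A8 PTQ, profiling which allocations dominate (weights vs. activations vs. workspace) usually locates the discrepancy.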
-
## Description
I want to finetune a quantized YOLO model and export it to TRT.
I carefully read the QDQ documentation and some existing issues on placing and removing unused QDQ nodes; the model has 92% int8…
-
# PTQ | Downloading videos with the ffmpeg tool on Windows
This is just a fun share! For anyone who doesn't know, ffmpeg is a great tool for downloading videos from places that… cannot be downloaded the usual…