ptq Search Results - Githubissues

1000+ results
for ptq

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Xiuyu-Li/q-diffusion #14

Why this quantization model need more than 24GB GPU memory …

### 1、Questions As we Known, SD v1.5 has 1 Billions params , and it's peek GPU memory is about 4G at the precison fp32. So, the memory of int4 precison (sd_w4a8_chpt.pth) will be about 4G/8 = 500…

felixslu updated 4 months ago
4
IntelLabs/distiller #520

Weights not properly quantized during Quantization Aware Tra…

Hi, I'm working on applying QAT on a model. I made the necessary modifications. However, when I looked into one of the saved checkpoint `.pth` files, I observed that none of the weights were actually …

shazib-summar updated 4 years ago
2
pytorch/ao #987

[RFC] Long Term QAT Flow

Currently torchao QAT has two APIs, [tensor subclasses](https://github.com/pytorch/ao/blob/a4221df5e10ff8c33854f964fe6b4e00abfbe542/torchao/quantization/prototype/qat/api.py#L41) and [module swap](htt…

andrewor14 updated 1 month ago
7
alibaba/TinyNeuralNetwork #374

How to quantize ViT model with quantization aware training

It can train the ViT model from the Hugging Face transformer, but when converting to tflite model it appear an error message that I can't solve it. The following are the tinynn setting and the error…

Linsop2 updated 1 week ago
3
pytorch/pytorch #128114

Import Error: cannot import name 'XNNPACKQuantizer' from 'to…

### 🐛 Describe the bug from torch.ao.quantization.quantizer import ( XNNPACKQuantizer, get_symmetric_quantization_config, ) the code abve report error: ImportError: cannot import name 'X…

1826133674 updated 3 months ago
3
Mandylove1993/CUDA-FastBEV #21

Is there a tutorial that helps me perform real-time detectio…

I want to apply this algorithm to a Jetson AGX Orin development board. However, many difficulties were encountered, such as failed installation of libraries such as mmcv and mmdet. Pyquaternion inst…

polarbear122 updated 11 months ago
3
kendryte/nncase #1261

可以提供一下动态shape真实能用的例子吗

**Describe the bug** 非常痛苦动态shape根本转不出来 **To Reproduce** ```python import nncase import numpy as np import onnx import onnxsim # from nncase_base_func import model_simplify, read_model_fil…

willdla updated 3 weeks ago
1
ROCm/AMDMIGraphX #2515

Add INT8 example for BERT, Distil, and GPT2

Add one example to repo and DLM

causten updated 11 months ago
1
pytorch/pytorch #90289

quantization qconfig: can we set per-channel quant as defaul…

### 🐛 Describe the bug The current default qconfig for qnnpack is per-tensor quantization. Can we update the default qnnpack qconfig to per-channel quantization? I heard that per-channel has been s…

vkuzo updated 1 year ago
3
sympy/sympy #16949

solve is too slow for small systems of equations

The following problem is solved almost instantaneously in both SageMath and Matlab, but it takes much longer in sympy. The problem has 8 solutions. If I use `manual=True`, solve is very fast, but it f…

mtomassoli updated 5 years ago
3

上一页 1...14 15 16 17 18 19 20...100 下一页

1000+ results for ptq

1000+ results
for ptq