-
# Model Request
### Which model would you like to see in the model zoo?
A quantized MobileNet (it doesn't matter which version) would be fine. TensorFlow has published end-to-end quantized [MobileNet…
-
Hello,
I would like to train my model in a QAT scenario.
But from what I understand, during QAT only the forward-pass calculations are done in quantized mode, whereas the weights that are saved are…
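The behaviour described above is the usual fake-quantization setup: the forward pass runs on values rounded to the quantized grid, while the FP32 master weights are what gets updated and saved. A minimal stdlib-only sketch of that idea (illustrative, not any framework's actual implementation; the numbers are made up):

```python
# Sketch of fake quantization as used in QAT (illustrative only).
# Forward pass uses a quantized copy of the weights; the FP32 master
# weights are what the optimizer updates and what the checkpoint stores.

def fake_quantize(w, bits=8):
    """Round w onto a symmetric int grid, then dequantize back to float."""
    qmax = 2 ** (bits - 1) - 1                  # e.g. 127 for int8
    scale = max(abs(x) for x in w) / qmax or 1.0  # guard against all-zero w
    return [round(x / scale) * scale for x in w]

# FP32 master weights (this is what ends up in the saved checkpoint)
master = [0.31, -0.74, 0.05]

# Forward pass: compute with the quantized copy
w_q = fake_quantize(master)

# Backward pass (straight-through estimator): gradients computed w.r.t.
# the quantized values are applied directly to the FP32 master weights
grads = [0.01, -0.02, 0.005]
lr = 0.1
master = [w - lr * g for w, g in zip(master, grads)]
```

So the quantization error is seen during training, but the stored weights never leave FP32; they are only snapped to the grid at export time.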
-
### 🐛 Describe the bug
The strange thing is that when I train for only 100 epochs in FP32, the model quantizes normally, but when I train for 200 or more epochs and then try to do the quantization, the mod…
-
### OpenVINO Version
2024.0.0 - Current
### Operating System
Windows 10 Professional 2004 [Version 10.0.19041.1415]
### Device used for inference
CPU (Intel Xeon E-2288G CPU [Coffee Lak…
-
Hi:
I tried QAT on a model and exported the encodings. Then I used the qnn-onnx-converter with --quantization_overrides and --input_list, trying to feed the post-QAT min/max/scale values into the converte…
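For context, the file passed to `--quantization_overrides` is a JSON document of per-tensor encodings. A hedged sketch of building one in the AIMET-style encodings layout (the field names follow my understanding of that export format and may differ by SDK version; the tensor names `conv1.weight` and `relu1.out` are hypothetical placeholders):

```python
# Hedged sketch: assembling a quantization-overrides JSON in the
# AIMET-style encodings layout. Field names and layout are assumptions
# based on the AIMET export format, not a verified QNN schema.
import json

def encoding(minimum, maximum, bitwidth=8):
    """Derive scale/offset from a min/max range (asymmetric uint grid)."""
    steps = 2 ** bitwidth - 1
    scale = (maximum - minimum) / steps
    offset = round(minimum / scale) if scale else 0
    return {"bitwidth": bitwidth, "dtype": "int", "is_symmetric": "False",
            "min": minimum, "max": maximum, "scale": scale, "offset": offset}

overrides = {
    # hypothetical tensor names for illustration
    "activation_encodings": {"relu1.out": [encoding(0.0, 6.0)]},
    "param_encodings": {"conv1.weight": [encoding(-0.8, 0.8)]},
}

payload = json.dumps(overrides, indent=2)
```

Writing `payload` to a file and passing its path via `--quantization_overrides` is the intended flow; the converter should then prefer these ranges over its own calibration.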
-
## Description
TensorRT 10.5's pytorch-quantization has a compile bug.
https://github.com/NVIDIA/TensorRT/blob/release/10.5/tools/pytorch-quantization/src/tensor_quant_gpu.cu#L28-L37
It defines two macros `AT_DI…
-
-
```shell
python scripts/txt2img.py --prompt "a photograph of a huge bear, style of TIME magazine" --plms
/home/grayson/miniconda3/envs/ldm/lib/python3.8/site-packages/torchvision/io/image.py:13: UserWarning…
```
-
### Your current environment
```text
PyTorch version: 2.1.1+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Microsoft Windows 11 Home
GCC vers…
```
-
I tried quantizing Mamba using HuggingFace/Quanto and ran into the problem of perplexity on `lambada_openai` blowing up (> 1e37) at lower quantization levels, even though other tasks retained their …