-
I ran into the following error and don't know why; the data coming out of the dataloader should be correct.
Mon Apr 08 21:36:18-INFO: Collect quantized variable names ...
Sampling stage, Run batch:| | 0/100
Traceback (most r…
-
### 🚀 Feature request
Quantization is a widely used technique to accelerate models, particularly when using the [torch.compile](https://pytorch.org/tutorials/intermediate/torch_compile_tutorial.htm…
-
Hey everyone,
I want to train the model on CPU so that I can apply PyTorch post-training dynamic quantization afterwards, and then run eval.py on CUDA to speed up inference.
I am having trouble running the m…
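For context, post-training dynamic quantization converts weights to int8 ahead of time and computes activation scales at runtime (in PyTorch this is done via `torch.ao.quantization.quantize_dynamic`). The underlying symmetric int8 weight mapping can be sketched in plain Python; the helper names below are illustrative, not the PyTorch API:

```python
# Minimal sketch of symmetric per-tensor int8 quantization, the scheme
# dynamic quantization typically applies to linear-layer weights.
# Helper names are illustrative, not a real library API.

def quantize_symmetric_int8(weights):
    """Map float weights to int8 codes with a single per-tensor scale."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # round to nearest code, saturating to the int8 range
    return [max(-128, min(127, round(w / scale))) for w in weights], scale

def dequantize(q, scale):
    """Recover approximate float values from int8 codes."""
    return [qi * scale for qi in q]

w = [0.5, -1.27, 0.02, 1.0]
q, scale = quantize_symmetric_int8(w)
w_hat = dequantize(q, scale)  # close to w, within one quantization step
```

The round trip error is bounded by half a quantization step (`scale / 2`) per element, which is why well-scaled weights tolerate int8 much better than outlier-heavy activations.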
-
### System Info
CPU-X86
GPU-H100
Server XE9640
Code: TensorRT-LLM 0.8.0 release
### Who can help?
@Tracin @juney-nvidia
Regarding the [FP8 Post Quantization](https://github.com/NVIDIA/Tenso…
-
I am wondering if PGB supports post-training quantization, for instance like what we have for fastText: https://flavioclesio.com/2019/03/22/post-training-quantization-in-fasttext-or-how-to-shrink-your-fas…
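For reference, the fastText quantization described in that post is based on product quantization: each embedding vector is split into sub-vectors, and each sub-vector is replaced by the index of its nearest centroid in a small per-subspace codebook. A toy sketch of the encode/decode step (codebooks are hard-coded here for illustration; fastText learns them with k-means):

```python
# Toy product-quantization encode/decode, the idea behind fastText's
# quantize step. Real implementations learn the codebooks with k-means;
# they are hard-coded here for illustration.

def pq_encode(vec, codebooks):
    """Split vec into len(codebooks) sub-vectors; keep nearest-centroid indices."""
    d = len(vec) // len(codebooks)
    codes = []
    for i, book in enumerate(codebooks):
        sub = vec[i * d:(i + 1) * d]
        # nearest centroid by squared Euclidean distance
        codes.append(min(range(len(book)),
                         key=lambda c: sum((a - b) ** 2 for a, b in zip(sub, book[c]))))
    return codes

def pq_decode(codes, codebooks):
    """Reconstruct an approximate vector from stored centroid indices."""
    out = []
    for code, book in zip(codes, codebooks):
        out.extend(book[code])
    return out

codebooks = [
    [(0.0, 0.0), (1.0, 1.0)],   # centroids for dims 0-1
    [(0.0, 1.0), (1.0, 0.0)],   # centroids for dims 2-3
]
codes = pq_encode([0.9, 1.1, 0.1, 0.8], codebooks)
approx = pq_decode(codes, codebooks)
```

Storage shrinks because each sub-vector is replaced by one small integer index instead of several floats, at the cost of reconstruction error.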
-
From the issue "https://developer.apple.com/forums/thread/740518 how do we use the computational power of A17 Pro Neural Engine?"
I learned that if I want to run inference with my mlmodel on my iPad Pro with …
-
First of all, thanks to all of you for this great project!
Currently, the model does not seem to support int8 quantization. Are there any plans for it?
-
The survey discusses the sensitivity of activation quantization and the tolerance of KV cache quantization in the context of post-training quantization (PTQ) for large language models (LLMs). It makes…
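The activation sensitivity the survey refers to is largely an outlier effect: one large activation stretches a per-tensor int8 scale so much that the small values collapse onto a handful of codes. A toy illustration in plain Python (the helper is illustrative, not from any library):

```python
# Toy illustration of why activation outliers hurt per-tensor int8
# quantization: a single large value stretches the scale, so the small
# values all round to code 0 and lose their information.

def int8_roundtrip(xs):
    """Quantize to symmetric int8 and dequantize with one per-tensor scale."""
    scale = max(abs(x) for x in xs) / 127.0
    return [round(x / scale) * scale for x in xs]

acts = [0.01, 0.02, -0.015, 60.0]   # 60.0 is an outlier
recon = int8_roundtrip(acts)
# scale ~ 0.47, so the three small activations all come back as 0.0
```

This is one reason per-channel or per-group scales, and outlier-aware schemes, are common for activations, while weight and KV-cache tensors with a tighter range quantize more gracefully.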
-
### Describe the issue
After quantization, the output ONNX model had faster inference speed and smaller model size, but why are the input and output tensors still float32?
I thought it should be u…
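Float32 graph inputs and outputs are expected for QDQ-format models: the quantizer wraps each quantized op in QuantizeLinear/DequantizeLinear nodes, so only the internal tensors are int8 while the model boundary stays float32. The semantics of that node pair, sketched in plain Python (per-tensor uint8, simplified):

```python
# Sketch of ONNX QuantizeLinear / DequantizeLinear semantics
# (per-tensor, uint8, simplified). In a QDQ-format model these nodes
# sit at the boundary of each quantized op, which is why the graph's
# own inputs and outputs remain float32.

def quantize_linear(x, scale, zero_point):
    """float32 -> uint8 codes, saturating to [0, 255]."""
    return [max(0, min(255, round(v / scale) + zero_point)) for v in x]

def dequantize_linear(q, scale, zero_point):
    """uint8 codes -> float32 approximation."""
    return [(qi - zero_point) * scale for qi in q]

x = [-0.5, 0.0, 0.5, 1.0]
scale, zp = 1.5 / 255, 85          # covers roughly [-0.5, 1.0]
q = quantize_linear(x, scale, zp)
x_hat = dequantize_linear(q, scale, zp)
```

If true int8 graph I/O is needed, the quantize/dequantize pair at the boundary has to be stripped or folded explicitly; by default the exported model keeps the float32 interface.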
-
Hi,
I ran into an issue when I tried to run **6.1 normal inference** and **6.2 inference with mixed precision** following your instructions. Something went wrong:
**For 6.1 normal inference:**
(viditq…