-
### Describe the issue
Reduce-range does not improve the metric
### To reproduce
I'm using the reduce-range feature. Quantization is calculated symmetrically, in QDQ format, for int8.
But…
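For context, a minimal self-contained sketch of the setup described above, using `quantize_static` from `onnxruntime.quantization` (paths, the input name, and the calibration data are placeholders):

```python
import numpy as np
from onnxruntime.quantization import (
    CalibrationDataReader, QuantFormat, QuantType, quantize_static,
)

class _Reader(CalibrationDataReader):
    """Placeholder reader; real code should feed representative inputs."""
    def __init__(self):
        self._it = iter(
            {"input": np.zeros((1, 3, 224, 224), np.float32)}  # assumed input name/shape
            for _ in range(8)
        )

    def get_next(self):
        return next(self._it, None)  # None signals end of calibration data

quantize_static(
    "model.onnx",                     # placeholder input path
    "model_int8.onnx",                # placeholder output path
    calibration_data_reader=_Reader(),
    quant_format=QuantFormat.QDQ,     # QDQ format, as in the report
    activation_type=QuantType.QInt8,  # symmetric int8
    weight_type=QuantType.QInt8,
    reduce_range=True,                # the reduce-range feature in question
)
```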
-
Multi-vector MaxSim is increasingly important and we have optimizations for float cell precision, but I think we should also consider optimizing for int8 with hamming, as it approximates the dot product f…
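To make the idea concrete, here is a small NumPy sketch (my own illustration, not code from this project) of MaxSim over bit-packed multi-vectors, using `bits - hamming` as a proxy for the dot product:

```python
import numpy as np

def hamming_maxsim(query_vecs: np.ndarray, doc_vecs: np.ndarray) -> float:
    """MaxSim over bit-packed uint8 vectors of shape (n_vectors, n_bytes):
    for each query vector, take the most similar document vector and sum."""
    xor = query_vecs[:, None, :] ^ doc_vecs[None, :, :]   # differing bits per pair
    dist = np.unpackbits(xor, axis=-1).sum(axis=-1)       # Hamming distance matrix
    sim = query_vecs.shape[1] * 8 - dist                  # dot-product proxy
    return float(sim.max(axis=1).sum())                   # MaxSim reduction
```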
-
I am attempting to emit PyTorch code, but unfortunately it does not work for fp8, bf16, or int8. I have tried to patch the converter type dict: https://github.com/OrenLeung/cutlass/commit/6d619c964eb8b…
-
Hi, can you share best practices for quantizing CNN models?
Is ModelOpt PTQ the way to go with TensorRT for CNN models (ResNet, RetinaNet, etc.)? I was able to quantize RetinaNet…
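Not authoritative, but the PTQ flow I'd expect with NVIDIA ModelOpt looks roughly like this (the config choice and the dummy calibration data are assumptions on my side; real calibration should iterate a validation loader):

```python
import torch
import torchvision
import modelopt.torch.quantization as mtq

model = torchvision.models.retinanet_resnet50_fpn(weights="DEFAULT").eval()

def forward_loop(m):
    # calibrate on a few representative batches (dummy inputs here for brevity)
    for _ in range(8):
        m([torch.rand(3, 480, 640)])

model = mtq.quantize(model, mtq.INT8_DEFAULT_CFG, forward_loop)
# then export to ONNX and build the TensorRT engine as usual
```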
-
### 🚀 The feature, motivation and pitch
**Feature motivation:**
[Default PyTorch quantization-aware training](https://pytorch.org/docs/stable/quantization.html) uses a "fake-quantization" approach. Fo…
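For readers unfamiliar with the term, a minimal hand-rolled sketch of what fake quantization does in the forward pass (an illustration, not PyTorch's actual observer machinery):

```python
import torch

class FakeQuant(torch.nn.Module):
    """Simulate int8 quantization in float: round/clamp to the int8 grid,
    dequantize, and pass gradients straight through."""
    def __init__(self, scale: float = 0.1, qmin: int = -128, qmax: int = 127):
        super().__init__()
        self.scale, self.qmin, self.qmax = scale, qmin, qmax

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        q = torch.clamp(torch.round(x / self.scale), self.qmin, self.qmax)
        dq = q * self.scale
        # straight-through estimator: forward uses dq, backward sees identity
        return x + (dq - x).detach()
```

During real QAT the scale would be learned or derived from observed ranges; the fixed scale here is only for illustration.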
-
### Feature request
Currently, the Qwen series cannot use int4/int8 quantization under the vLLM and SGLang engines.
### Motivation
Currently, the Qwen series cannot use int4/int8 quantization under the vLLM and SGLang engines.
### Your contribution
None
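For reference, the call pattern this request implies, assuming vLLM's standard `quantization` argument (a sketch; the point of the issue is that this does not currently work for these models):

```python
from vllm import LLM, SamplingParams

# Hypothetical invocation once int4 GPTQ support lands for the Qwen series:
llm = LLM(model="Qwen/Qwen2-7B-Instruct-GPTQ-Int4", quantization="gptq")
outputs = llm.generate(["Hello"], SamplingParams(max_tokens=32))
```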
-
hello!
I built the int8 weights with:
```
INFERENCE_PRECISION=float16
WEIGHT_ONLY_PRECISION=int8
MAX_BEAM_WIDTH=4
MAX_BATCH_SIZE=8
checkpoint_dir=whisper_large_v3_weights_${WEIGHT_ONLY_PRECISION}
output_dir…
```
-
### Describe the issue
Hello,
I'm trying to quantize an ONNX model to INT8 using the ONNX Runtime tools provided [here](https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/python/tools/…
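In case it helps while debugging, dynamic (weight-only) quantization from the same toolkit needs no calibration data and makes a quick baseline to compare against static INT8 (a sketch; paths are placeholders):

```python
from onnxruntime.quantization import QuantType, quantize_dynamic

# Weights are quantized offline; activations are quantized at runtime,
# so no CalibrationDataReader is required for this path.
quantize_dynamic("model.onnx", "model_int8_dyn.onnx", weight_type=QuantType.QInt8)
```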
-
Qwen has released some quantized models:
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int4
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8
since t…
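For what it's worth, loading one of these checkpoints with recent `transformers` should be the usual call, since the GPTQ config ships inside the repo (a sketch, assuming a GPTQ backend such as auto-gptq is installed):

```python
from transformers import AutoProcessor, Qwen2VLForConditionalGeneration

model_id = "Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int4"
model = Qwen2VLForConditionalGeneration.from_pretrained(model_id, device_map="auto")
processor = AutoProcessor.from_pretrained(model_id)
```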