post-training-quantization Search Results

1000+ results for post-training-quantization


thu-nics/qllm-eval #6

Does KV cache belong to Activation?

The survey discusses the sensitivity of activation quantization and the tolerance of KV cache quantization in the context of post-training quantization (PTQ) for large language models (LLMs). It makes…

pprp updated 5 months ago · 1 comment
tensorflow/tpu #549

efficientnet top1 acc 15%+ down after post-training quantiza…

using checkpoint: https://storage.googleapis.com/cloud-tpu-checkpoints/efficientnet/ckptsaug/efficientnet-b0.tar.gz export_model.py setting: `python export_model.py --ckpt_dir=efficientne…

imcaspar updated 4 years ago · 14 comments
tensorflow/models #8935

TF2 Object Detect API Quantization Aware Training

Great to see the Tensorflow 2 Object Detect API has been released. One feature I'm very interested in is quantization aware training (as is supported in the Tensorflow 1 version). I'm assuming it's …

mm7721 updated 1 month ago · 22 comments
NVIDIA/TensorRT-Model-Optimizer #51

Weird Bug when QAT training with HfArgumentParser

take the following code as simple example: > parser = transformers.HfArgumentParser( > (ModelArguments, DataArguments, TrainingArguments, LoraArguments) > ) > ( > _m…

ShadowTeamCN updated 1 month ago · 10 comments
huggingface/peft #2016

Cannot apply both PEFT QLoRA and DeepSpeed ZeRO3

### System Info ```Shell - `Accelerate` version: 0.33.0 - Platform: Linux-5.15.133+-x86_64-with-glibc2.35 - `accelerate` bash location: /opt/conda/bin/accelerate - Python version: 3.10.14 - Nu…

echo-yi updated 1 week ago · 9 comments
openvinotoolkit/nncf #2766

[TorchFX] Torch FX/PyTorch 2 Export Quantization

### 🚀 Feature request Quantization is a widely used technique to accelerate models, particularly when using the [torch.compile](https://pytorch.org/tutorials/intermediate/torch_compile_tutorial.htm…

alexsu52 updated 3 months ago · 2 comments
tensorflow/models #9320

[TF2] [Object Detection API] Post training quantization for …

# Prerequisites Please answer the following question for yourself before submitting an issue. - [x] I checked to make sure that this issue has not been filed already. ## 1. The entire URL of …

ItsMeTheBee updated 3 years ago · 4 comments
bytecodealliance/wasm-micro-runtime #2611

WASI-NN should not apply input quantization

Currently, the TFLite wasi-nn implementation performs quantization if quantization scale and zero-point exist (https://github.com/bytecodealliance/wasm-micro-runtime/blob/main/core/iwasm/libraries/was…

CIPop updated 3 months ago · 5 comments
mlc-ai/mlc-llm #2273

Phi-3 mini 4k instruct with MICROSOFT's quantization

## ⚙️ Request New Models - Link to an existing implementation (e.g. Hugging Face/Github): https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-gguf - Is this model architecture supported by ML…

federicoparra updated 2 months ago · 3 comments
apple/coremltools #2227

need help about both model weight and activation quantizatio…

from the issue "https://developer.apple.com/forums/thread/740518 how do we use the computational power of A17 Pro Neural Engine?" I learn that if i want to inference my mlmodel on my ipad pro with …

AndreaChiChengdu updated 4 months ago · 1 comment
