-
Within Docker (image: nvidia/cuda:12.1.0-devel-ubuntu22.04)
GPU: A100 40GB
TensorRT-LLM version: 0.10.0
flash-attn: 2.5.9.post1
I quantized the Phi-3 model (phi-3-medium-128k-instrcut/), wi…
-
![f654737ebc54932e591723efc3d1c02](https://user-images.githubusercontent.com/47971541/191495874-577ca7c6-9dc6-4d53-8ce3-8c6a1e3a4226.png)
-
### 🐛 Describe the bug
- I'm reporting errors related to `capture_pre_autograd_graph` and `torch.compile` in QAT (a minimal sketch of this flow follows the list).
- Note: apologies if there are any misunderstandings.
- Based on th…
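
For context, here is a minimal sketch of the PT2E QAT flow these errors involve, assuming PyTorch 2.3-era APIs; the toy model, shapes, and quantizer choice are illustrative, not the reporter's actual setup:

```python
import torch
from torch._export import capture_pre_autograd_graph
from torch.ao.quantization.quantize_pt2e import prepare_qat_pt2e, convert_pt2e
from torch.ao.quantization.quantizer.xnnpack_quantizer import (
    XNNPACKQuantizer,
    get_symmetric_quantization_config,
)

# Toy stand-in for the reporter's model (placeholder).
model = torch.nn.Sequential(torch.nn.Linear(8, 8), torch.nn.ReLU())
example_inputs = (torch.randn(1, 8),)

# 1) Capture the graph before autograd, the step named in this issue.
exported = capture_pre_autograd_graph(model, example_inputs)

# 2) Insert fake-quant observers for QAT.
quantizer = XNNPACKQuantizer().set_global(
    get_symmetric_quantization_config(is_qat=True)
)
prepared = prepare_qat_pt2e(exported, quantizer)

# ... run the QAT fine-tuning loop on `prepared` here ...

# 3) Convert to a quantized model; torch.compile is applied afterwards.
quantized = convert_pt2e(prepared)
compiled = torch.compile(quantized)
```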
-
TinyNN can train the ViT model from Hugging Face Transformers,
but when converting it to a TFLite model, an error appears that I can't resolve.
The following are the TinyNN settings and the error…
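
For reference, a minimal TinyNN conversion sketch, assuming the standard `TFLiteConverter` API; the checkpoint name, input shape, and output path are illustrative placeholders:

```python
import torch
from transformers import ViTForImageClassification
from tinynn.converter import TFLiteConverter

model = ViTForImageClassification.from_pretrained("google/vit-base-patch16-224")
model.config.return_dict = False  # tracing needs tuple outputs, not ModelOutput dicts
model.eval()

dummy_input = torch.randn(1, 3, 224, 224)
converter = TFLiteConverter(model, dummy_input, tflite_path="vit.tflite")
converter.convert()
```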
-
My use case:
Apply post-training quantization to a .pth model and convert it to TFLite. The generated TFLite model fails the benchmark test with the following error message (a hedged sketch of this flow appears after the log):
STARTING!
Log parameter val…
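
The report doesn't name its toolchain, so purely as a hedged sketch of one common .pth-to-TFLite PTQ path (TinyNN's `PostQuantizer`); the model file, input shape, and calibration loop are placeholders:

```python
import torch
from tinynn.graph.quantization.quantizer import PostQuantizer
from tinynn.converter import TFLiteConverter

model = torch.load("model.pth")  # placeholder float model
model.eval()
dummy_input = torch.randn(1, 3, 224, 224)

# Rewrite the model with observers for post-training quantization.
quantizer = PostQuantizer(model, dummy_input, work_dir="out")
ptq_model = quantizer.quantize()

# Calibrate with representative data before converting.
with torch.no_grad():
    for _ in range(16):
        ptq_model(torch.randn(1, 3, 224, 224))

torch.backends.quantized.engine = "qnnpack"  # TinyNN's default backend
ptq_model = torch.quantization.convert(ptq_model)

converter = TFLiteConverter(ptq_model, dummy_input, tflite_path="out/model.tflite")
converter.convert()
```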
-
Hi, thanks for the repo you published on GitHub. I tried to use the [PTQ] and [√3-subdivision] links, and it seems they are broken. Could you please fix this?
Best
-
### Please describe your question
When quantizing UNIMO with PaddleSlim, I get the error: Operator (fusion_unified_decoding) is not registered.
After converting to a static graph, the 'fusion_unified_decoding' operator does not seem to be supported. Is there a way to support this operator (e.g., how to register it)? A registration sketch follows the log below.
> Preparation stage, Run batch:| …
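
For what it's worth, `fusion_unified_decoding` appears to ship with PaddleNLP's compiled FasterTransformer custom ops, so the likely fix is building and loading that library rather than writing a new op. As a hedged illustration of Paddle's generic custom-op registration mechanism (the source file names below are placeholders):

```python
from paddle.utils.cpp_extension import load

# JIT-compile a custom operator library and register its ops
# into the current process.
custom_ops = load(
    name="custom_jit_ops",
    sources=["fusion_unified_decoding_op.cc", "fusion_unified_decoding_op.cu"],
)
```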
-
I have used PTQ for INT8 export from a PyTorch model, and despite attempts at calibration there is a significant drop in detection accuracy.
I am moving to quantization-aware training to improve the…
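
For reference, a minimal eager-mode QAT sketch in PyTorch; the tiny network, backend, and shapes below are placeholders, not the reporter's detector:

```python
import torch
from torch.ao.quantization import (
    QuantStub,
    DeQuantStub,
    get_default_qat_qconfig,
    prepare_qat,
    convert,
)

class Net(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.quant = QuantStub()      # float -> quantized boundary
        self.conv = torch.nn.Conv2d(3, 16, 3)
        self.relu = torch.nn.ReLU()
        self.dequant = DeQuantStub()  # quantized -> float boundary

    def forward(self, x):
        return self.dequant(self.relu(self.conv(self.quant(x))))

model = Net()
model.train()
model.qconfig = get_default_qat_qconfig("fbgemm")
prepared = prepare_qat(model)

# ... fine-tune `prepared` so the weights adapt to fake-quant noise ...

prepared.eval()
quantized = convert(prepared)
out = quantized(torch.randn(1, 3, 32, 32))  # runs int8 kernels
```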
-
Would love to use this as a PTQ layer with TensorRT. Are there any plans to support that in the future?
-
I am trying to quantize a fine-tuned Llama 3 [model](https://huggingface.co/damerajee/Gaja-v1.00) and export it to a TensorRT engine. I am able to quantize the model, but I am unable to export t…