-
Hello,
I am trying to perform QAT on a ResNet50 network with BN layers, and I keep getting the following error:
```
ValueError: Shape must be rank 4 but is rank 5 for '{{node batch_normalization_…
-
Need to complete the section that deals with
Quantizing Inputs {#sec-quantize}
-
When running `examples/quantization/basic_usage_gpt_xl.py` an error occurs during the model packing:
```
2023-05-22 04:08:34 INFO [auto_gptq.quantization.gptq] duration: 0.16880011558532715
2023-…
-
Hi,
Thank you for the tutorial. I am using Python 3.7 to match the supported version, but I am having trouble quantizing the optimized model.
"Generating the quantization table:
Constant is not supporte…
-
Hi, how do I cast a float/bfloat16 tensor to FP8? I want to perform W8A8 (FP8) quantization, but I couldn't find an example of quantizing activations to the FP8 format.
-
Hey, I'm using the MX datatypes. It seems that `aten.linear.default` has not been implemented, which prevents the linear layers in the attention blocks from working with the MX datatypes.
Can you…
-
From my own experience with text generation models, I found that quantizing the output and embed tensors to f16 and the other tensors to q6_k (or q5_k) gives smaller files and better results than qu…
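For anyone wanting to reproduce this mix: llama.cpp's `llama-quantize` tool has per-tensor-type overrides for exactly this (flag names as of recent builds; check `llama-quantize --help` on your version, and treat the file names here as placeholders):

```shell
# Keep the output and token-embedding tensors at f16,
# quantize everything else to Q6_K:
./llama-quantize \
    --output-tensor-type f16 \
    --token-embedding-type f16 \
    model-f16.gguf model-q6_k-mix.gguf Q6_K
```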
-
Hi all,
I was trying to quantize my model but something strange popped up.
I am using TensorFlow v2.14 and tfmot v0.7.5
I have a sub-classed `tf.keras.Model`. It contains some custom layers and…
-
Hi. First of all, thanks for the awesome work! This issue is more of a question. I've been trying to quantize the yolov4 model (I excluded the postprocessing part of the model) by referencing this [tutor…
-
First of all, thank you for great work.
## System info
autoawq==0.1.8
## Details
While trying to quantize a GPT NeoX model, I encountered the error below.
```
>>> from awq import AutoAWQForCa…