-
I am getting a "float division by zero" error whenever I try to quantize Mixtral-related models with AutoGPTQ; here is my code.
```
from transformers import AutoTokenizer, TextGenerationPipeli…
```
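For reference, a minimal sketch of the usual AutoGPTQ quantization flow is below; the Mixtral model id, output directory, and calibration text are placeholders, not taken from the truncated code above. With MoE models, one commonly reported cause of this error is a calibration set so small that some experts never receive any tokens, so the `examples` list is the first thing to check.
```python
# Minimal AutoGPTQ quantization sketch; model id, output dir, and
# calibration text are placeholders.
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

pretrained_model_dir = "mistralai/Mixtral-8x7B-v0.1"   # placeholder
quantized_model_dir = "mixtral-8x7b-gptq-4bit"         # placeholder

tokenizer = AutoTokenizer.from_pretrained(pretrained_model_dir, use_fast=True)

# Calibration examples: for an MoE model these should be numerous and varied
# enough that every expert sees some tokens during calibration.
examples = [
    tokenizer("auto-gptq is an easy-to-use model quantization library.")
]

quantize_config = BaseQuantizeConfig(
    bits=4,          # quantize weights to 4-bit
    group_size=128,  # per-group scales
    desc_act=False,
)

model = AutoGPTQForCausalLM.from_pretrained(pretrained_model_dir, quantize_config)
model.quantize(examples)
model.save_quantized(quantized_model_dir)
```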
-
I want to speed up inference of the `codeformer.pth` model. How can I optimize and quantize it?
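One low-effort option is PyTorch dynamic quantization of the Linear (transformer) layers; a minimal sketch is below. The `basicsr.archs.codeformer_arch.CodeFormer` import and the `params_ema` checkpoint key are assumptions based on the CodeFormer repo layout, and the convolutional layers stay in float (static PTQ or an ONNX/TensorRT export would be needed to speed those up).
```python
# Sketch: load the checkpoint, then apply dynamic int8 quantization to the
# nn.Linear layers only. Import path and checkpoint key are assumptions.
import torch
import torch.nn as nn

from basicsr.archs.codeformer_arch import CodeFormer  # assumed repo layout

model = CodeFormer()  # constructor defaults assumed; see the repo's inference script
state = torch.load("codeformer.pth", map_location="cpu")
model.load_state_dict(state.get("params_ema", state))  # key assumed
model.eval()

# Dynamic quantization: weights stored as int8, activations quantized on the
# fly. Only the Linear layers (the transformer blocks) are affected.
quantized = torch.ao.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

print(quantized)  # Linear layers should now show up as DynamicQuantizedLinear
```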
-
Hi,
I am trying to apply the generate recipe on a quantized Llama 3.1 8B model but run into the following error:
```
...
File "/home/mreso/torchtune/torchtune/modules/attention.py", line 211, …
-
I'm wondering if I can get an easier pipeline by loading the AWQ weights with vLLM:
```
from vllm import LLM, SamplingParams
prompts = [
"Hello, my name is",
"The president of the Uni…
-
I dumped a quantized Llama-3-8B model from LMQuant using QoQ; the command is as follows, as written in
[lmquant](https://github.com/mit-han-lab/lmquant/tree/main)/[projects](https://github.com/mit-han-l…
-
Traceback (most recent call last):
File "train_mobilenetv2_quantization.py", line 368, in
base_model = quantize_model(base_model)
File "/home/huangfei/anaconda3/envs/ImageSearch2/lib/pytho…
-
Hi, when I try Quantization-Aware Training on my model, I get the following error in my `CustomLayerMaxPooling1D`:
---------------------------------------------------------------------------
…
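In case this is Keras QAT via tensorflow_model_optimization (an assumption; the truncated post doesn't say which framework), custom layers usually need to be wrapped with `quantize_annotate_layer` and a `QuantizeConfig`, and deserialized inside `quantize_scope`. A minimal sketch with a stand-in pooling layer follows; the layer body and model are illustrative only.
```python
import tensorflow as tf
import tensorflow_model_optimization as tfmot

quantize_annotate_layer = tfmot.quantization.keras.quantize_annotate_layer
quantize_annotate_model = tfmot.quantization.keras.quantize_annotate_model
quantize_apply = tfmot.quantization.keras.quantize_apply
quantize_scope = tfmot.quantization.keras.quantize_scope


class NoOpQuantizeConfig(tfmot.quantization.keras.QuantizeConfig):
    """Pass-through config: quantize nothing inside the custom layer itself."""

    def get_weights_and_quantizers(self, layer):
        return []

    def get_activations_and_quantizers(self, layer):
        return []

    def set_quantize_weights(self, layer, quantize_weights):
        pass

    def set_quantize_activations(self, layer, quantize_activations):
        pass

    def get_output_quantizers(self, layer):
        return []

    def get_config(self):
        return {}


# Hypothetical stand-in for the post's CustomLayerMaxPooling1D.
class CustomLayerMaxPooling1D(tf.keras.layers.MaxPooling1D):
    pass


model = quantize_annotate_model(tf.keras.Sequential([
    tf.keras.layers.Conv1D(16, 3, activation="relu", input_shape=(64, 1)),
    # Annotate the custom layer so quantize_apply knows how to handle it
    # instead of failing on an unrecognized layer type.
    quantize_annotate_layer(CustomLayerMaxPooling1D(pool_size=2),
                            quantize_config=NoOpQuantizeConfig()),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10),
]))

with quantize_scope({"CustomLayerMaxPooling1D": CustomLayerMaxPooling1D,
                     "NoOpQuantizeConfig": NoOpQuantizeConfig}):
    qat_model = quantize_apply(model)
```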
-
Hi! When I run `python quant.py --quant_mode test --subset_len 1 --batch_size 1 --deploy`, I get this error:
[VAIQ_NOTE]: =>Quantizable module is generated.(quantize_result/Model.py)
[VAIQ_NOTE]:…
-
### 🐛 Describe the bug
When a user tries to use `convert_fx` on a model that is on CUDA, the error message doesn't make sense. We should either throw an error message which asks the user to move th…
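For anyone hitting this in the meantime, the usual workaround is to move the model to CPU before running FX graph mode quantization; a minimal sketch with a toy model (not the reporter's) is below.
```python
import torch
from torch.ao.quantization import get_default_qconfig_mapping
from torch.ao.quantization.quantize_fx import prepare_fx, convert_fx

# Toy float model standing in for the user's CUDA model.
model = torch.nn.Sequential(torch.nn.Linear(16, 16), torch.nn.ReLU()).eval()
if torch.cuda.is_available():
    model = model.cuda()

example_inputs = (torch.randn(1, 16),)

# Workaround: FX graph mode quantization expects a CPU model, so move it back
# to CPU before prepare_fx/convert_fx instead of converting the CUDA model.
model = model.cpu()
qconfig_mapping = get_default_qconfig_mapping("fbgemm")
prepared = prepare_fx(model, qconfig_mapping, example_inputs)

# Calibrate with representative data (random here, for illustration only).
with torch.no_grad():
    prepared(*example_inputs)

quantized = convert_fx(prepared)
```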
-
I'm on an Apple Silicon Mac trying to convert a CoreML model for `large-v3-turbo-q5_0`.
What is needed in order to convert this model?
```
./models/generate-coreml-model.sh large-v3-turbo-q5_0
…
```