-
### Your current environment
pip3 install vllm==0.4.2 nvidia-ammo==0.7.1
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: …
-
We currently only support continuous-value embeddings (a one-to-many FFN). We should try other approaches, such as supporting quantization.
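A continuous-value embedding of this kind can be sketched as a small FFN that maps one scalar to a vector. The layer sizes and the ReLU activation below are illustrative assumptions, not the project's actual architecture:

```python
import numpy as np

# One-to-many FFN: a single continuous scalar is mapped to a
# d_embed-dimensional embedding vector. Sizes are illustrative.
rng = np.random.default_rng(0)
d_hidden, d_embed = 16, 8
W1 = rng.standard_normal((d_hidden, 1))        # scalar -> hidden
b1 = np.zeros(d_hidden)
W2 = rng.standard_normal((d_embed, d_hidden))  # hidden -> embedding

def embed(x: float) -> np.ndarray:
    """Map one continuous value to a d_embed-dimensional vector."""
    h = np.maximum(W1[:, 0] * x + b1, 0.0)  # ReLU
    return W2 @ h

v = embed(3.14)
print(v.shape)  # (8,)
```

Because the network is a plain FFN, its weights are themselves candidates for the quantization mentioned above.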
-
### Question
I downloaded llava-llama-2-13b from:
https://huggingface.co/liuhaotian/llava-llama-2-13b-chat-lightning-preview
Then I quantized the model to 4-bit using:
```
git clone htt…
```
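4-bit weight quantization can be illustrated generically. The sketch below is plain round-to-nearest with one scale per weight group, not the specific method implemented by the repository cloned above (real 4-bit schemes such as AWQ, GPTQ, or NF4 are more sophisticated):

```python
import numpy as np

def quantize_4bit(w: np.ndarray, group_size: int = 64):
    """Round-to-nearest 4-bit quantization with a per-group scale.

    Generic illustration only; not the repo's actual algorithm.
    """
    w = w.reshape(-1, group_size)
    scale = np.abs(w).max(axis=1, keepdims=True) / 7.0  # int4 range [-8, 7]
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: np.ndarray) -> np.ndarray:
    return q * scale

w = np.random.default_rng(0).standard_normal(256).astype(np.float32)
q, s = quantize_4bit(w)
print(q.dtype, q.min(), q.max())  # int8 values confined to the 4-bit range
```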
-
Hello,
I am trying to perform a QAT on a ResNet50 network with BN layers, and I keep getting the following error:
```
ValueError: Shape must be rank 4 but is rank 5 for '{{node batch_normalization_…
```
-
### Checklist
- [X] 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.…
-
Quantizing Stable Diffusion 3.5 models to any kind of k-quants results in large files made up mostly of fp16 weights. That's because
a lot of tensors have width 2432 or 7296, which do not fit in the…
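The size mismatch is easy to check: k-quants in GGUF pack weights into super-blocks of 256 elements (`QK_K` in llama.cpp), and treating the row width as the quantity that must be divisible by 256 is an assumption about how the quantizer decides fallbacks:

```python
# k-quants pack rows in super-blocks of QK_K = 256 elements; rows whose
# width is not a multiple of 256 fall back to a non-k format (e.g. fp16).
QK_K = 256

remainders = {width: width % QK_K for width in (2432, 7296)}
print(remainders)  # both widths leave a remainder of 128
```

Since neither 2432 nor 7296 is a multiple of 256, every such tensor is stored unquantized, which explains the large files.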
-
Hi all,
I was trying to quantize my model but something strange popped up.
I am using TensorFlow v2.14 and tfmot v0.7.5
I have a sub-classed tf.Keras.Model. It contains some custom layers and…
-
I'm trying to compare the accuracies of ResNet and its quantized version. First, I downloaded the resnet_v1 saved_model and used TensorFlow's freeze_graph tool to freeze the graph.
I then followe…
-
### What happened?
Appending `--help` does not print the help text immediately; instead it starts quantization or throws an error:
```shell
./llama-quantize model-bf16.gguf --help IQ4_NL
./llama-quantize model-bf16.g…
```
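A plausible cause (an assumption about the parser, not a reading of llama.cpp's source) is that positionals are consumed left to right before flags are inspected. A robust parser checks for `--help` anywhere in the argument list first; a minimal sketch of that pattern:

```python
from typing import Optional

def parse_args(argv: list) -> Optional[dict]:
    """Return parsed args, or None if help was requested.

    Scanning for --help *before* consuming positionals means
    `prog model.gguf --help IQ4_NL` prints help instead of
    starting work on model.gguf.
    """
    if "--help" in argv or "-h" in argv:
        print("usage: prog <input.gguf> <type>")
        return None
    positionals = [a for a in argv if not a.startswith("-")]
    return {"input": positionals[0], "type": positionals[1]}

print(parse_args(["model-bf16.gguf", "--help", "IQ4_NL"]))  # None: help wins
```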
-
Is it possible to do fine-tuning with the models quantized, i.e. using QLoRA?
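That is exactly what QLoRA does: the base model is loaded in 4-bit (NF4) and only small LoRA adapters are trained in higher precision. A configuration sketch using Hugging Face `transformers`, `bitsandbytes`, and `peft`; the model id and hyperparameters are illustrative placeholders:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# 4-bit NF4 quantization of the frozen base weights (QLoRA-style).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # placeholder model id
    quantization_config=bnb_config,
)

# Small trainable LoRA adapters on top of the quantized base.
lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # typical for LLaMA-style models
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapters are trainable
```

The resulting model can then be passed to a standard `transformers` `Trainer`; gradients flow only through the LoRA adapters while the 4-bit base weights stay frozen.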