-
We want to support running a full fine-tune with just 8-bit quantization.
-
I can train the ViT model from Hugging Face Transformers,
but when converting it to a TFLite model I get an error message that I can't resolve.
The TinyNN settings and the error are as follows…
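(The original settings and error are cut off in the excerpt above. Purely as a generic point of reference, a minimal TinyNN conversion of an HF ViT might look like the sketch below, assuming TinyNN's `TFLiteConverter` entry point; the checkpoint name, input shape, logits-only wrapper, and output path are assumptions, not the original configuration.)

```python
import torch
from transformers import ViTForImageClassification
from tinynn.converter import TFLiteConverter

# Hypothetical checkpoint; the issue's actual model and settings are truncated above.
model = ViTForImageClassification.from_pretrained("google/vit-base-patch16-224").eval()

# HF models return ModelOutput objects; tracing plain tensors is more reliable,
# so wrap the model to return only the logits tensor.
class LogitsOnly(torch.nn.Module):
    def __init__(self, m):
        super().__init__()
        self.m = m

    def forward(self, x):
        return self.m(pixel_values=x).logits

dummy_input = torch.randn(1, 3, 224, 224)
converter = TFLiteConverter(LogitsOnly(model), dummy_input, tflite_path="vit.tflite")
converter.convert()
```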
-
**Describe the bug**
When I run `examples/quantization_2of4_sparse_w4a16/llama7b_sparse_w4a16.py`, I encounter the error "No modifier of type 'SparseGPTModifier' found". The version I used is 0.3.0. …
-
### Your current environment
I want to deploy neuralmagic/DeepSeek-Coder-V2-Instruct-FP8 on 8 x NVIDIA L20 GPUs,
using --tensor-parallel-size=8 --enforce-eager --trust-remote-code --quantization=fp8 --kv…
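As a point of reference, those flags map onto vLLM's offline Python API roughly as sketched below; only the flags visible above are reproduced, and the truncated --kv… option is deliberately left out.

```python
from vllm import LLM, SamplingParams

llm = LLM(
    model="neuralmagic/DeepSeek-Coder-V2-Instruct-FP8",
    tensor_parallel_size=8,    # --tensor-parallel-size=8 (8 x L20)
    enforce_eager=True,        # --enforce-eager
    trust_remote_code=True,    # --trust-remote-code
    quantization="fp8",        # --quantization=fp8
)

out = llm.generate(["def quicksort(arr):"], SamplingParams(max_tokens=64))
print(out[0].outputs[0].text)
```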
-
### 🚀 The feature, motivation and pitch
Currently the QNN quantizer only supports PTQ (post-training quantization), and we'd like to enable QAT (quantization-aware training) for better quantization supp…
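As a point of reference for what QAT involves, here is a minimal sketch using PyTorch's generic eager-mode QAT flow; it does not use the QNN quantizer's API, and the tiny model and training loop are placeholders.

```python
import torch
import torch.nn as nn
import torch.ao.quantization as tq

class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.quant = tq.QuantStub()      # activations enter the quantized domain here
        self.conv = nn.Conv2d(3, 8, 3)
        self.relu = nn.ReLU()
        self.dequant = tq.DeQuantStub()  # and leave it here

    def forward(self, x):
        return self.dequant(self.relu(self.conv(self.quant(x))))

torch.backends.quantized.engine = "qnnpack"
model = TinyNet().train()
model.qconfig = tq.get_default_qat_qconfig("qnnpack")
tq.prepare_qat(model, inplace=True)      # insert fake-quant observers

# Placeholder training loop: fake-quant ops simulate int8 rounding during training.
opt = torch.optim.SGD(model.parameters(), lr=1e-3)
for _ in range(5):
    loss = model(torch.randn(4, 3, 32, 32)).sum()
    opt.zero_grad()
    loss.backward()
    opt.step()

model.eval()
quantized = tq.convert(model)            # fold observers into real int8 modules
```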
-
Hi,
My friend and I have been reading the code for a while, and we are looking for ideas for contributing.
@ankane, you mentioned product quantization in #27. Is this still an open issue? We would …
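In case it helps frame the discussion, here is a minimal illustration of the product-quantization idea in plain NumPy + scikit-learn; it is unrelated to this project's internals, and the subspace count and codebook size are arbitrary.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
vectors = rng.normal(size=(1000, 64)).astype(np.float32)   # toy dataset
M, K = 4, 16                                               # 4 subspaces, 16 centroids each
sub = vectors.shape[1] // M

# Train one k-means codebook per subspace.
codebooks = [KMeans(n_clusters=K, n_init=4, random_state=0)
             .fit(vectors[:, i * sub:(i + 1) * sub]).cluster_centers_
             for i in range(M)]

# Encode: each subvector becomes the index of its nearest centroid
# (one byte per subspace instead of 16 floats).
codes = np.stack([
    ((vectors[:, i * sub:(i + 1) * sub][:, None, :] - codebooks[i][None, :, :]) ** 2)
    .sum(-1).argmin(1)
    for i in range(M)
], axis=1).astype(np.uint8)

# Decode (lossy): look the centroids back up to approximate the original vectors.
approx = np.hstack([codebooks[i][codes[:, i]] for i in range(M)])
print("mean reconstruction error:", np.mean((vectors - approx) ** 2))
```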
-
%%capture
!pip install unsloth "xformers==0.0.28.post2"
# Also get the latest nightly Unsloth!
!pip uninstall unsloth -y && pip install --upgrade --no-cache-dir "unsloth[colab-new] @ git+https://gi…
-
Hi, thank you for this work. How can I quantize the model to int8? Any comments are appreciated.
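In case a generic starting point is useful: PyTorch's dynamic post-training quantization converts Linear weights to int8 with one call. The toy model below is only a stand-in; whether this approach fits this repo's model depends on its architecture.

```python
import torch

# Stand-in float model; replace with the model loaded from this repo.
model = torch.nn.Sequential(
    torch.nn.Linear(128, 256),
    torch.nn.ReLU(),
    torch.nn.Linear(256, 10),
).eval()

# Weights are stored as int8; activations are quantized on the fly at inference time.
quantized = torch.ao.quantization.quantize_dynamic(
    model, {torch.nn.Linear}, dtype=torch.qint8
)
print(quantized(torch.randn(1, 128)).shape)
```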
-
Hi, thanks for your great work!
I have a small question about KV cache quantization. Did you use PagedAttention to accelerate the 4-bit KV cache quantization? If so, where is the corresponding CUDA kerne…
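For context on what a 4-bit KV-cache path computes, here is a pure-PyTorch sketch of per-group asymmetric 4-bit quantization and dequantization; it is not taken from this repo, and in practice this logic would live inside a fused CUDA kernel rather than Python.

```python
import torch

def quantize_kv_4bit(kv, group_size=64):
    """Per-group asymmetric 4-bit quantization of a KV-cache tensor (reference only)."""
    x = kv.reshape(-1, group_size)
    mn = x.min(dim=1, keepdim=True).values
    mx = x.max(dim=1, keepdim=True).values
    scale = (mx - mn).clamp(min=1e-8) / 15.0      # 4 bits -> 16 levels
    q = ((x - mn) / scale).round().clamp(0, 15).to(torch.uint8)
    packed = q[:, 0::2] | (q[:, 1::2] << 4)       # two 4-bit codes per byte
    return packed, scale, mn

def dequantize_kv_4bit(packed, scale, zero, shape):
    lo, hi = packed & 0x0F, packed >> 4
    q = torch.stack([lo, hi], dim=-1).reshape(packed.shape[0], -1).float()
    return (q * scale + zero).reshape(shape)

kv = torch.randn(2, 4, 128)                       # e.g. [heads, tokens, head_dim]
packed, scale, zero = quantize_kv_4bit(kv)
print((dequantize_kv_4bit(packed, scale, zero, kv.shape) - kv).abs().max())
```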
-
Currently, Manticore uses the HNSW index over floats for its KNN search implementation. That can lead to excessive memory consumption, as all HNSW indexes must be loaded into RAM (for instance, one million 768-dimensional float32 vectors alone take roughly 3 GB before any graph overhead). One way to improve…