-
Hello! First of all, great job with this inference engine! Thanks a lot for your work!
Here's my issue: I have run vLLM with both a Mistral Instruct model and its AWQ-quantized version. I've quant…
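For reference, the kind of call I mean, sketched with vLLM's offline API (the AWQ checkpoint name is only an example; any AWQ-quantized Mistral Instruct model fits here):

```python
# Minimal sketch: loading an AWQ-quantized Mistral Instruct checkpoint with
# vLLM's offline API. The model id is only an example.
from vllm import LLM, SamplingParams

llm = LLM(model="TheBloke/Mistral-7B-Instruct-v0.2-AWQ", quantization="awq")
outputs = llm.generate(["[INST] Hello! [/INST]"], SamplingParams(max_tokens=64))
print(outputs[0].outputs[0].text)
```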
-
I'm trying to follow [this](https://github.com/mit-han-lab/llm-awq#install) to install AWQ,
but I failed at step 3.
## My Env
```
OS: Windows 11
GPU: NVIDIA GeForce RTX4060
Driver Version: 536.4…
```
-
When I quantized the Qwen2.5-1.5B-Instruct model according to **"Quantizing the GGUF with AWQ Scale"** in the [docs](https://qwen.readthedocs.io/en/latest/quantization/llama.cpp.html), it showed that th…
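For concreteness, the scale step as I understand it from that doc, sketched with the AutoAWQ API (argument names follow AutoAWQ and may differ across versions):

```python
# Sketch of the "AWQ scale" step from the Qwen docs, using AutoAWQ.
# export_compatible=True applies the AWQ scales to the weights without
# packing them into INT4, so the saved model can still be converted to GGUF.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "Qwen/Qwen2.5-1.5B-Instruct"
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)

model.quantize(tokenizer, quant_config=quant_config, export_compatible=True)
model.save_quantized("Qwen2.5-1.5B-Instruct-awq-scaled")
tokenizer.save_pretrained("Qwen2.5-1.5B-Instruct-awq-scaled")
```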
-
I have faced an error with the vLLM framework when I tried to run inference on an Unsloth fine-tuned Llama-3-8B model...
### Error:
```
(venv) ubuntu@ip-192-168-68-10:~/ans/vllm-server$ python -O -u -m vl…
```
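Assuming the truncated command above is the launch of vLLM's OpenAI-compatible API server, a minimal client call against it would look like the sketch below (base URL and served model name are placeholders):

```python
# Minimal client sketch against a vLLM OpenAI-compatible server.
# The base_url and model name are placeholders for whatever the server
# in the command above was actually started with.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
resp = client.chat.completions.create(
    model="unsloth-llama3-8b-finetune",  # placeholder served model name
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)
```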
-
I'm trying to quantize llava-1.5 according to the `readme.md` with the following scripts, and it reports: `AttributeError: 'LlavaConfig' object has no attribute 'mm_vision_tower'`.
It seems like the llava…
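For illustration, inspecting an HF-format checkpoint shows the mismatch: the Hugging Face `LlavaConfig` nests the vision tower under `vision_config` instead of exposing the original repo's `mm_vision_tower` field that the script is reading (the model id below is only an example):

```python
# Sketch: inspect which config format a llava-1.5 checkpoint uses.
# The model id is only an example of an HF-format checkpoint.
from transformers import AutoConfig

cfg = AutoConfig.from_pretrained("llava-hf/llava-1.5-7b-hf")
print(type(cfg).__name__)               # LlavaConfig
print(hasattr(cfg, "mm_vision_tower"))  # False: HF configs nest this info
print(cfg.vision_config.model_type)     # "clip_vision_model"
```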
-
Hi
I'm trying to do inference on an AWQ-quantized model and I'm constantly getting this error when trying to generate text.
I'm using Qwen2.5-72B-Instruct-AWQ.
Some code to give context:
```
sel…
```
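Since the snippet above is cut off, here is a minimal stand-alone sketch that generates from the same checkpoint with plain transformers (it needs the autoawq package installed; the device settings are just assumptions):

```python
# Minimal stand-alone sketch: greedy generation from the AWQ checkpoint
# with plain transformers (requires the autoawq package to be installed).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-72B-Instruct-AWQ"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

inputs = tokenizer("Hello, how are you?", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```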
-
### System Info
GPU: 4090
TensorRT: 10.3
tensorrt-llm: 0.13.0.dev2024081300
### Who can help?
@Tracin Could you please have a look? Thank you very much.
### Information
- [ ] The official example sc…
-
### 🚀 The feature, motivation and pitch
Is the deepseek-v2 AWQ version supported now? When I run it, I get the following error:
```
[rank0]: File "/usr/local/lib/python3.9/dist-packages/vllm/mo…
```
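For reference, a minimal sketch of the kind of load call that hits this code path, using vLLM's offline API (the checkpoint path and tensor_parallel_size are placeholders):

```python
# Sketch of loading a DeepSeek-V2 AWQ checkpoint with vLLM's offline API.
# The model path and tensor_parallel_size are placeholders.
from vllm import LLM

llm = LLM(
    model="/path/to/DeepSeek-V2-Chat-AWQ",
    quantization="awq",
    trust_remote_code=True,
    tensor_parallel_size=8,
)
```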
-
```
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
Cell In[2], [line 2](vscode-notebook-cell:?exe…
```
-
### System Info
- Ubuntu 20.04
- NVIDIA A100
### Who can help?
@Tracin @kaiyux
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] A…