-
FP8 or AWQ quant
-
### From the new version, I built it but I can't import awq
- transformers 4.43.3
- torch 2.3.1
- torchaudio 2.4.0
- torchvision 0.19.0
…
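The report is truncated, but a minimal import check may help isolate the failure. This sketch assumes the missing `awq` module is meant to come from the AutoAWQ project (PyPI package name `autoawq`), which is installed separately from the packages listed above:

```python
# Sketch: verify AutoAWQ is importable in the current environment.
# Assumption: `awq` refers to the AutoAWQ package (pip install autoawq),
# not a module shipped by the locally built wheel.
try:
    from awq import AutoAWQForCausalLM
    print("AutoAWQ import OK")
except ModuleNotFoundError as err:
    print(f"AutoAWQ is not installed in this environment: {err}")
```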
-
Why does speed not increase with AWQ? I have the model Gemma 2 9B on one A100.
With float16 the benchmark is 4267.62 tokens per second.
With AWQ 4-bit the benchmark is 4963.73 tokens per second.
I ex…
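The report is cut off, but a throughput comparison like the one above might be run roughly as follows; the checkpoint name, batch size, and sampling settings are illustrative assumptions. One common explanation for a modest gain is that AWQ is weight-only quantization: it mainly helps when decoding is memory-bandwidth-bound (small batches), while large-batch benchmarks are compute-bound and pay a dequantization overhead.

```python
import time
from vllm import LLM, SamplingParams

# Illustrative checkpoint name; substitute the actual AWQ-quantized Gemma 2 9B.
# Pass quantization="awq" for the AWQ checkpoint; omit it for the float16 run.
llm = LLM(model="gemma-2-9b-it-awq", quantization="awq")

params = SamplingParams(temperature=0.0, max_tokens=256)
prompts = ["Summarize the theory of relativity."] * 64  # batch large enough to saturate the GPU

start = time.time()
outputs = llm.generate(prompts, params)
elapsed = time.time() - start
generated = sum(len(o.outputs[0].token_ids) for o in outputs)
print(f"{generated / elapsed:.2f} generated tokens per second")
```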
-
### Your current environment
Collecting environment information...
WARNING 11-12 05:39:35 _custom_ops.py:19] Failed to import from vllm._C with ModuleNotFoundError("No module named 'vllm._C'")
Warn…
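As a hedged diagnostic, `vllm._C` is vLLM's compiled native extension; when it cannot be imported, the package was most likely installed without the CUDA ops being built. A quick check using only the standard library:

```python
# Check whether vLLM's compiled extension module is present on disk.
# If it is missing, reinstalling vLLM (or rebuilding from source so the
# native extension actually compiles) is the usual fix.
import importlib.util

spec = importlib.util.find_spec("vllm._C")
if spec is None:
    print("vllm._C is missing; the native extension was not built/installed")
else:
    print("compiled extension found at:", spec.origin)
```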
-
### Your current environment
The output of `python collect_env.py`
```text
Your output of `python collect_env.py` here
```
### Model Input Dumps
from awq import AutoAWQForCausalLM
fro…
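The imports above are cut off; a typical AutoAWQ quantization flow using `AutoAWQForCausalLM` looks roughly like this, with the paths and quantization settings being illustrative assumptions rather than values from the original report:

```python
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "path/to/base-model"       # hypothetical input checkpoint
quant_path = "path/to/base-model-awq"   # hypothetical output directory
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

# Load the full-precision model and its tokenizer.
model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# Run AWQ calibration/quantization and save the 4-bit checkpoint.
model.quantize(tokenizer, quant_config=quant_config)
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```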
-
### System Info
Ubuntu 20.04
tensorrt 10.0.1
tensorrt-cu12 10.0.1
tensorrt-cu12-bindings 10.0.1
tensorrt-cu12-libs 10.0.1
tensorrt-llm 0.11.0.dev2024052100
NVIDIA L40S
### Who can help?
…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related iss…
-
### System Info
```shell
Transformers fails with the following error when trying to use AWQ with TGI, the neural compression engine, or Optimum Habana
ValueError: AWQ is only available on GPU
```
#…
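Transformers raises this `ValueError` when no CUDA device is visible at load time; a minimal guard, assuming a plain Transformers load path (the checkpoint name is illustrative):

```python
import torch
from transformers import AutoModelForCausalLM

# Transformers' AWQ integration requires a CUDA GPU, so fail early and
# clearly instead of hitting "AWQ is only available on GPU" during loading.
if not torch.cuda.is_available():
    raise SystemExit("No CUDA device visible; AWQ checkpoints need a GPU here")

model = AutoModelForCausalLM.from_pretrained(
    "TheBloke/Mistral-7B-v0.1-AWQ",  # illustrative AWQ checkpoint
    device_map="cuda:0",
)
```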
-
### System Info
x86_64, Debian 11, L4 GPU
### Who can help?
_No response_
### Information
- [x] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially supporte…
-
File "/root/ld/ld_project/pull_request/MiniCPM-V/web_demo_2.6.py", line 44, in
model = AutoModel.from_pretrained(model_path, trust_remote_code=True)
File "/root/ld/conda/envs/minicpm/lib/py…