-
When I try to run `patch_model_for_compiled_runtime` with 8-bit quantization and the aten backend, the program reports an error. How can I solve this problem?
[screenshot of the error message]
-
### System Info
Ubuntu 20.04
tensorrt 10.0.1
tensorrt-cu12 10.0.1
tensorrt-cu12-bindings 10.0.1
tensorrt-cu12-libs 10.0.1
tensorrt-llm …
-
Do you support an ExLlamaV2 backend for inference, so that EXL quants can be used?
The current alternative is vLLM, but it doesn't support EXL quants. Also, after running a perplexity test, EXL is the b…
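
For reference, the perplexity test was along these lines — a minimal sketch using stock Hugging Face APIs, where `gpt2` is only a placeholder for the quantized model under test:

```
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder -- swap in the model under test
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

text = "The quick brown fox jumps over the lazy dog. " * 8
ids = tokenizer(text, return_tensors="pt").input_ids

with torch.no_grad():
    # Passing labels=ids makes the model return mean token cross-entropy
    loss = model(ids, labels=ids).loss

# Perplexity is the exponential of the mean cross-entropy
print(f"perplexity = {torch.exp(loss).item():.2f}")
```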
-
I found that device memory usage keeps increasing while basic_quant_mix.py executes, and it raises an OOM error when the model has a large number of parameters. How can this be optimized? Thank you~
@Qcompiler
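
In case it helps discussion, the kind of mitigation I have in mind is per-layer cleanup along these lines — only a sketch, assuming basic_quant_mix.py quantizes the model layer by layer with PyTorch; `quantize_layer` is a hypothetical stand-in for the script's own per-layer routine:

```
import gc
import torch

def quantize_layerwise(model, quantize_layer):
    # Quantize one layer at a time so peak GPU usage stays near the
    # footprint of a single layer instead of the whole model.
    for module in model.modules():
        if getattr(module, "weight", None) is None:
            continue
        module.cuda()              # move only this layer to the GPU
        quantize_layer(module)     # hypothetical per-layer routine
        module.cpu()               # return the quantized layer to host RAM
        gc.collect()               # drop dead references first...
        torch.cuda.empty_cache()   # ...then free cached CUDA blocks
```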
-
`(quant 30)` is not supported by this project at all, and yet I don't think this is mentioned anywhere.
Is there any other syntax that is unsupported and unlisted?
-
**Title:** Support for ES Module Import in `@quantlib/ql`
**Description:**
I encountered an error while predicting the next price using the `@quantlib/ql` library. The error message indicates th…
-
Thank you for sharing this Quant Project.
Could you let me know which course it is from?
-
I'm confused: the method `ldlq_Rg` doesn't seem to support group quantization.
-
I think I have a mess to clean up, which I will do soon.
I believe I finished the quantity element. I tried to streamline things for Rob, so I rebased and squashed commits. I knew that he had alread…
-
I used AWQ to quantize Llama 2 70B-Chat with:
```
CUDA_VISIBLE_DEVICES="1,2,3,4,5,6,7" python quantize_llama.py
```
The code of quantize_llama.py:
```
from awq import AutoAWQForCausalLM
from tr…
```
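
For context, a complete AutoAWQ quantize-and-save flow looks roughly like the sketch below; the checkpoint path, output directory, and quant config are assumptions on my part, not the truncated script's actual values:

```
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

# Assumed paths and config -- substitute the real ones
model_path = "meta-llama/Llama-2-70b-chat-hf"
quant_path = "llama-2-70b-chat-awq"
quant_config = {"zero_point": True, "q_group_size": 128,
                "w_bit": 4, "version": "GEMM"}

# Load the FP16 checkpoint and its tokenizer
model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# Calibrate and quantize, then persist the quantized weights
model.quantize(tokenizer, quant_config=quant_config)
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```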