auto-quant Search Results

1000+ results
for auto-quant

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ModelCloud/GPTQModel #194

[BUG] tip/main has regression on optional vllm depend

vLLM was merged into main but there remains runtime depend issues. vLLM is a large pkg we will not force dependency on it. vLLM should runtime import and error if not exists and prompt users to instal…

Qubitium updated 4 months ago
1
InternLM/lmdeploy #2554

[Bug] does TurboMind support Qwen2-VL-2B-Instruct in lmdeplo…

### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest version. - [ ] 3. Please note that if the bug-related issue y…

LinJianping updated 1 month ago
1
BoundaryML/baml #969

Open source models support

Hello, First of all thank you for bringing this amazing tool! I was wondering if there is any chance of integrating open-source LMM models like for example https://huggingface.co/Qwen/Qwen2-VL-7B-…

NikiBase updated 1 month ago
17
sgl-project/sglang #710

[Feature] Support for LLama 3.1

### Motivation Since Llama3.1 is already released. I tested with gptq quant and it doesn't work. ```bash Traceback (most recent call last): sglang | File "/usr/lib/python3.10/runpy.py", line…

TimilsinaBimal updated 3 months ago
6
NVIDIA/TensorRT-LLM #1865

can not run whisper on T4

### System Info x86_64 755G nvidia T4 ubuntu 22.04 trtllm version : https://github.com/NVIDIA/TensorRT-LLM/archive/9691e12bce7ae1c126c435a049eb516eb119486c.zip pip install tensorrt-llm==0.11…

ZJU-lishuang updated 5 days ago
5
huggingface/transformers #32671

RuntimeError: Failed to import transformers.integrations.bit…

# System Info Package Version ------------------------ ---------- accelerate 0.33.0 bitsandbytes 0.43.3 transformers 4.44.0 ### Who c…

fsaudm updated 3 months ago
1
rustformers/llm #244

Write a 0.2 changelog

There's been quite a few changes from 0.1. We should document them for people updating their applications.

philpax updated 1 year ago
8
unslothai/unsloth #877

Issue Report: Inconsistent Behavior and Meaningless Output

Thank you for your work. However, I've noticed some performance issues that differ significantly when compared to the Llama 3.1 model. Specifically, I've observed the following problems: # Issue Desc…

seolhokim updated 3 months ago
4
eosphoros-ai/DB-GPT #1689

DB-GPT use one-ke model error.

### Search before asking - [X] I had searched in the [issues](https://github.com/eosphoros-ai/DB-GPT/issues?q=is%3Aissue) and found no similar issues. ### Operating system information Linux ### P…

zhuweigang updated 3 months ago
3
modelscope/FunASR #2152

docker在容器内启动服务是报错

请看一下日志我错过了什么？谢谢根据此部署指南：https://github.com/modelscope/FunASR/blob/main/runtime/docs/SDK_advanced_guide_offline_gpu.md 执行如下命令： ``` docker pull registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:f…

xiasi0 updated 1 month ago
1

上一页 1...73 74 75 76 77 78 79...100 下一页

1000+ results for auto-quant

1000+ results
for auto-quant