-
### Feature request
For a Qwen model deployed with xinference, how can I view the request logs and the content of the questions asked in the backend?
### Motivation
For a Qwen model deployed with xinference, how can I view the request logs and the content of the questions asked in the backend?
### Your contribution
For a Qwen model deployed with xinference, how can I view the reque…
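One way to see the request contents, sketched below as a client-side workaround (an assumption, not xinference's documented logging feature): since xinference serves an OpenAI-compatible endpoint, an httpx event hook passed to the openai client can print every request body. The base URL, port, and model UID here are placeholders. Server-side, raising the launcher's log level to debug should also make request handling more verbose.
```python
# Client-side request logging for an xinference OpenAI-compatible endpoint.
# A sketch: base_url, api_key, and the model UID below are placeholders.
import httpx
from openai import OpenAI

def log_request(request: httpx.Request) -> None:
    # Print the method, URL, and JSON body (i.e. the prompt/messages) of each call.
    print(f"{request.method} {request.url}")
    print(request.content.decode("utf-8", errors="replace"))

client = OpenAI(
    base_url="http://127.0.0.1:9997/v1",  # assumed xinference address and port
    api_key="not-used",                   # typically ignored when auth is off
    http_client=httpx.Client(event_hooks={"request": [log_request]}),
)

reply = client.chat.completions.create(
    model="qwen-chat",  # assumed model UID as launched in xinference
    messages=[{"role": "user", "content": "hello"}],
)
print(reply.choices[0].message.content)
```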
-
Hi~
I ran a test on qwen-14b-chat and am quite confused by the results.
The results below show that the original fp16 version is faster than the AWQ int4 version.
Is this expected?
Thank you.
…
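A minimal way to reproduce such a comparison, assuming transformers (with autoawq installed for the int4 checkpoint); both model ids are placeholders for whatever checkpoints were actually tested. W4A16 kernels mainly cut memory traffic, so depending on batch size and kernel fusion, int4 coming out slower than fp16 is not unheard of.
```python
# Rough tokens/sec comparison between an fp16 and an AWQ int4 checkpoint.
# A sketch: model ids are placeholders, and a warmup run is omitted for brevity.
import time

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def tokens_per_second(model_id: str, prompt: str = "Hello", new_tokens: int = 128) -> float:
    tok = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype="auto", device_map="auto"
    )
    inputs = tok(prompt, return_tensors="pt").to(model.device)
    torch.cuda.synchronize()
    start = time.time()
    # min_new_tokens keeps the run from stopping early at EOS,
    # so the tokens/sec estimate stays comparable across models.
    model.generate(
        **inputs,
        max_new_tokens=new_tokens,
        min_new_tokens=new_tokens,
        do_sample=False,
    )
    torch.cuda.synchronize()
    return new_tokens / (time.time() - start)

print("fp16:", tokens_per_second("Qwen/Qwen-14B-Chat"))         # placeholder id
print("awq :", tokens_per_second("some-org/Qwen-14B-Chat-AWQ")) # placeholder id
```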
-
### Describe your problem
[Question]: Hello everyone, I accidentally deleted the Tongyi Qwen API, and now none of my embedding or inference jobs can proceed. I'm using a local LLM …
-
Hi there,
I was struggling with how to run quantization with AutoAWQ, as mentioned on the home page. I was trying to quantize the 7B Qwen2-VL, but even using 2 A100s (80 GB VRAM each), I still get CUDA OOM…
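For comparison, the stock AutoAWQ recipe is sketched below (the paths are placeholders, and whether AutoAWQForCausalLM handles the Qwen2-VL architecture, as opposed to plain causal LMs, is an assumption worth verifying). Quantization calibrates layer by layer, and the forward passes over the calibration data, not the weights, are the usual OOM source, so shorter calibration samples can help.
```python
# Stock AutoAWQ 4-bit quantization flow. A sketch: paths are placeholders,
# and Qwen2-VL support through AutoAWQForCausalLM is an assumption to verify.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "Qwen/Qwen2-VL-7B-Instruct"  # placeholder source model
quant_path = "qwen2-vl-7b-awq"            # placeholder output directory
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

model = AutoAWQForCausalLM.from_pretrained(
    model_path, low_cpu_mem_usage=True, use_cache=False
)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# Calibration forward passes, not the weights themselves, usually drive memory.
model.quantize(tokenizer, quant_config=quant_config)
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```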
-
Current attempt:
```python
def test_unsloth_vllm(
    max_length: int = 8192,
    use_4bit: bool = False,
):
    print('----> test_unsloth_vllm')
    import os
    from tra…
```
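The snippet is cut off above, so purely as a sketch of what such a test might look like (assuming unsloth is installed; the model id and settings are placeholders, not the author's actual code):
```python
# Hypothetical completion of a test like the one above, using unsloth's
# FastLanguageModel. Model id, lengths, and prompt are all placeholders.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Qwen2.5-7B-Instruct",  # assumed model id
    max_seq_length=8192,
    load_in_4bit=False,
)
FastLanguageModel.for_inference(model)  # switch to unsloth's fast inference path

inputs = tokenizer("Hello", return_tensors="pt").to("cuda")
out = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```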
-
Hope to see support for a chat-style API (friendly to multi-turn conversations); currently only generate seems to be supported.
For example, something along the lines of the code example in [Qwen/Qwen…
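For reference, the multi-turn chat pattern from the Qwen2 model card looks like the sketch below (plain transformers; the model id is illustrative):
```python
# Multi-turn chat via apply_chat_template, as in the Qwen2 model card.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2-7B-Instruct"  # illustrative model id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Tell me a joke."},
]
text = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
inputs = tokenizer([text], return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128)
# Strip the prompt tokens so only the new assistant turn is decoded.
print(tokenizer.decode(out[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))
```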
-
I have been experimenting with different models in fllama, specifically Gemma, Phi-3, and Qwen2. I noticed significant differences in performance and response quality across these models:
Gemma…
-
Under WSL, the command:
python -m examples.zeroshot --bench_name classification_public --model_name "Qwen/Qwen2.5-7B-Instruct" --device cuda --output_path /tmp/output.csv
and
python -m examples.self_streamicl …
-
qwen# python convert_checkpoint.py --model_dir /code/tensorrt-llm/Qwen1.5-32B-Chat/ --output_dir ./trt_ckpt/qwen1.5-32b/fp16 --dtype float16 --tp_size 4
[TensorRT-LLM] TensorRT-LLM version: 0.11.0.de…
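For context, in TensorRT-LLM 0.11 the converted checkpoint is normally compiled into engines in a second step, roughly like the line below (the output directory is a placeholder, and the gemm plugin flag is a common but optional choice):
trtllm-build --checkpoint_dir ./trt_ckpt/qwen1.5-32b/fp16 --output_dir ./trt_engines/qwen1.5-32b/fp16 --gemm_plugin float16
Since the checkpoint was converted with tp_size 4, the resulting engines would then run across 4 ranks (e.g. via mpirun -n 4).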
-
Can Spring AI support the Qwen large language model?
And can spring-ai-ollama-spring-boot-starter support function calling?