-
### Your current environment
vllm 0.5.0.post
### 🐛 Describe the bug
vllm 0.5.0.post
transformers
-
qwen# python convert_checkpoint.py --model_dir /code/tensorrt-llm/Qwen1.5-32B-Chat/ --output_dir ./trt_ckpt/qwen1.5-32b/fp16 --dtype float16 --tp_size 4
[TensorRT-LLM] TensorRT-LLM version: 0.11.0.de…
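The `--tp_size 4` flag asks the converter to shard the checkpoint for 4-way tensor parallelism. As a rough illustration (not TensorRT-LLM's actual internal sharding code), column-wise sharding of a weight matrix across ranks looks like this:

```python
# Sketch only: how tensor parallelism conceptually splits a weight matrix
# column-wise across tp_size ranks. Names and shapes here are illustrative;
# TensorRT-LLM performs the real sharding internally during conversion.
def shard_columns(matrix, tp_size, rank):
    """Return the column slice of `matrix` owned by `rank` (0-based)."""
    cols = len(matrix[0])
    assert cols % tp_size == 0, "hidden size must divide evenly by tp_size"
    width = cols // tp_size
    return [row[rank * width:(rank + 1) * width] for row in matrix]

# A toy 2x4 weight split across 4 ranks: each rank gets one column.
w = [[1, 2, 3, 4],
     [5, 6, 7, 8]]
print(shard_columns(w, 4, 0))  # → [[1], [5]]
```

This is why the hidden dimensions of the model must be divisible by `tp_size`; mismatches are a common cause of conversion errors.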
-
### System Info
- Ubuntu 20.04
- NVIDIA H800
- CUDA version 11.8
### Who can help?
@kaiyux @byshiue
### Information
- [ ] The official example scripts
- [X] My own modified scripts
### Tasks
…
-
### Model description
Here is the model description
> gte-Qwen1.5-7B-instruct is the latest addition to the gte embedding family. This model has been engineered starting from the [Qwen1.5-7B](https:…
-
### What model would you like?
The code for Qwen1.5-MoE is in the latest Hugging Face transformers, and we advise you to build it from source; otherwise you might encounter the following erro…
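Before building from source, it can help to check whether the installed transformers is already new enough. The minimum version below is an assumption (Qwen1.5-MoE support reportedly landed around transformers 4.40.0; verify against the release notes), and the parser is a minimal sketch:

```python
# Sketch: compare an installed transformers version string against a minimum.
# ASSUMPTION: 4.40.0 as the first release with Qwen1.5-MoE support — verify
# against the transformers release notes before relying on it.
REQUIRED = (4, 40, 0)

def parse_version(v):
    """Parse 'x.y.z' (tolerating suffixes like '.dev0') into an int tuple."""
    parts = []
    for p in v.split(".")[:3]:
        digits = "".join(ch for ch in p if ch.isdigit())
        parts.append(int(digits) if digits else 0)
    return tuple(parts)

print(parse_version("4.40.0.dev0") >= REQUIRED)  # → True
print(parse_version("4.39.3") >= REQUIRED)       # → False
```

In practice you would feed `transformers.__version__` into `parse_version`; a `False` result suggests installing from source as advised above.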
-
Install with pip install ./dist/sophon-3.7.0-py3-none-any.whl --force-reinstall
Then run python python/qwen1_5.py --bmodel models/BM1684X/qwen1.5-1.8b_int4_1dev.bmodel --token python/token_config --dev_id 0 , and the following error occurs:…
-
### Feature request
https://github.com/QwenLM/Qwen1.5
https://huggingface.co/collections/Qwen/qwen15-65c0a2f577b1ecb76d786524
### Motivation
_No response_
### Other
_No response_
-
root@a:~/qwen/qwen.cpp/qwen_cpp# python3 convert.py -i /root/qwen/Qwen1.5-1.8B -t q4_0 -o qwen1_8b.bin
Special tokens have been added in the vocabulary, make sure the associated word embeddings are f…
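This warning means the tokenizer's vocabulary gained special tokens that the checkpoint's embedding matrix may not cover. A minimal sketch of the size check behind the warning (all sizes here are assumed for illustration, not read from the actual model):

```python
# Sketch: when special tokens are appended to the vocabulary, the embedding
# matrix must have at least that many rows before conversion. The concrete
# numbers below are ASSUMED for illustration only.
vocab_size = 151646      # tokenizer vocab after adding special tokens (assumed)
embedding_rows = 151643  # rows in the checkpoint's embedding matrix (assumed)

def missing_rows(vocab, rows):
    """Return how many embedding rows are missing for the vocab, 0 if none."""
    return max(0, vocab - rows)

print(missing_rows(vocab_size, embedding_rows))  # → 3
```

If the result is nonzero, the new rows exist only after resizing the embeddings (and ideally fine-tuning them), which is what the warning is asking you to confirm.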
-
python build.py --hf_model_dir /app/model/Qwen1.5-14B-Chat \
--dtype float16 \
--remove_input_padding \
--use_gemm_plugin float16 \
…