### System Info
Ubuntu 22
### Running Xinference with Docker?
- [X] docker
- [ ] pip install
- [ ] installation from source
…
-
Exporting a model with ppl.pmx Export produces a large number of warnings, `Warning: The shape interface of opmx::XX type is missing` (e.g. for ParallelEmbedding, ColumnParallelLinear, Reshape). Starting ppl_llm_server with the exported ONNX file then fails with `unsupported op: domain[op…`
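As a debugging aid, one way to see which custom-domain ops the exported file actually contains (and therefore what the server would need to support) is to scan the graph with the `onnx` Python package. This is a minimal sketch; the `model.onnx` path is a placeholder for the file produced by ppl.pmx Export, and the `opmx` domain name is taken from the warning above.

```python
from collections import Counter

import onnx

# Load the exported graph (placeholder path; substitute the file
# produced by ppl.pmx Export).
model = onnx.load("model.onnx")

# Count ops per (domain, op_type); ops whose domain is "opmx" are the
# custom ones a stock ONNX runtime or server will not know about.
ops = Counter((node.domain or "ai.onnx", node.op_type)
              for node in model.graph.node)

for (domain, op_type), n in sorted(ops.items()):
    marker = "  <-- custom domain" if domain == "opmx" else ""
    print(f"{domain}::{op_type}: {n}{marker}")
```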
-
The docs only say support goes up to Qwen1.5, but quite a few people in the issues seem to be using it with Qwen2?
-
I have tested the inference speed and memory usage of Qwen1.5-14b on my machine using the example in ipex-llm. The peak CPU usage while loading Qwen1.5-14b in 4-bit is about 24GB. The peak GPU usage is abou…
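For context, loading a model in 4-bit with ipex-llm follows the usual transformers pattern; below is a minimal sketch, assuming the model id `Qwen/Qwen1.5-14B-Chat` (a placeholder; a local path works too) and an Intel GPU available as `"xpu"`.

```python
import torch
from ipex_llm.transformers import AutoModelForCausalLM
from transformers import AutoTokenizer

model_path = "Qwen/Qwen1.5-14B-Chat"  # placeholder; substitute a local path

# load_in_4bit=True quantizes the weights to 4-bit while loading, which
# is likely the source of the ~24GB peak CPU usage reported above.
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    load_in_4bit=True,
    trust_remote_code=True,
)
model = model.to("xpu")  # move the quantized model to the Intel GPU

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
inputs = tokenizer("What is AI?", return_tensors="pt").to("xpu")
with torch.inference_mode():
    output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```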
-
```
qwen# python convert_checkpoint.py --model_dir /code/tensorrt-llm/Qwen1.5-32B-Chat/ --output_dir ./trt_ckpt/qwen1.5-32b/fp16 --dtype float16 --tp_size 4
[TensorRT-LLM] TensorRT-LLM version: 0.11.0.de…
```
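As a rough sanity check on whether `--tp_size 4` fits, the fp16 weights alone can be estimated as parameter count × 2 bytes split across the tensor-parallel ranks. A back-of-the-envelope sketch follows; the 32B parameter count is approximate, and activations / KV cache / TensorRT workspace are not included.

```python
# Rough per-GPU weight footprint for Qwen1.5-32B in fp16 with tp_size=4.
# Approximate figures only; runtime memory comes on top of this.
params = 32e9           # ~32B parameters (approximate)
bytes_per_param = 2     # float16
tp_size = 4

total_gib = params * bytes_per_param / 2**30
per_gpu_gib = total_gib / tp_size
print(f"total weights: ~{total_gib:.0f} GiB, per GPU: ~{per_gpu_gib:.0f} GiB")
# -> total weights: ~60 GiB, per GPU: ~15 GiB
```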
-
# Model Parameter Support Hub
Hi everyone, the PaddleNLP team has compiled detailed information on the parameter sizes of each supported model here for easy reference.
## Model Parameters
### Base Models
| Model | 0.5B | 1~2B | 3~4B | 6~8B | 13~14B | 30~32B | 50~60B | 65~72B | 110B | >110B |
|:---------:|:--…
-
### Model description
Here is the model description
> gte-Qwen1.5-7B-instruct is the latest addition to the gte embedding family. This model has been engineered starting from the [Qwen1.5-7B](https:…
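For reference, a minimal way to try an embedding model like this is via sentence-transformers. The sketch below assumes the model is published on the Hugging Face Hub as `Alibaba-NLP/gte-Qwen1.5-7B-instruct`; that model id, and the example texts, are assumptions rather than details from the description above.

```python
from sentence_transformers import SentenceTransformer

# Model id is an assumption based on the gte naming scheme above.
model = SentenceTransformer(
    "Alibaba-NLP/gte-Qwen1.5-7B-instruct",
    trust_remote_code=True,
)

queries = ["how do transformers compute attention?"]
docs = [
    "Attention weights are the softmax of scaled query-key dot products.",
    "Qwen1.5 is a decoder-only language model family.",
]

# With normalized embeddings, cosine similarity reduces to a dot product.
q_emb = model.encode(queries, normalize_embeddings=True)
d_emb = model.encode(docs, normalize_embeddings=True)
print(q_emb @ d_emb.T)  # similarity matrix, shape (1, 2)
```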
-
My test script:
```
NPROC_PER_NODE=8 \
ASCEND_RT_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 \
HCCL_SOME_VARIABLE=value \
swift infer \
--model_type '/data2/dxc/Qwen1.5-32B-Chat' \
--load_args_from_ckpt_dir …