-
A: libc++abi: terminating due to uncaught exception of type std::out_of_range: unordered_map::at: key not found
[1] 58628 abort ./cli_demo ../qwen1.5-0.5b-chat
-
Hi,
What is the number of training tokens for the base models Qwen1.5 0.5B, 1.8B, 4B, 7B, 14B, 72B, 110B, and Qwen2 7B/72B?
Thanks for the great work!
-
### Describe the bug
I use docker-compose to deploy Xinference. Most of the time it works fine, but at some random moment, a KeyError is triggered, causing the entire service to fail. Here are my ste…
-
### System Info
transformers version: 4.38.1
platform: Ubuntu 22.04
python version: 3.10.14
optimum version: 1.19.2
### Who can help?
@ArthurZucker and @younesbelkada
### Information
- [X] …
-
OpenAI models can be used with the tiktoken package.
How should we count the number of streamed output tokens for a Qwen model?
`
llm2 = ChatOpenAI(
model="/vllm/qwen1.5-chat-moe",
openai_api_key = 'xxx',
openai_api_base…
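tiktoken only ships the OpenAI encodings, so it cannot count Qwen tokens. Qwen models publish a Hugging Face tokenizer, so one common pattern is to accumulate the streamed chunks and count tokens on the final text with that tokenizer. The sketch below is a hedged illustration: `count_streamed_tokens` is a hypothetical helper, and `WhitespaceTokenizer` is a stand-in for demonstration only; with a real model you would pass `transformers.AutoTokenizer.from_pretrained(<your Qwen path>)` instead.

```python
# Sketch: count tokens of a streamed response with an HF-style tokenizer.
# The tokenizer object only needs an `encode` method; in practice use
# transformers.AutoTokenizer.from_pretrained(<Qwen model path>).

def count_streamed_tokens(chunks, tokenizer):
    """Accumulate streamed text chunks, then tokenize once at the end.

    Counting the joined text (rather than chunk by chunk) avoids
    miscounting when a single token spans two stream chunks.
    """
    text = "".join(chunks)
    return len(tokenizer.encode(text))

class WhitespaceTokenizer:
    """Stand-in tokenizer for demonstration: splits on whitespace.
    A real BPE tokenizer will produce different (usually larger) counts."""
    def encode(self, text):
        return text.split()

chunks = ["Hello ", "world, ", "this is ", "a streamed reply."]
print(count_streamed_tokens(chunks, WhitespaceTokenizer()))
```

Note that per-chunk counting with a BPE tokenizer can drift, since chunk boundaries need not align with token boundaries; counting the concatenated text sidesteps that.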
-
### Your current environment
Environment:
torch 2.3.0
vllm 0.5.0.post1
transformers 4.41.2
Main error:
The smaller MoE model '/data/models/qwen/qwen1.5-2.7Bmoe' works fine,
but the larger one fails with the error shown at the bottom.
Code:
from vllm.engine.arg_ut…
-
Hi folks, please report issues regarding the ragas custom critic model in this thread.
Docs : https://docs.ragas.io/en/latest/howtos/customisations/ragas_custom_model.html
Model: https://huggingfa…
-
- System: Debian 12
- ollama: 0.1.40
- qwen1.5 works; qwen2 does not:
-
Error screenshot:
![screenshot](https://github.com/xorbitsai/inference/assets/29749635/27488b41-296f-4fe7-8c64-4aa0cf78cd03)
Model: Qwen1.5-14B-Chat-GPTQ-int4
Loading engine: vllm
Error message:
torch.cuda.OutOfMemoryError: [address=0.0.0.…
-
Can the Qwen1.5-72B-Chat-GPTQ-Int4 model be run directly?