qwen1-5 Search Results - Githubissues

1000+ results
for qwen1-5

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/DeepSpeed-MII #457

[FEATURE REQUEST] Add Support for Qwen1.5-MoE Architecture i…

# Qwen1.5-MoE Support With the increasing attention on mixture-of-experts (MoE) models, especially following the advancements heralded by Mixtral, I propose considering the integration of the Qwen1.5…

freQuensy23-coder updated 3 months ago
1
xorbitsai/inference #1415

加载自定义模型 qwen1.5 0.5B chat

### 如上述文字自定义模型0.5B 是json文件这么写不对吗？请指教，json无法解析 ![image](https://github.com/xorbitsai/inference/assets/125621025/873e5d80-2f66-45f3-b234-97e729b81b32) ![image](https://github.com/xorbitsai/infer…

GXKIM updated 4 days ago
4
NVIDIA/TensorRT-LLM #1119

Can I use tensorrt-llm depoly qwen1.5?

hljjjmssyh updated 3 months ago
3
xorbitsai/inference #1630

新版本的xinference，glm4不支持 tools 能力？

使用 tools 调用"glm4-chat-1m"模型。报错如下错误。 openai.BadRequestError: Error code: 400 - {'detail': "Only ['chatglm3', 'gorilla-openfunctions-v1', 'qwen-chat', 'qwen1.5-chat'] support tool calls"}

HenryLiang updated 4 days ago
1
alibaba/rtp-llm #74

v0.2.0(cuda12)对比 v0.1.13(cuda11)表现下降

环境配置 A 环境 cuda12.1 v0.2.0 B 环境 cuda11.8 v0.1.13 硬件 A800单卡测试模型 qwen14B 单卡加载 int8推理环境变量如下配置 export CUDA_VISIBLE_DEVICES=1 export MODEL_TYPE=qwen_2 export ACT_TYPE=BF16 export WEIGHT_TYPE=…

invisifire updated 2 weeks ago
1
yuanzhoulvpi2017/zero_nlp #175

教程

大佬能不能出一个最简化的全量SFT QWEN1.5的代码呀。

yangliuIOC updated 1 month ago
1
lm-sys/FastChat #3112

Need to support Qwen1.5-72B-Chat 32k token?

# HuggingFace https://huggingface.co/Qwen/Qwen1.5-72B-Chat

ggservice007 updated 2 months ago
2
vllm-project/vllm #4677

[Feature]: support lora such as qwen-7b and qwen1.5

### 🚀 The feature, motivation and pitch we had trained a lot of lora with qwen-7b ,if vllm support qwen-7b not only qwen1.5 ,that will be better,thanks ### Alternatives _No response_ ### Additiona…

kynow2 updated 2 months ago
1
ztxz16/fastllm #446

千问qwen1.5-14B-chat解码错误

【现象】 qwen1.5-14B-Chat模型在解码时报UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 1: unexpected end of data。【描述】模型输入是：假设f(x)=x，那么f(x)1到2的积分是多少。模型输出的tokenId包含11995、18137，这两个tokenId会…

yiguanxian updated 2 months ago
2
tencentmusic/supersonic #1401

[Enhancement] 字段转换的时候能否兼容大模型解析s2sql时间格式不满足要求的情况

### Search before asking - [X] I had searched in the [issues](https://github.com/tencentmusic/supersonic/issues?q=is%3Aissue) and found no similar issues. ### Description 大模型输出的sql不稳定，时间条件不满足promp…

wangwz587 updated 5 days ago
1

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for qwen1-5

1000+ results
for qwen1-5