qwen Search Results - Githubissues

1000+ results
for qwen

Best match


alibaba/higress #1530

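Follow-up development tasks for the ai-proxy plugin

| Task | PR | Progress | | --- | --- | --- | | Extract the provider logic from the ai-proxy plugin so that other plugins can reuse it | | ⏳ Not started | | Support retrying a request immediately when it fails | | ⏳ Not…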

cr7258 updated 1 day ago
4
PaddlePaddle/PaddleNLP #8663

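[LLM] List of supported model parameters

# Model parameter support section Hello everyone, the PaddleNLP team has compiled detailed information on the parameters of each model here for your convenience. ## Model parameters ### Base Models | Model | 0.5B | 1~2B | 3~4B | 6~8B | 13~14B | 30~32B | 50~60B | 65~72B | 110B | >110B | |:---------:|:--…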

DrownFish19 updated 1 week ago
2
pytorch/ao #1080

Int8DynActInt4WeightQATQuantizer doesn't support qwen series

I used `Int8DynActInt4WeightQATQuantizer` to quantize the qwen2-1.5B model, but after the prepare function I found that bias is set to False. This is my code: ``` from torchtune.models.qwen2 import qwen2_1_…

elfisworking updated 1 month ago
4
AIDC-AI/Marco-o1 #11

love the model but want more

I really like this model, but do you guys have plans to make another based off of the 14b, 32b Qwen 2.5 models perhaps?

BBC-Esq updated 1 day ago
1
QwenLM/Qwen-Agent #391

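With the open-source qwen2.5 model, content cannot be retrieved from a knowledge base loaded via the files parameter when creating an Assistant

The problem does not occur with the official qwen-max or qwen-long. Is there some restriction on open-source models?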

pro518 updated 1 week ago
1
EvolvingLMMs-Lab/LongVA #30

Question about apply_seq_parallel_monkey_patch("zigzag_ring_…

Hello, thanks for your great work. I have a few small questions. When testing a Qwen2-based model, like `llava_qwen` or `lmms-lab/LongVA-7B`, on the V-NIAH benchmark, there is a function [apply_seq_…

zhang9302002 updated 6 days ago
1
LiveBench/LiveBench #56

Add Qwen 2.5

https://qwenlm.github.io/blog/qwen2.5/ https://huggingface.co/collections/Qwen/qwen25-66e81a666513e518adb90d9e

carterprince updated 2 months ago
9
mainframecomputer/fullmoon-ios #14

Model request Qwen 2.5 7B 4bit

Highest capability models that can run on latest iPhone would be useful. The best I've found to fit in 8gb RAM is Qwen 2.5 7B 4bit?

mobile-appz updated 1 month ago
2
MeetKai/functionary #271

Functionary on Qwen 2.5

Again, thank you for your work! I think this project does not have the attention it should. This is by far the best OS model that can serve as a general agent in my use cases. Would be curious to …

themrzmaster updated 4 weeks ago
7
axolotl-ai-cloud/axolotl #1966

Flash attention and multipack failing for qwen and mistral

### Please check that this issue hasn't been reported before. - [X] I searched previous [Bug Reports](https://github.com/axolotl-ai-cloud/axolotl/labels/bug) and didn't find any similar reports. ###…

tiger241 updated 1 week ago
12
