-
### Your current environment
The output of `python collect_env.py`
```text
Your output of `python collect_env.py` here
```
### Model Input Dumps
Thanks for the great work.
I use…
-
For example, baichuan-7b-v1 is currently free for a limited time.
```json
{
  "models": [
    "qwen-long",
    "qwen-turbo",
    "qwen-plus",
    "qwen-max",
    …
```
-
After a loss backward and optimizer step, on the next forward pass the embedding layer's output hidden states become inf and the loss is NaN.
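A hedged guess at what may be happening: if training runs in fp16, one large optimizer step can push embedding weights past the fp16 representable range, after which inf propagates through the forward pass and any inf - inf (e.g. inside a softmax or loss reduction) yields NaN. A toy, framework-free simulation of that overflow chain (the `fp16` helper, `FP16_MAX`, and the scaling factors are illustrative, not taken from the issue):

```python
import math

FP16_MAX = 65504.0  # largest finite value representable in IEEE fp16

def fp16(x):
    """Illustrative fp16 saturation: values past the range overflow to inf."""
    if abs(x) > FP16_MAX:
        return math.inf if x > 0 else -math.inf
    return x

w = 60000.0           # embedding weight already near the fp16 limit
w = fp16(w * 1.2)     # a large update overflows it to inf
hidden = w * 0.5      # inf propagates through the forward pass
loss = hidden - hidden  # inf - inf produces NaN

print(w, loss)  # inf nan
```

If this is the cause, common mitigations are lowering the learning rate, enabling gradient clipping, or keeping the embedding layer in bf16/fp32.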
-
I read the Qwen2-Audio source code in detail and explored the model architecture further.
The authors previously stated that Qwen2-Audio uses Qwen-1 as the LLM, yet the config lists qwen2 as the `text_config`, which is confusing.
The LLM's layer num is 32, which matches Qwen-7B, but the attention is Qwen2's attention, which confuses me greatly.
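One way to check which architecture the checkpoint actually declares is to read the fields straight out of its `config.json`. A minimal sketch with a hypothetical excerpt (the exact keys and values in the real Qwen2-Audio config may differ):

```python
import json

# Hypothetical excerpt of a config.json for illustration only;
# inspect the real file shipped with the checkpoint instead.
config_text = '{"text_config": {"model_type": "qwen2", "num_hidden_layers": 32}}'

cfg = json.loads(config_text)["text_config"]
print(cfg["model_type"], cfg["num_hidden_layers"])  # qwen2 32
```

If `model_type` says `qwen2` while the layer count matches Qwen-7B, the config, not the paper's prose, is what the loading code will follow.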
-
I deployed Qwen2-VL-72B using Swift, but during multi-image inference the generated results consistently terminate early. Could you advise on how to resolve this?
The startup script is as …
-
I have tried the following from the LangChain documentation:
```
import { ChatAlibabaTongyi } from "@langchain/community/chat_models/alibaba_tongyi";
import { HumanMessage } from "@langchain/core/messages";…
-
Hi, while quantizing large models (Qwen 72B) on 5x A40 GPUs, I noticed that only the first GPU shows high (80-90%) utilisation, while the rest sit at 0%. Is this normal, or am I miss…
-
```shell
accelerate launch --main_process_port=29501 --num_processes=8 -m lmms_eval --model qwen_vl --model_args pretrained=/Qwen-VL/ --tasks refcoco,refcoco+,refcocog,refcoco_bbox_rec,refcoco+_bbox_rec,refcoc…
```
-
### The Feature
LangChain has good examples, and I hope support for it can be added to LiteLLM. Here is the link:
[https://python.langchain.com/docs/integrations/chat/tongyi/](https://github.com/BerriAI/litellm/issue…
-
Hello! I am trying to use Qwen-VL to extract unimodal features for a given input image and accompanying text query. How can that be achieved? I am aware that models like BLIP-2 have a direct API (extr…
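For context, one common approach (not necessarily an official Qwen-VL API) is to run the model with `output_hidden_states=True` and pool the hidden states over the tokens belonging to one modality. The pooling step itself is simple; here is a framework-free sketch with toy 3-dimensional states standing in for real hidden states:

```python
def mean_pool(hidden_states):
    """Mean-pool a list of per-token vectors into one feature vector."""
    dim = len(hidden_states[0])
    n = len(hidden_states)
    return [sum(h[d] for h in hidden_states) / n for d in range(dim)]

# Toy per-token hidden states for the text (or image) span of the sequence.
text_tokens = [[1.0, 2.0, 3.0], [3.0, 2.0, 1.0]]
feature = mean_pool(text_tokens)
print(feature)  # [2.0, 2.0, 2.0]
```

With a real checkpoint, you would slice the last hidden state at the positions of the image tokens or the text tokens before pooling; which positions those are depends on the model's prompt template.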