qwen Search Results - Githubissues

1000+ results
for qwen

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

TideDra/VL-RLHF #10

微调qwen爆内存

您好，使用原始代码在2张A100 80G上面微调qwen，显存占用两张卡上都只有919M，但是在数据加载过程中？内存占用一直在增加，直到180多G后内存爆了，程序终止。请问这个问题怎么解？训练log： ![image](https://github.com/TideDra/VL-RLHF/assets/36758049/09277b55-ea0a-4cfd-875b-792f457441a2…

delian11 updated 4 months ago
3
mlc-ai/mlc-llm #2670

[Model Request] please support Qwen-VL model

## ⚙️ Request New Models - Link to an existing implementation (e.g. Hugging Face/Github): - Is this model architecture supported by MLC-LLM? (the list of [supported models](https://llm.mlc.ai/do…

junwenZhang updated 2 months ago
2
modelscope/data-juicer #457

How to use ‘hf_model’

### Before Asking 在提问之前 - [X] I have read the [README](https://github.com/alibaba/data-juicer/blob/main/README.md) carefully. 我已经仔细阅读了 [README](https://github.com/alibaba/data-juicer/blob/main/README…

abchbx updated 2 weeks ago
3
tenstorrent/tt-buda-demos #125

Support running Qwen1.5-0.5B on Wormhole(n150/n300)

we have wanted to run qwen1.5 on our existing wormhole cards for some days. Happy to see Qwen1.5 appeared in the supported model list of the pybuda-0.19.3 just released，but it still only supports run…

liji0276 updated 1 month ago
1
camel-ai/camel #1172

[Feature Request] GraphRAG with FinDKG

### Required prerequisites - [X] I have searched the [Issue Tracker](https://github.com/camel-ai/camel/issues) and [Discussions](https://github.com/camel-ai/camel/discussions) that this hasn't alre…

Wendong-Fan updated 9 hours ago
2
X-PLUG/MobileAgent #65

请教大佬，PC-Agent中gpt-4o进行对话的部分，能否换成本地部署的Qwen-VL-Chat？

![image](https://github.com/user-attachments/assets/5aaa3655-2e50-4649-b12d-6b323ff02444) 图片中标注的那部分能够换成千问

shenyugub updated 1 week ago
7
QwenLM/Qwen2-VL #529

通义千问2-VL-2B-Instruct-GPTQ-Int4不支持多轮图片识别

通义千问2-VL-2B-Instruct-GPTQ-Int4不支持多轮图片识别错误提示： { "object": "error", "message": "At most 1 image(s) may be provided in one request.", "type": "BadRequestError", "param": null, …

HaoWang81 updated 6 days ago
1
UbiquitousLearning/mllm #126

How did you obtain the two model files, qwen-1.5-1.8b-chat-i…

How did you obtain the two model files, qwen-1.5-1.8b-chat-int8.mllm and qwen-1.5-1.8b-chat-q4k.mllm?

yhwang-hub updated 2 months ago
3
OFA-Sys/InsTag #10

Details about Qwen-1_8B Instagger.

I see there is a Qwen-1_8B version of Instagger on ModelScope. Could you please share the prompt you used for finetuning this model so that we can obtain better results when using the tagger.

litsh updated 3 months ago
4
1694439208/GOT-OCR-Inference #10

转换后的gguf模型, 无法处理qwen里面的如 <|im_start|><|im_end|> <img> 等特殊标记,

转换后的gguf模型, 无法处理qwen里面的如等特殊标记, 分词的时候会出现把分割为 "

joeqi0370 updated 3 weeks ago
11

上一页 1...16 17 18 19 20 21 22...100 下一页

1000+ results for qwen

1000+ results
for qwen