-
How do you load and run inference on a custom GPTQ-quantized Qwen2-VL model (not the default one) using Qwen2VLForConditionalGeneration on **WINDOWS**?
I used the following code.
```
from transformers impo…
```
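For reference, a minimal loading sketch (assuming a local GPTQ checkpoint directory, with `accelerate` plus `optimum`/`auto-gptq` installed so transformers can handle the GPTQ weights; the path below is a placeholder, not the original poster's):

```python
from transformers import Qwen2VLForConditionalGeneration, AutoProcessor

# Placeholder path to the custom GPTQ-quantized Qwen2-VL checkpoint.
model_path = "path/to/your-qwen2-vl-gptq"

# device_map="auto" (requires accelerate) places the quantized weights on the
# available GPU; torch_dtype="auto" keeps the dtype the checkpoint config declares.
model = Qwen2VLForConditionalGeneration.from_pretrained(
    model_path,
    device_map="auto",
    torch_dtype="auto",
)
processor = AutoProcessor.from_pretrained(model_path)
```

On Windows the usual pitfall is the GPTQ kernel package itself rather than this loading code.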
-
See thread here:
https://github.com/ml-explore/mlx-examples/issues/776
Large Q8 models like Qwen-VL-72B are uselessly slow unless loaded immediately after a fresh boot, even though there is plenty…
-
The test code is all VLMEvalKit. I only changed the API model to qwen-vl-max-0809 and evaluated the celebrity subset of MME; I didn't touch the prompts, and I didn't change the score-computation method either. Why is the difference from the leaderboard so large?
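For reference, the run was presumably launched along these lines (a sketch: `--data`, `--model`, and `--verbose` are real VLMEvalKit flags, but the exact model alias registered for the qwen-vl-max-0809 API endpoint is an assumption):

```
# Hypothetical invocation; the model alias is an assumption, not from the post.
python run.py --data MME --model QwenVLMax --verbose
```

If the alias or API version differs from what the leaderboard run used, that alone can account for part of the score gap.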
-
Great work! I'm interested in Unsloth; can I use it to fine-tune an MLLM like Qwen-VL?
-
### Your current environment
```text
cuda 12.1 simple pip install vllm
```
### 🐛 Describe the bug
`python benchmarks/benchmark_throughput.py --backend vllm --input-len 1024 --output-len …`
-
![Image](https://github.com/user-attachments/assets/459f5917-ac00-449c-8e15-b4bb3d840255)
The y-axis is MFU and the x-axis is the training step.
I'm testing Qwen 72B with the Hugging Face Trainer, and whenever I trai…
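For context, MFU (model FLOPs utilization) is usually estimated from throughput with the standard approximation of about $6N$ training FLOPs per token for a dense transformer (the symbols here are generic, not values from the post):

$$\mathrm{MFU} = \frac{6 \cdot N \cdot T}{P_{\mathrm{peak}}}$$

where $N$ is the model parameter count, $T$ is the observed tokens per second, and $P_{\mathrm{peak}}$ is the accelerator's peak FLOP/s.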
-
### The model to consider.
https://modelscope.cn/models/qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4/
### The closest model vllm already supports.
https://modelscope.cn/models/qwen/Qwen1.5-MoE-A2.7B-Chat/…
-
I'm not exactly clear on what the question-and-answer pairs in the fine-tuning data include. Are the data and formatting similar to the LLM inputs and outputs described in the paper? If you could provide the…
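For illustration, such question-and-answer pairs are often stored in a chat-style format like the following (a hypothetical sketch; the field names follow a common convention and are not the paper's actual schema):

```python
# Hypothetical example of one fine-tuning record in a common chat format;
# field names are illustrative and not taken from the paper.
example_record = {
    "messages": [
        {"role": "user", "content": "What does the temperature parameter do during LLM sampling?"},
        {"role": "assistant", "content": "It rescales the logits before the softmax, so lower values make sampling more deterministic."},
    ]
}
```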
-
When I run:
python llm_export.py --type Qwen-7B-Chat --path /mnt/LLM_Data/Qwen-7B-Chat --export_split --export_token --export_mnn --onnx_path /mnt/LLM_Data/Qwen-7B-Chat-onnx --mnn_path /mnt/LLM_Data/…
-
The whole Qwen model family seems to be pretty inaccurate. I have not done complete benchmarks to determine where the issue is yet; that still needs to be done to find the specific area causing the er…