The demo code provided by read.md runs incorrectly

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Apache License 2.0

2.08k stars 116 forks source link

The demo code provided by read.md runs incorrectly #239

Open WilliamRocketRen opened 2 hours ago

WilliamRocketRen commented 2 hours ago

When I use the demo provided by read.md to run to output_ids = model.generate(**inputs, max_new_tokens=128), an error RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0! (when checking argument for argument mask in method wrapper_CUDA__maskedscatter) appears. My devlepment environment is as follows: cuda:11.8 pytorch:2.1.0&2.4.1 transformers:4.45.0.dev0 accelerate:0.34.2 hardware environment: H800 80G

WilliamRocketRen commented 2 hours ago

CUDA_AVIBLE_DEVICES=0,1,2,3,5,6,7