jihuishan opened 8 months ago
Same question here.
Start Date
No response
Implementation PR
No response
Reference Issues
No response
Summary
Trying to fine-tune Qwen-VL, but both training and inference run slowly.
Basic Example
Could you support flash attention for Qwen-VL, just as Qwen-7B already does?
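For context on why this helps: flash attention's speedup comes from computing softmax attention in tiles with a running ("online") softmax, so the full L×L score matrix is never materialized. A minimal numpy sketch of that idea (illustration only, not Qwen's or the flash-attn library's actual implementation, which fuses this into a GPU kernel):

```python
# Illustrative sketch of the online-softmax / tiling trick behind
# FlashAttention. Not Qwen code; numpy only, for clarity.
import numpy as np

def naive_attention(Q, K, V):
    # Standard attention: materializes the full score matrix S.
    S = Q @ K.T / np.sqrt(Q.shape[-1])
    P = np.exp(S - S.max(axis=-1, keepdims=True))
    P /= P.sum(axis=-1, keepdims=True)
    return P @ V

def tiled_attention(Q, K, V, tile=4):
    # Processes K/V in tiles, keeping only running statistics per query row:
    # m = running max of scores, l = running softmax denominator.
    d = Q.shape[-1]
    out = np.zeros_like(Q)
    m = np.full(Q.shape[0], -np.inf)
    l = np.zeros(Q.shape[0])
    for j in range(0, K.shape[0], tile):
        S = Q @ K[j:j + tile].T / np.sqrt(d)   # scores for this tile only
        m_new = np.maximum(m, S.max(axis=-1))
        scale = np.exp(m - m_new)              # rescale previous partial sums
        P = np.exp(S - m_new[:, None])
        l = l * scale + P.sum(axis=-1)
        out = out * scale[:, None] + P @ V[j:j + tile]
        m = m_new
    return out / l[:, None]

rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((8, 16)) for _ in range(3))
# The tiled version matches naive attention to numerical precision.
assert np.allclose(naive_attention(Q, K, V), tiled_attention(Q, K, V))
```

The same rescaling trick is what lets the real kernel stream over long sequences in O(L) memory, which is where the training/inference speedup for a VL model with long visual token sequences would come from.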
Drawbacks
It may take some effort to implement.
Unresolved questions
No response