Open yuhan1li opened 2 months ago
Hi! I also encontered such issue. You need to install flash-attn (e.g., flash-attn==1.0.2) to let the self_attn equipped with layer .Wqkv()