pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration
https://pytorch.org

In PyTorch v2.1.2, within fmha_api.cpp, the mha_fwd() function redundantly checks cu_seqlens_k #122595

Open 1274085042 opened 6 months ago

1274085042 commented 6 months ago

🐛 Describe the bug

In `mha_fwd()`, the contiguity of `cu_seqlens_k` is checked twice on consecutive lines, so the corresponding check for `cu_seqlens_q` is effectively missing:

https://github.com/pytorch/pytorch/blob/v2.1.2/aten/src/ATen/native/transformers/cuda/flash_attn/fmha_api.cpp#L247-L248
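
For reference, the linked lines amount to the following (a paraphrase, not a verbatim copy; the exact check macro and any error message may differ in the source):

```cpp
// aten/src/ATen/native/transformers/cuda/flash_attn/fmha_api.cpp, v2.1.2
// (paraphrased): both lines check cu_seqlens_k, leaving cu_seqlens_q
// unvalidated.
TORCH_CHECK(cu_seqlens_k.is_contiguous());
TORCH_CHECK(cu_seqlens_k.is_contiguous());
```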
cc @jbschlosser @bhosmer @cpuhrsch @erichan1 @drisspg @mikaylagawarecki

Versions

torch version == v2.1.2

drisspg commented 6 months ago

If you wanted to update this to check that `cu_seqlens_q` is contiguous, that would be sweet and I can review.
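
A minimal sketch of the suggested fix, assuming the surrounding code uses `TORCH_CHECK`-style validation as elsewhere in ATen (the helper function here is hypothetical, added only to make the snippet self-contained):

```cpp
#include <ATen/ATen.h>
#include <c10/util/Exception.h>

// Hypothetical helper sketching the fix: validate each cumulative
// sequence-length tensor exactly once, instead of checking cu_seqlens_k twice.
static void check_cu_seqlens_contiguous(const at::Tensor& cu_seqlens_q,
                                        const at::Tensor& cu_seqlens_k) {
  TORCH_CHECK(cu_seqlens_q.is_contiguous(), "cu_seqlens_q must be contiguous");
  TORCH_CHECK(cu_seqlens_k.is_contiguous(), "cu_seqlens_k must be contiguous");
}
```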