Closed MasterJH5574 closed 1 month ago
This PR fixes the TIR attention kernels in PagedKVCache tests, which had issues when handling uncommon GQA group size (e.g., 6).
This PR fixes the TIR attention kernels in PagedKVCache tests, which had issues when handling uncommon GQA group size (e.g., 6).