pytorch-labs / tritonbench

Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
BSD 3-Clause "New" or "Revised" License
20 stars, 3 forks

Update hstu and fix ragged attn #59

Closed. xuzhao9 closed this pull request 2 days ago.

xuzhao9 commented 2 days ago

Update the HSTU ragged attention kernel with code changes.

Test plan:

OSS CI

facebook-github-bot commented 2 days ago

@xuzhao9 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot commented 2 days ago

@xuzhao9 merged this pull request in pytorch-labs/tritonbench@9f7e9194c71e031b160440d8eb6fe6449c1310c4.