MayDomine/Burst-Attention
Distributed IO-aware Attention algorithm
Apache License 2.0
17 stars, 0 forks
issues
#4 Differences between Triton and Cuda implementations (gabeweisz, opened 2 months ago, 3 comments)
#3 Can burst-attention be used in Model Inference? (gitcloneman, opened 5 months ago, 2 comments)
#2 Benchmark issue (Iron-Bound, closed 6 months ago, 1 comment)
#1 example modify (MayDomine, closed 1 year ago, 1 comment)