issues
search
ROCm
/
flash-attention
Fast and memory-efficient exact attention
BSD 3-Clause "New" or "Revised" License
142
stars
46
forks
source link
Change rounding of bf16 to rtn
#78
Closed
rocking5566
closed
2 months ago