ROCm / flash-attention

Fast and memory-efficient exact attention
BSD 3-Clause "New" or "Revised" License
142 stars 46 forks source link

Change rounding of bf16 to rtn #78

Closed rocking5566 closed 2 months ago