ROCm / flash-attention

Fast and memory-efficient exact attention
BSD 3-Clause "New" or "Revised" License

add rocm benchmark script #11

Closed by fsx950223 8 months ago

guangzlu commented 1 year ago

@sabreshao This is the benchmark script requested by the framework team. With this script, we can measure performance on ROCm in a reasonable way.