Open dazzng opened 2 months ago
Flash attention requires CUDA or ROCm. There's a recent Metal port that makes it work on M-series Macs, but there is no way to install it as of now.
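As a workaround until there is an installable Metal build, something like the sketch below could choose an attention path at runtime. It is only an illustration: `pick_attention_backend`, `detect_backends`, and the labels `"flash_attn"` and `"sdpa"` are hypothetical names, not part of any library, and it assumes PyTorch is the framework in use. `"sdpa"` stands for falling back to PyTorch's built-in `torch.nn.functional.scaled_dot_product_attention`.

```python
import importlib.util
from typing import Tuple


def pick_attention_backend(has_cuda: bool, has_rocm: bool, has_mps: bool) -> str:
    """Return a hypothetical label for which attention path to try."""
    if has_cuda or has_rocm:
        # Flash attention is only installable on CUDA or ROCm.
        return "flash_attn"
    # Assumption: no installable Metal build, so MPS and CPU both fall back.
    return "sdpa"


def detect_backends() -> Tuple[bool, bool, bool]:
    """Best-effort detection of (cuda, rocm, mps); all False without PyTorch."""
    if importlib.util.find_spec("torch") is None:
        return (False, False, False)
    import torch

    gpu = torch.cuda.is_available()  # also True on ROCm builds of PyTorch
    is_rocm = getattr(torch.version, "hip", None) is not None
    mps = getattr(torch.backends, "mps", None)
    has_mps = bool(mps is not None and mps.is_available())
    return (gpu and not is_rocm, gpu and is_rocm, has_mps)
```

Usage would be `pick_attention_backend(*detect_backends())`; on an M-series Mac this returns `"sdpa"` under the assumption above.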