Open ganeshkrishnan1 opened 6 months ago
bfloat16 is not supported on ampere devices so if flash attention 2 is not supported its an ampere device and dtype has to be float16
bfloat16 is not supported on ampere devices so if flash attention 2 is not supported its an ampere device and dtype has to be float16