laekov / fastmoe

A fast MoE impl for PyTorch
https://fastmoe.ai
Apache License 2.0
1.52k stars 184 forks

fix cublas gemm call for bf16 input #171

Closed (xptree closed 1 year ago)

laekov commented 1 year ago

Oh, I should not have run bf16 on a V100. My fault.
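For context on the comment above: V100 is a Volta GPU (compute capability 7.0), while native bf16 arithmetic arrived with Ampere (compute capability 8.0). A minimal sketch of a guard that catches this before launching a bf16 run might look like the following. The helper name `has_native_bf16` is hypothetical and not part of FastMoE; it only compares a `(major, minor)` compute capability tuple against the Ampere threshold.

```python
# Hypothetical helper for illustration; not part of FastMoE.
# Assumption: native bf16 support starts at compute capability 8.0 (Ampere).
AMPERE_MAJOR = 8


def has_native_bf16(capability):
    """Return True if a (major, minor) compute capability has native bf16.

    Examples of capability tuples: (7, 0) for V100, (8, 0) for A100.
    """
    major, _minor = capability
    return major >= AMPERE_MAJOR


print(has_native_bf16((7, 0)))  # V100 -> False
print(has_native_bf16((8, 0)))  # A100 -> True
```

With PyTorch, the tuple for the current device can be obtained from `torch.cuda.get_device_capability()`, so the check can run before any bf16 tensors reach a GEMM call.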