laekov / fastmoe

A fast MoE impl for PyTorch
https://fastmoe.ai
Apache License 2.0
1.57k stars 189 forks source link

fix cublas gemm call for bf16 input #171

Closed xptree closed 1 year ago

laekov commented 1 year ago

Oh I should not run bf16 on V100. My fault.