Open RomanKoshkin opened 6 months ago
We haven't tested on V100s so I'm not sure. I thought it worked but haven't checked.
I actually tried it just now. The issue is that the V100 has no native bfloat16 support (that arrived with Ampere GPUs such as the A100). If you change all the bfloat16 instances to float16, it should work.
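In case it helps, here is a rough sketch of picking the dtype from the device's compute capability instead of hard-coding it. The helper names `pick_half_dtype` and `detect_dtype_name` are made up for illustration and are not part of this repo; the only PyTorch calls used are `torch.cuda.is_available()` and `torch.cuda.get_device_capability()`. It assumes that compute capability 8.0 (Ampere) is the cutoff for native bfloat16.

```python
def pick_half_dtype(compute_capability):
    """Return the name of a 16-bit dtype suited to the given (major, minor) capability.

    Assumption: native bfloat16 needs compute capability 8.0 (Ampere) or newer;
    older GPUs such as the V100 (7.0) fall back to float16.
    """
    major, _minor = compute_capability
    return "bfloat16" if major >= 8 else "float16"


def detect_dtype_name():
    """Hypothetical helper: inspect the current GPU, or fall back to float32."""
    try:
        import torch  # optional; only needed for the actual device query
    except ImportError:
        return "float32"
    if not torch.cuda.is_available():
        return "float32"
    # get_device_capability() returns a (major, minor) tuple, e.g. (7, 0) on a V100
    return pick_half_dtype(torch.cuda.get_device_capability())


if __name__ == "__main__":
    print(detect_dtype_name())
```

With PyTorch installed, the returned name can be turned into a dtype with `getattr(torch, name)`. Note that float16 has a narrower dynamic range than bfloat16, so training in float16 may need loss scaling; I haven't checked whether that matters for this code.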
Everything works on my A6000s and A100s, but not on the older V100 (it says the compute capability is too low). Are there plans to add support for legacy devices? Thanks!