Closed by guillaumekln 2 years ago
The softmax op already accumulates values in FP32 so there should be no numerical issues in FP16.
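To illustrate the point, here is a minimal NumPy sketch of a softmax that takes FP16 inputs, does the exponentiation and the sum in FP32, and only casts back to FP16 at the end. This is not the library's actual kernel; the function name `softmax_fp16_with_fp32_accumulation` is hypothetical and the code only assumes NumPy is available.

```python
import numpy as np


def softmax_fp16_with_fp32_accumulation(x):
    """Softmax over the last axis: FP16 in, FP16 out, FP32 math inside.

    Hypothetical helper for illustration only, not taken from the library.
    """
    x = np.asarray(x, dtype=np.float16)

    # Upcast before any arithmetic so intermediate values live in FP32.
    x32 = x.astype(np.float32)

    # Subtract the row maximum so exp() never overflows.
    shifted = x32 - x32.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)

    # The accumulation (the sum of exponentials) is done in FP32.
    total = exp.sum(axis=-1, keepdims=True, dtype=np.float32)

    # Only the final probabilities are rounded to FP16.
    return (exp / total).astype(np.float16)


if __name__ == "__main__":
    logits = np.zeros(4096, dtype=np.float16)
    probs = softmax_fp16_with_fp32_accumulation(logits)
    print(probs.dtype, probs.astype(np.float32).sum())
```

Accumulating the 4096 exponentials directly in FP16 would lose precision (FP16 cannot represent every integer above 2048), which is the kind of issue FP32 accumulation avoids.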