Closed lbin closed 1 year ago
There is no speed benefit using fp16 in losses anyway. There might be numerical problems instead since it's doing sigmoid / cross entropy. So should be good to change regardless of the bug.
There is no speed benefit using fp16 in losses anyway. There might be numerical problems instead since it's doing sigmoid / cross entropy. So should be good to change regardless of the bug.