Open zero0kiriyu opened 6 months ago
According to the accelerate usage case, I think it is possible to overload the original loss just before accelerator.backward(loss). We will make a test.
According to the accelerate usage case, I think it is possible to overload the original loss just before accelerator.backward(loss). We will make a test.