Closed weiqisun closed 5 months ago
This PR fixes the issue where the MII server hangs after encountering 9999 exceptions. This issue was initially reported in https://github.com/microsoft/DeepSpeed-MII/issues/489.
NVM, this fix doesn't solve the issue, since the `uid` is never returned after the exception in `put_request`.
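To illustrate the failure mode described above, here is a minimal sketch, assuming request uids come from a fixed-size pool. `UidPool`, `put_request_leaky`, and `put_request_safe` are hypothetical names for illustration only and are not MII's actual classes or functions; a pool size of 3 stands in for whatever limit the server really uses.

```python
import queue


class UidPool:
    """Hypothetical fixed-size pool of request uids (not MII's real implementation)."""

    def __init__(self, size):
        self._free = queue.Queue()
        for uid in range(size):
            self._free.put(uid)

    def acquire(self):
        # get_nowait() raises queue.Empty when the pool is exhausted;
        # a blocking get() in the same spot would wait forever instead.
        return self._free.get_nowait()

    def release(self, uid):
        self._free.put(uid)

    def available(self):
        return self._free.qsize()


def put_request_leaky(pool, prompt):
    """Acquire a uid, then fail before it is handed back: the uid leaks."""
    uid = pool.acquire()
    if not prompt:
        raise ValueError("empty prompt")  # uid is never released on this path
    pool.release(uid)
    return uid


def put_request_safe(pool, prompt):
    """Same logic, but the uid is always returned to the pool."""
    uid = pool.acquire()
    try:
        if not prompt:
            raise ValueError("empty prompt")
        return uid
    finally:
        pool.release(uid)  # runs on both the success and the exception path
```

With the leaky variant, every failed request permanently removes one uid, so after as many exceptions as the pool has entries no further request can be accepted. Releasing in a `finally` block is one way to keep the pool intact; whether that matches the structure of the real `put_request` is an assumption.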