Closed HarikrishnanBalagopal closed 1 month ago
Thanks for making a pull request! 😃 One of the maintainers will review and advise on the next steps.
If race condition is already tested as per this, then it looks good to me.
Abhishek additionally tested this in the image with the accelerate_launch.py script which worked nicely as well
Description of the change
Fixes crash in https://github.com/foundation-model-stack/fms-hf-tuning/issues/359
Related issue number
https://github.com/foundation-model-stack/fms-hf-tuning/issues/359
How to verify the PR
Run a multi GPU training with a non-existent output dir.
Was the PR tested