FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
Resolved conflicts if user code had bootstrap.sh file name by modifying fedml generated bootstrap file name from bootstrap.sh to fedml_bootstrap_generated.sh
Updated file clean up logic, making it bit more modular
Fixes:
bootstrap.sh
file name by modifying fedml generated bootstrap file name frombootstrap.sh
tofedml_bootstrap_generated.sh