FedML-AI / FedML

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
Apache License 2.0

An error occurred when running this code: args=fedml.init() #1193

Open FryLcm opened 1 year ago

FryLcm commented 1 year ago

[Screenshot of the error; image not available]

FryLcm commented 1 year ago

Please, how can I solve it?

rekkles2 commented 1 year ago

I encountered the exact same problem; see #1521.

fedml-dimitris commented 1 year ago

Hello,

the recommended way to start your client or server scripts is by calling the respective bash script. Please have a look at this README.md.

The error you are seeing above occurs because you are calling your Python script without passing the configuration file, so the _global_training_type and _global_comm_backend variables are never set during initialization. Both variables are populated from values in the config file.
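If you still want to launch the Python script directly, a small pre-flight check can make the missing config file obvious before fedml.init() runs. The sketch below is only an illustration: require_config_file is a hypothetical helper, not part of FedML, and it assumes the config file is passed with a --cf flag, as in the example bash scripts. Please check the bash script for your example to confirm the exact flags.

```python
import argparse
import os


def require_config_file(argv):
    """Return the config path passed via --cf, or exit with a readable message.

    Hypothetical pre-flight helper, not part of FedML.
    """
    parser = argparse.ArgumentParser(add_help=False)
    # Assumption: the config file is passed as "--cf <path>"; verify this
    # against the bash script that ships with your example.
    parser.add_argument("--cf", dest="config_file", default=None)
    # parse_known_args ignores any other flags the script may receive.
    known, _unused = parser.parse_known_args(argv)
    if known.config_file is None:
        raise SystemExit("No config file given; pass one with --cf <path-to-yaml>.")
    if not os.path.isfile(known.config_file):
        raise SystemExit(f"Config file not found: {known.config_file}")
    return known.config_file


# Intended use, at the top of the client or server script:
#   import sys
#   require_config_file(sys.argv[1:])
#   args = fedml.init()
```

The check only verifies that a config file was passed and exists; it does not validate the training type or communication backend entries inside it.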

Hope this helps to solve the problem.