FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
I'm trying to run python/examples/simulation/sp_fedavg_mnist_lr_example/torch_fedavg_mnist_lr_one_line_example.py. It loops forever in server_runner.py:bind_account_and_device_id, because the request to https://open.fedml.ai/fedmlOpsServer/edges/binding in client_runner.py:bind_account_and_device_id keeps returning a DATA_NO_EXIST_ERROR status code.
Here is the request JSON sent to https://open.fedml.ai/fedmlOpsServer/edges/binding (personal information replaced with asterisks).
The cause seems to be the MLOps ID in your example fedml_config.yaml, which is not valid.
At a minimum, the binding step should raise an error instead of retrying forever. As it stands, it took me hours to locate the issue.
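
To illustrate the behavior I'm asking for, here is a minimal sketch of a bounded retry that fails loudly. It is not FedML's actual code: `bind_with_retry`, `BindingError`, the `send_request` callable, and the response shape (a dict with `code` and `data` keys) are all hypothetical names and assumptions made for this example. The only detail taken from the report is the DATA_NO_EXIST_ERROR code.

```python
import time


class BindingError(RuntimeError):
    """Hypothetical exception: the binding request cannot succeed."""


def bind_with_retry(send_request, max_attempts=3, delay_seconds=0.0):
    """Call send_request() a bounded number of times instead of looping forever.

    send_request is a hypothetical callable that performs the binding request
    and returns a dict, assumed here to look like
    {"code": "SUCCESS", "data": {...}} or {"code": "DATA_NO_EXIST_ERROR"}.
    """
    last_code = None
    for attempt in range(1, max_attempts + 1):
        response = send_request()
        last_code = response.get("code")
        if last_code == "SUCCESS":
            return response.get("data")
        if last_code == "DATA_NO_EXIST_ERROR":
            # Retrying cannot fix an account ID the server does not know,
            # so stop immediately and tell the user where to look.
            raise BindingError(
                "binding rejected with DATA_NO_EXIST_ERROR; "
                "check the MLOps ID in fedml_config.yaml"
            )
        if attempt < max_attempts:
            time.sleep(delay_seconds)
    raise BindingError(
        f"binding failed after {max_attempts} attempts (last code: {last_code})"
    )
```

With something along these lines, an invalid ID would surface as a single exception on the first response, and transient failures would still be retried a limited number of times.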