jlewi / flaap

Federated Learning and Analytics Protocols
Apache License 2.0

Detect stalled jobs due to insufficient workers #26

Open jlewi opened 2 years ago

jlewi commented 2 years ago

Right now, if tasks are created but there are not enough workers to run them, the coordinator just times out waiting for the tasks to complete. The timeout gives no indication that the cause was a shortage of workers.

How can we detect this condition and surface the error better?
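
One possible direction, as a rough sketch only: have the coordinator track worker heartbeats and fail fast with a distinct error when tasks are pending but too few workers have been alive for some grace period. Nothing below is taken from the flaap codebase; `StallDetector`, `InsufficientWorkersError`, `required_workers`, `heartbeat_ttl`, and `grace_period` are all hypothetical names, and it assumes workers can report a heartbeat to the coordinator.

```python
import time
from dataclasses import dataclass, field
from typing import Dict, Optional


class InsufficientWorkersError(RuntimeError):
    """Pending tasks cannot progress because too few workers are alive."""


@dataclass
class StallDetector:
    # All fields are hypothetical knobs, not existing flaap settings.
    required_workers: int         # workers the job needs to make progress
    heartbeat_ttl: float = 30.0   # seconds after which a silent worker counts as gone
    grace_period: float = 60.0    # how long a shortage may last before we fail
    _last_seen: Dict[str, float] = field(default_factory=dict)
    _short_since: Optional[float] = None

    def heartbeat(self, worker_id: str, now: Optional[float] = None) -> None:
        """Record that a worker checked in."""
        self._last_seen[worker_id] = time.monotonic() if now is None else now

    def live_workers(self, now: float) -> int:
        """Count workers whose last heartbeat is within the TTL."""
        return sum(1 for t in self._last_seen.values() if now - t <= self.heartbeat_ttl)

    def check(self, pending_tasks: int, now: Optional[float] = None) -> None:
        """Raise InsufficientWorkersError once a shortage outlasts the grace period."""
        now = time.monotonic() if now is None else now
        live = self.live_workers(now)
        if pending_tasks == 0 or live >= self.required_workers:
            self._short_since = None  # no shortage, reset the clock
            return
        if self._short_since is None:
            self._short_since = now   # shortage just started
        if now - self._short_since >= self.grace_period:
            raise InsufficientWorkersError(
                f"{pending_tasks} task(s) pending but only {live} of "
                f"{self.required_workers} required worker(s) are alive"
            )
```

The coordinator would call `check()` inside its existing wait loop, so a worker shortage shows up as `InsufficientWorkersError` with the counts in the message instead of a generic timeout. The grace period is there so that workers that are still starting up do not trigger a false alarm.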