FederatedAI / KubeFATE

Manage federated learning workload using cloud native technologies.
Apache License 2.0

Error when running flow model load from the command line on the guest side #912

Closed SJTUHGX closed 11 months ago

SJTUHGX commented 11 months ago

[root@70d2404666cd fate]# flow model load -c fateflow/examples/model/publish_load_model.json
{
    "data": {
        "detail": {
            "guest": {
                "9999": {
                    "retcode": 0,
                    "retmsg": "success"
                }
            },
            "host": {
                "10000": {
                    "retcode": 100,
                    "retmsg": "<_InactiveRpcError of RPC that terminated with:\n\tstatus = StatusCode.UNAVAILABLE\n\tdetails = \"failed to connect to all addresses\"\n\tdebug_error_string = \"{\"created\":\"@1695389115.712982928\",\"description\":\"Failed to pick subchannel\",\"file\":\"src/core/ext/filters/client_channel/client_channel.cc\",\"file_line\":3217,\"referenced_errors\":[{\"created\":\"@1695389115.712980118\",\"description\":\"failed to connect to all addresses\",\"file\":\"src/core/lib/transport/error_utils.cc\",\"file_line\":165,\"grpc_status\":14}]}\"\n>"
                }
            }
        },
        "guest": {
            "9999": 0
        },
        "host": {
            "10000": 100
        }
    },
    "jobId": "202309222124555375640",
    "retcode": 101,
    "retmsg": "failed"
}

SJTUHGX commented 11 months ago

Problem solved: the ufw firewall on the host side needed to be turned off.
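
For anyone hitting the same StatusCode.UNAVAILABLE / "failed to connect to all addresses" error: because the cause here was a firewall on the host side, a plain TCP reachability check run from the guest machine may help rule this in or out before looking at FATE itself. Below is a minimal Python sketch. The address and port are placeholders (assumptions, not values from this issue); substitute whatever address and port your guest is configured to use to reach the host party.

```python
import socket


def port_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        # create_connection resolves the name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable networks,
        # which is what a blocking firewall typically looks like from the client.
        return False


if __name__ == "__main__":
    # Placeholder values (assumptions): replace with the host party's real
    # address and the port the guest connects to.
    host_addr = "192.0.2.10"
    host_port = 9370
    if port_reachable(host_addr, host_port):
        print(f"{host_addr}:{host_port} is reachable")
    else:
        print(f"{host_addr}:{host_port} is NOT reachable (firewall or service down?)")
```

If the port is not reachable, running `ufw status` on the host machine shows whether ufw is active. Instead of disabling the firewall completely, allowing only the required port (for example `ufw allow <port>/tcp`) is likely a safer alternative, although that was not what was done in this issue.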