NLKNguyen / alpine-mpich

MPI Cluster Automation Solution using Docker, based on Alpine Linux with MPICH (see IEEE paper)
https://github.com/NLKNguyen/alpine-mpich
MIT License
133 stars 56 forks source link

service logs of worker indicate Could not resolve hostname mpi-master or Connection refused #11

Open 13ean opened 6 years ago

13ean commented 6 years ago

I have created service image

when inspect logs of worker, get infomation like this image

simonholgate commented 5 years ago

Out of curiosity, what cloud provider are you using?

I had a real problem connecting to the containers on Google Cloud because the docker encryption on the subnet isn't supported by the NAT. I believe this is the case on AWS too: https://github.com/moby/moby/issues/37115

This isn't an issue on Digital Ocean though.

My solution was to remove the --opt encrypted line from swarm.sh

Another possible issue is that the firewall hasn't got all the necessary ports open.