Open ColonelPanics opened 7 years ago
You need to build your own copy of the base image (alces gridware docker build --base) and share it with the cluster (alces gridware docker share base) in order for the MPI enhancements to be present in images.
We should probably push a new version of the base image to docker.io - not sure of the credentials / process for that though!
We should also consider what the correct solution to the permissions issue is - /opt/gridware used to have a chgrp -R gridware run on it, but we no longer do that (as part of adding userspace Gridware).
Pushed rebuild of gridware-base: https://hub.docker.com/r/alces/gridware-base/
If the ownership change is only needed for the docker exports, could it be added as part of feature/configure-docker?
Presumably we want all cluster users - not just administrators, who traditionally would make up the gridware group - to be able to launch Gridware MPI jobs? If so then we need another solution, as chgrp is insufficient.
The easy and quick fix is to make the /opt/gridware/docker/exports directory world-writable.
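For reference, a minimal sketch of what the quick fix might look like. The helper name make_exports_writable is hypothetical, and the sticky bit (mode 1777, as used on /tmp) is an addition of this sketch rather than part of the proposal; plain 0777 would match the suggestion literally. It is demonstrated on a scratch directory so it can be run without root.

```shell
# Hypothetical helper: make an exports directory world-writable.
# Mode 1777 adds the sticky bit so users cannot remove each other's
# exported images; this is an assumption, not something the issue specifies.
make_exports_writable() {
    dir="$1"
    [ -d "$dir" ] || { echo "no such directory: $dir" >&2; return 1; }
    chmod 1777 "$dir"
}

# Demonstrated on a scratch directory. On a real cluster the argument would
# presumably be /opt/gridware/docker/exports, and the call would need root.
demo_dir="$(mktemp -d)"
make_exports_writable "$demo_dir"
ls -ld "$demo_dir"
```

Any user could then write into the directory, which is why this is the quick option rather than the careful one.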
A more correct approach could be to leave the directory permissions as-is, add a sudoers.d entry allowing all users to run sudo docker save -o ${cw_GRIDWARE_root}/docker/exports/* without a password, and modify the docker_share command in clusterware-services accordingly.
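A rough sketch of what such a sudoers.d entry could look like, written to a scratch file for review rather than installed. It assumes docker lives at /usr/bin/docker and that ${cw_GRIDWARE_root} resolves to /opt/gridware; neither is confirmed here, and sudoers needs literal paths rather than shell variables.

```shell
# Sketch only: write a candidate sudoers.d fragment to a scratch file.
# Assumptions: docker binary at /usr/bin/docker, cw_GRIDWARE_root=/opt/gridware.
fragment="$(mktemp)"
cat > "$fragment" <<'EOF'
# Allow any user to save an image into the Gridware exports directory
# without a password.
ALL ALL=(root) NOPASSWD: /usr/bin/docker save -o /opt/gridware/docker/exports/*
EOF

# Syntax-check the fragment if visudo is available; -c -f parses the given
# file without touching /etc/sudoers.
if command -v visudo >/dev/null 2>&1; then
    visudo -c -f "$fragment" || echo "fragment failed validation" >&2
fi
cat "$fragment"
```

One caveat: sudoers wildcards in command arguments match across spaces and slashes, so the trailing * would also admit extra arguments or paths containing .. segments. A small root-owned wrapper script that validates the target filename might be a tighter thing to whitelist.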
On a Flight instance with feature/configure-docker enabled, the following is seen (after having created a memtester image):
Permissions of the directory:
If these permissions are changed to:
Then the command seems to loop indefinitely waiting for slaves to be ready