Open szhengac opened 1 year ago
Where are the container filesystems stored? i.e. the value of ENROOT_DATA_PATH
, and how is it mounted?
This error can happen when ENROOT_DATA_PATH
is on a filesystem mounted with noexec
@flx42 I use enroot import
to preprocess the docker image to sqsh format in advance and put it under /run
(tmpfs filesystem), otherwise it will use ~/.cache
and /tmp
and the root disk does not have much available space. ENROOT_DATA_PATH
is empty on my system. By default, what I see is that everything is put under ~/.local
--export="XDG_DATA_HOME=/run/tmp
is going to influence ENROOT_DATA_PATH
and /run
is noexec
on most distributions. I suggest you configure it in /etc/enroot/enroot.conf to some other location.
Also careful with --export
as it will unset all the environment.
I have managed to clean up a disk for this. But thanks for the advice. BTW, I think we can keep the environment by adding ALL
to --export
?
Hi,
I am testing Megatron training on a demo machine with slurm. The machine has very limited disk size. And, I have to change the squashfs filesystem path to
/run
(tmpfs filesystem) by usingsrun --export="XDG_DATA_HOME=/run/tmp"
. I am not sure if this is the right way to change the default path off the home directory, but the error messageNo space left on device
was gone after setting this env var. However, there is another error coming out as shown below. Any advice will be very much appreciated./bin/sh permission should be correct: