Open pc363449207 opened 1 year ago
I don't know. The message "/public/home/pancheng/cactus/cactus_env/bin/python3: bad interpreter: No such file or directory" makes it really look like the worker job cannot find /public/home/pancheng/cactus/cactus_env/bin/python3 for some reason.
Hello,
I am encountering an issue when running Cactus on a SLURM cluster. After executing the command:
cactus ./js ./examples/evolverMammals.txt evolverMammals.hal --batchSystem slurm --batchLogsDir batch-logs --consCores 80
I receive the following error message:
[2023-11-15T15:59:37+0800] [MainThread] [W] [toil.leader] No log file is present, despite job failing: 'progressive_workflow' kind-progressive_workflow/instance-hlg2sd3l v1) [00:32<00:00, 0.75 jobs/s]
[2023-11-15T15:59:37+0800] [MainThread] [W] [toil.leader] The batch system left an empty file batch-logs/toil_1d743db1-387f-4c1f-be9a-55f8d653b0dc.24.83368.out.log
[2023-11-15T15:59:37+0800] [MainThread] [W] [toil.leader] The batch system left a non-empty file batch-logs/toil_1d743db1-387f-4c1f-be9a-55f8d653b0dc.24.83368.err.log:
[2023-11-15T15:59:37+0800] [MainThread] [W] [toil.leader] Log from job "kind-progressive_workflow/instance-hlg2sd3l" follows:
=========>
/opt/gridview/slurm/spool/slurmd/job83368/slurm_script: /public/home/pancheng/cactus/cactus_env/bin/_toil_worker: /public/home/pancheng/cactus/cactus_env/bin/python3: bad interpreter: No such file or directory
<=========
However, I have verified that the path /public/home/pancheng/cactus/cactus_env/bin/python3 does indeed exist and is accessible.
Here are some key details regarding my setup:
I am using a shared file system in the SLURM cluster.
The virtual environment is correctly set up at the mentioned path.
Python 3.8 is installed, and the virtual environment was created and activated without any issues.
The issue persists despite the environment being correctly activated in the SLURM job script.
I am not sure why this error is occurring, since the specified interpreter path is correct and works outside of the SLURM environment. Any insights or suggestions on how to resolve this issue would be greatly appreciated.
Thank you!
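One way to narrow this down is to repeat the interpreter check from a compute node rather than the login node. The script below is only a minimal diagnostic sketch, not part of Cactus or Toil: the function names read_shebang and check_interpreter are made up for illustration. It reads the "#!" line of a script such as _toil_worker, follows any symlinks on the interpreter path, and reports whether the final target exists and is executable on the node where it runs. It assumes some Python 3 is available on the compute node to run the check itself. In a virtual environment, bin/python3 is often a symlink to a base interpreter, so the link target could be missing on a compute node even when the environment directory is on shared storage; whether that is what happens here is a guess.

```python
import os
import sys


def read_shebang(script_path):
    """Return the interpreter path from the script's '#!' line, or None."""
    with open(script_path, "rb") as handle:
        first_line = handle.readline().decode("utf-8", errors="replace").strip()
    if not first_line.startswith("#!"):
        return None
    parts = first_line[2:].split()
    return parts[0] if parts else None


def check_interpreter(script_path):
    """Report whether the script's interpreter is usable on this node."""
    report = {
        "script_exists": os.path.exists(script_path),
        "interpreter": None,
        "resolved": None,
        "exists": False,
        "executable": False,
    }
    if not report["script_exists"]:
        return report
    interpreter = read_shebang(script_path)
    if interpreter is None:
        return report
    # realpath follows symlinks, so a dangling venv link shows up here.
    resolved = os.path.realpath(interpreter)
    report["interpreter"] = interpreter
    report["resolved"] = resolved
    report["exists"] = os.path.exists(resolved)
    report["executable"] = os.access(resolved, os.X_OK)
    return report


if __name__ == "__main__":
    # Default to the worker script named in the error message above.
    default = "/public/home/pancheng/cactus/cactus_env/bin/_toil_worker"
    target = sys.argv[1] if len(sys.argv) > 1 else default
    for key, value in check_interpreter(target).items():
        print(f"{key}: {value}")
```

Saved under a hypothetical name such as check_interpreter.py, this could be run once on the login node and once through srun (for example "srun python3 check_interpreter.py") so the two reports can be compared. If "exists" is True on the login node but False under srun, the interpreter or its symlink target is probably not visible from the compute nodes.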