Closed dkp closed 1 month ago
Hi @dkp!
I ran into this exact same issue with qsiprep awhile back. It's the ABI tags in the libQt5 library. It produces a very tricky error message that says the library isn't there when it is - the host system just can't load it because of those tags.
@tsalo here is where the tags get stripped out. Does this happen in the xcpd build?
It does not, but I can add it. Thanks!
I just merged #1293, which should hopefully fix the problem. @dkp once pennlinc/xcp_d:unstable
updates on DockerHub (should happen in ~2 hours), would you be willing to try it out on your HPC?
Thank you, I will try ASAP (hopefully today, and will let you know as soon as I have results)
I ran it a couple of ways. But, the most recent was a clean run with no work directory and no previous derivatives.
It worked! The output looks appropriate and complete (from skimming it) and slurm reports success. Yay!! Thank you.
Summary
I am trying XCP-D v0.10.0rc1 and running into crashes on the HPC but not my local linux box. Version xcp-d_v0.8.3.sif runs without issue in both environments on this same fmriprep dataset (though 0.83 requires
--file-format cifti --warp-surfaces-native2std
)See attached slurm log and crash report
Additional details
OS's compared:
Local linux Box: Ubuntu 24.04.1 LTS HPC: CentOS Linux 7 (Core)
Input data:
fmriprep 24.1.1 (run as follows):
What were you trying to do?
XCP-D command (same on both systems):
What did you expect to happen?
I expected the 0.10.0 pipeline to run in both environments just like the 0.83 pipeline before it
What actually happened?
The 10.0 pipeline ran correctly on the local linux box, but failed, with the same call and same data on the HPC.
Reproducing the bug
This seems to be specific to some interaction with the HPC that has changed between XCP-D version 0.83 and version 0.10.0. I have not tested intermediate versions.
crash-20241013-113252-dkp-surface_sphere_project_unproject-636e1e22-8be1-435f-b46f-0e610f0d122a.txt
slurm-xcpdfail.txt