Open vmkhot opened 1 month ago
the tmp directory has to be shared between all MPI compute nodes through some other mechanism (e.g., NFS).
This looks like the other nodes cannot access the tmp_v2
from another node?
The larger issue is that we don't really test our MPI code anymore since we moved away from many low-CPU-core machines to few high-CPU-core machines. So I can't promise that the MPI implementation hasn't bitrotted away.
Expected Behavior
database to database "foldseek search" alignment using foldseek-mpi
Current Behavior
The structure alignment step dies after it sets up the jobs for the structural alignment.
What I ran
Foldseek log
foldseek_issue_log.txt
Context
Your Environment
MMseqs Version: 16dc9150581778c2c65a153ed2e6e418d29fafe3-MPI
Foldseek was self-compiled using the MPI flag
Thoughts
Your help is most appreciated!
Thanks, Varada