3dem / relion

Image-processing software for cryo-electron microscopy
https://relion.readthedocs.io/en/latest/
GNU General Public License v2.0
450 stars 201 forks source link

2D Classification crashes after 10 iterations with "master encountered error:" #316

Closed ncoudray closed 6 years ago

ncoudray commented 6 years ago

Hi,

One of our 2D classification crashed after 10 iterartions. I haven't found similar post yet, sorry if I missed it. Here is the run.err: master encountered error: ERROR: Cannot read file /tmp/isomg01/relion_volatile/particles.mrcs It does not exist File: /local/apps/relion/2.1.0/src/image.h line: 157 rm: cannot remove ‘/tmp/4120472.1.gpu1.q/machines’: No such file or directory rm: cannot remove ‘/tmp/4120472.1.gpu1.q/ssh’: No such file or directory

and the run.out: `-catch_rsh /cm/local/apps/sge/var/spool/gpu102/active_jobs/4120472.1/pe_hostfile gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 gpu102 Warning: no access to tty (Bad file descriptor). Thus no job control in this shell. === RELION MPI setup ===

Please let me know if you know the origin of that error,

Thanks, Best, Nicolas

dkimanius commented 6 years ago

Looks like something happened with the scratch-drive file-access, considering the error:

master encountered error: ERROR: Cannot read file /tmp/isomg01/relion_volatile/particles.mrcs...

Make sure that the scratch-drive is properly connected and mounted. You should be able to continue from iteration 10 using the continue-flag.

ncoudray commented 6 years ago

Thanks a lot !