3dem / relion

Image-processing software for cryo-electron microscopy
https://relion.readthedocs.io/en/latest/
GNU General Public License v2.0
456 stars 203 forks source link

multi-body refinement FFTW crash #843

Closed Pallesen-IUB closed 2 years ago

Pallesen-IUB commented 2 years ago

This crash is persistent in versions 3.1 and 4.0.

data set: Two optics groups box sizes 400 and 632 pixel sizes 1.054 and 0.666 number of molecular projection images: ~30,000

Job: Multibody refinement with 3 mpi ranks; threads per rank 1 (--j 1) Running on 2 RTX8000 cards on HP Z8 linux tower; Ubuntu

Command: time mpiexec -np 3 which relion_refine_mpi --ref_angpix 1.054 --continue Refine3D/cls6c3mask_it018_optimiser.star --o MultiBody/twobody --solvent_correct_fsc --multibody_masks Bodies.star --oversampling 1 --healpix_order 3 --auto_local_healpix_order 4 --offset_range 3 --offset_step 1.5 --skip_gridding --reconstruct_subtracted_bodies --dont_combine_weights_via_disc --pool 30 --pad 2 --j 1 --gpu "" 1> ./mrefine.out 2> ./mrefine.err

error log (v3.1): No protocol specified

NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes. You may or may not see output from other processes, depending on exactly when Open MPI kills them.

equivalent autorefine job runs without any FFTW-related issues (obviously; I'm continuing this run).

biochem-fan commented 2 years ago

I'm not sure if MultiBody refinement supports optics groups with different box sizes. What happens if you re-extract particles in the second optics group into a 400 px box? It will be 632 * 0.666 / 400 = 1.052 A/px. Since the first optics group is 1.054 A/px, there is little point having the second group at a much smaller pixel size.

Pallesen-IUB commented 2 years ago

I kept them at different box sizes for other reasons. As you picked up easily, they are scaled properly... I thought about this, but read this thread: https://github.com/3dem/relion/issues/543 This user shared a star header revealing box sizes of 384 and 400 and with no apparent issues...

I'll hack, re-extract and re-run...

Pallesen-IUB commented 2 years ago

Runs as expected with same box size across optics groups.

scheres commented 2 years ago

thanks for confirming!