Open biochem-fan opened 8 years ago
I think this is divided into two problems: the memory aspect and the multiple worker nodes aspect.
The memory aspect of this has now been fixed (I hope, pending @biochem-fan checking this) by adding a LOW_MEMORY_MODE ON (plus OLD_MERGE OFF) functionality. Ambiguity breaker still needs to be made low-memory.
At SACLA, we have only one fat-memory node (1TB RAM) and a time limit of 24 hr for a job. Thus, the memory and time requirement of post-refinement limits the size of the dataset we can process.
I have a suggestion to make post-refinement more scalable on multiple nodes.
This can be ideally implemented using MPI, but more easily by shell scripts using GNU parallel with minimum modifications to cppxfel.