Optimise symmetrization of muffin-tin functions. The following steps are taken:
Rlm rotation matrices are now cached in the simulation context. They were somewhat expensive to regenerate on every call
Symmetrization is now performed for each atom symmetry class independently on an optimal 2D MPI grid. The first dimension parallelizes over the atoms of the given symmetry class, the second over the radial grid points. As a result, all MPI ranks take part in the symmetrization
Minor fix in the initialization of the muffin-tin magnetization: it was not synchronised across all MPI ranks
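
For reviewers, a rough sketch of how the 2D decomposition of one symmetry class can be pictured. This is not the code in this PR: the function names (`split_range`, `choose_grid`, `local_work`) are hypothetical, the sketch is plain Python with no MPI calls, and the grid heuristic (largest divisor of the rank count that does not exceed the number of atoms) is only an assumed stand-in for whatever "optimal" means in the actual implementation.

```python
def split_range(n, nparts, part):
    """Half-open block [begin, end) of n items owned by `part` out of `nparts`."""
    base, extra = divmod(n, nparts)
    begin = part * base + min(part, extra)
    end = begin + base + (1 if part < extra else 0)
    return begin, end


def choose_grid(nranks, natoms):
    """Assumed heuristic: rows = largest divisor of nranks that is <= natoms."""
    rows = max(d for d in range(1, nranks + 1)
               if nranks % d == 0 and d <= natoms)
    return rows, nranks // rows


def local_work(rank, nranks, natoms, nrpoints):
    """Atoms and radial points handled by `rank` for one symmetry class."""
    rows, cols = choose_grid(nranks, natoms)
    # Row index selects a block of atoms, column index a block of radial points.
    row, col = divmod(rank, cols)
    return split_range(natoms, rows, row), split_range(nrpoints, cols, col)


if __name__ == "__main__":
    # 12 ranks, a class of 4 atoms, 100 radial points per atom.
    for rank in range(12):
        print(rank, local_work(rank, 12, 4, 100))
```

With this layout every rank owns a non-empty tile as long as there are at least as many radial points as grid columns, which is the point of the second grid dimension: ranks that would otherwise idle on a small symmetry class share the radial grid instead.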