Closed rodarima closed 4 years ago
It turns out the computation time is smaller as the number of processes increases. The creation of the plan dominates the solver time with more than 2 processes.
Processes Stats
1 Solver comp/total: 9.393120e-04/1.369303e-03 = 6.859782e-01
2 Solver comp/total: 1.384648e-02/3.762607e-01 = 3.680022e-02
4 Solver comp/total: 2.705376e-02/2.438905e+00 = 1.109258e-02
Creating the plan only at iteration -1 solves the problem, see 56c5f4228d85928ecd75c8c28f62af183f1a176e
It seems the communications in the FFT may cause this issue. Can we investigate the create plan overhead compared to the FFT computation?