Hi-PACE / hipace

Highly efficient Plasma Accelerator Emulation, quasistatic particle-in-cell code
https://hipace.readthedocs.io

Faster multigrid solve with OMP #1160

Closed: AlexanderSinn closed this 1 month ago

AlexanderSinn commented 2 months ago

Based on the GPU implementation of gsrb_shared, which uses shared memory, this PR brings the same improvement to CPU runs with OMP by caching phi in a local array.

With 2047^2 cells and 48 OMP threads, this PR gives a 78% speedup of HPMG compared to the development branch.
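
To illustrate the idea of caching phi in a local array, here is a minimal sketch of a tiled Gauss-Seidel red-black (GSRB) sweep with OMP. It is not the code from this PR: `Grid`, `TILE`, `HALO`, `gsrb_reference` and `gsrb_cached` are hypothetical names, the stencil is a plain 5-point Laplacian with unit spacing and fixed boundary values, and the sketch reads from one array and writes to another to avoid races between tiles, which may differ from what HPMG actually does.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// All names below are illustrative assumptions, not HPMG identifiers.
constexpr int TILE = 16;              // cells per tile edge written back by one thread
constexpr int HALO = 2;               // red + black half-sweep consume two ghost layers
constexpr int LOC  = TILE + 2 * HALO; // edge length of the local cache

struct Grid {
    int n;                            // cells per edge, including the fixed boundary layer
    std::vector<double> v;
    explicit Grid(int n_) : n(n_), v(static_cast<std::size_t>(n_) * n_, 0.0) {}
    double& operator()(int i, int j)       { return v[static_cast<std::size_t>(j) * n + i]; }
    double  operator()(int i, int j) const { return v[static_cast<std::size_t>(j) * n + i]; }
};

// Reference: one in-place half-sweep over interior cells with (i + j) % 2 == color,
// relaxing lap(phi) = rhs with grid spacing 1.
void half_sweep(Grid& phi, const Grid& rhs, int color) {
    for (int j = 1; j < phi.n - 1; ++j)
        for (int i = 1; i < phi.n - 1; ++i)
            if (((i + j) & 1) == color)
                phi(i, j) = 0.25 * (phi(i - 1, j) + phi(i + 1, j) +
                                    phi(i, j - 1) + phi(i, j + 1) - rhs(i, j));
}

void gsrb_reference(Grid& phi, const Grid& rhs) {
    half_sweep(phi, rhs, 0);          // red
    half_sweep(phi, rhs, 1);          // black
}

// Tiled variant: each tile copies phi (plus halo) into a small local array,
// performs the red and black half-sweeps there, and writes only its own cells to out.
void gsrb_cached(const Grid& in, const Grid& rhs, Grid& out) {
    assert(in.n == rhs.n && in.n == out.n);
    const int n = in.n;
    out.v = in.v;                     // simplest way to carry over the boundary values
    const int ntiles = (n - 2 + TILE - 1) / TILE;

    #pragma omp parallel for collapse(2)
    for (int tj = 0; tj < ntiles; ++tj) {
        for (int ti = 0; ti < ntiles; ++ti) {
            const int i0 = 1 + ti * TILE - HALO;   // global index of local cell (0, 0)
            const int j0 = 1 + tj * TILE - HALO;
            double loc[LOC][LOC];

            // Load tile plus halo; cells outside the grid are zero and never read
            // by an update of a valid interior cell.
            for (int lj = 0; lj < LOC; ++lj)
                for (int li = 0; li < LOC; ++li) {
                    const int gi = i0 + li, gj = j0 + lj;
                    const bool inside = gi >= 0 && gi < n && gj >= 0 && gj < n;
                    loc[lj][li] = inside ? in(gi, gj) : 0.0;
                }

            // Red on the tile grown by one cell, then black on the tile itself.
            for (int color = 0; color < 2; ++color) {
                const int lo = 1 + color, hi = LOC - 1 - color;
                for (int lj = lo; lj < hi; ++lj)
                    for (int li = lo; li < hi; ++li) {
                        const int gi = i0 + li, gj = j0 + lj;
                        if (gi < 1 || gi > n - 2 || gj < 1 || gj > n - 2) continue;
                        if (((gi + gj) & 1) != color) continue;
                        loc[lj][li] = 0.25 * (loc[lj][li - 1] + loc[lj][li + 1] +
                                              loc[lj - 1][li] + loc[lj + 1][li] - rhs(gi, gj));
                    }
            }

            // Write back only the cells owned by this tile.
            for (int lj = HALO; lj < HALO + TILE; ++lj)
                for (int li = HALO; li < HALO + TILE; ++li) {
                    const int gi = i0 + li, gj = j0 + lj;
                    if (gi <= n - 2 && gj <= n - 2) out(gi, gj) = loc[lj][li];
                }
        }
    }
}
```

In this sketch the red cells in the one-cell ring around each tile are recomputed redundantly by neighbouring tiles. That extra work is the price for letting every thread do both half-sweeps out of its local array without synchronising with other threads in between.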