OrderN / CONQUEST-release

Full public release of large scale and linear scaling DFT code CONQUEST
http://www.order-n.org/
MIT License
96 stars 25 forks source link

Overlap communication with computation in multiply_module #290

Open ilectra opened 5 months ago

ilectra commented 5 months ago

Fixes #265

tkoskela commented 5 months ago

From @davidbowler: compute is only being called on kpart [2 : end]. To fix, call compute kernels on kpart -1 then call once after the loop on kpart.

ilectra commented 5 months ago

I think this can be reviewed now. I'll produce some profiles, to see if we gained anything, when I'm back from the holidays, I don't think I'll have time tomorrow.