gmarkall / manycore_form_compiler

MCFC is deprecated. See https://code.launchpad.net/~grm08/ffc/pyop2
https://code.launchpad.net/~grm08/ffc/pyop2
GNU General Public License v3.0
3 stars 1 forks source link

Fuse loop nests for coefficient evaluation at quadrature points #55

Open kynan opened 12 years ago

kynan commented 12 years ago

At present, we generate a separate loop nest for each coefficient we need to evaluate at the quadrature points (a loop over dimensions per rank and a loop over basis functions). These loops could be fused as far as possible. Coefficients of equal rank could be trivially evaluated in the same loop nest.

We need to keep in mind though that this can have performance implications (e.g. cache performance) depending on the layout of the coefficients in memory. Loop fusion may hence even have a negative impact on performance.