issues
search
awslabs
/
palace
3D finite element solver for computational electromagnetics
https://awslabs.github.io/palace/dev
Apache License 2.0
237
stars
49
forks
source link
Collection of minor performance fixes during profiling and GPU testing
#181
Closed
sebastiangrimberg
closed
7 months ago
sebastiangrimberg
commented
8 months ago
Only parallelize libCEED across OpenMP threads when CPU backends are used
Add some missing
AddMult
/
AddMultTranspose
overrides and avoid calling them when they don't exist to avoid temporary vectors
Avoid constructing discrete gradient matrix on coarse mesh unless necessary for coarse solve (AMS)
AddMult
/AddMultTranspose
overrides and avoid calling them when they don't exist to avoid temporary vectors