I'm making a GPU version of scuff-em. Thoughts?

HomerReid / scuff-em

A comprehensive and full-featured computational physics suite for boundary-element analysis of electromagnetic scattering, fluctuation-induced phenomena (Casimir forces and radiative heat transfer), nanophotonics, RF device engineering, electrostatics, and more. Includes a core library with C++ and python APIs as well as many command-line applications.

GNU General Public License v2.0

128 stars, 51 forks

No, I have never tried this, but I'm not aware of any technical reason it shouldn't work. In fact, the matrix-assembly step is embarrassingly parallel and should be amenable to GPU speedup. However, the system for caching and reusing the frequency-independent contributions to panel integrals relies on global (geometry-wide) hash tables, so for massively parallel execution it might be faster to disable caching and recompute those contributions on the fly. Alternatively, if the caching is done in a GPU-local way, there may be no speed penalty.
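To make the three caching options concrete, here is a toy sketch in Python. It is not scuff-em code and uses none of its API: the panel centroids, `geometric_factor`, `entry`, and the three lookup strategies are all hypothetical stand-ins, and CPU threads stand in for GPU workers, so it says nothing about real performance. It only shows that a shared locked table, recomputing on the fly, and a worker-local cache all assemble the same matrix.

```python
import math
import threading
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy geometry: panel centroids on a line (not scuff-em data).
CENTROIDS = [0.5 * k for k in range(8)]

def geometric_factor(i, j):
    """Stand-in for a frequency-independent panel-pair integral."""
    return 1.0 / (1.0 + abs(CENTROIDS[i] - CENTROIDS[j]))

def entry(i, j, omega, geo):
    """Combine the cacheable geometric part with a frequency-dependent part."""
    r = abs(CENTROIDS[i] - CENTROIDS[j])
    return geo * math.cos(omega * r)

# Strategy A: one global table shared by all workers, guarded by a lock.
_global_cache = {}
_lock = threading.Lock()

def global_lookup(i, j):
    key = (min(i, j), max(i, j))
    with _lock:  # every worker contends for this lock
        if key not in _global_cache:
            _global_cache[key] = geometric_factor(*key)
        return _global_cache[key]

# Strategy B: no cache at all, recompute on every call.
def recompute_lookup(i, j):
    return geometric_factor(i, j)

# Strategy C: each worker builds its own private cache, so no locking is needed.
def make_local_lookup():
    cache = {}
    def lookup(i, j):
        key = (min(i, j), max(i, j))
        if key not in cache:
            cache[key] = geometric_factor(*key)
        return cache[key]
    return lookup

def assemble(omega, lookup_factory, workers=4):
    """Assemble the full matrix; each worker fills an independent set of rows."""
    n = len(CENTROIDS)
    chunks = [range(w, n, workers) for w in range(workers)]

    def work(rows):
        lookup = lookup_factory()  # one lookup object per worker
        return {i: [entry(i, j, omega, lookup(i, j)) for j in range(n)]
                for i in rows}

    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = list(pool.map(work, chunks))

    merged = {}
    for part in parts:
        merged.update(part)
    return [merged[i] for i in range(n)]

omega = 2.0
shared = assemble(omega, lambda: global_lookup)
uncached = assemble(omega, lambda: recompute_lookup)
local = assemble(omega, make_local_lookup)
```

In this sketch the rows are independent, which is the "embarrassingly parallel" part; only Strategy A introduces a synchronization point. On a GPU, Strategy C would roughly correspond to keeping cached values in per-block or per-device memory, though whether that pays off depends on the memory budget and the cost of the integrals.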

Feel free to continue the discussion, or close the issue if satisfied.

I'm making a GPU version of scuff-em. Thoughts? #164