spcl / open-earth-compiler

development repository for the open earth compiler
https://arxiv.org/abs/2005.13014
Other
72 stars 14 forks source link

LLVM CSE vs (PTX/ROCM) CSE #23

Closed muellch closed 3 years ago

muellch commented 4 years ago

Not performing the common subexpression elimination during the lowering seems to result in very similar performance because the CSE of the PTX compiler compensates.

The open question is which variant produces faster code.