microhh / rte-rrtmgp-cpp

C++ / CUDA implementation of RTE+RRTMGP radiative transfer solver
BSD 3-Clause "New" or "Revised" License
3 stars 19 forks source link

Tuning `combine_and_reorder_2str` #17

Open bartvstratum opened 3 years ago

bartvstratum commented 3 years ago

I'm starting with this kernel, it is currently the most expensive and not yet tuned kernel.