Open valassi opened 2 weeks ago
- there is a high non-ME component (here stil called 'fortran overhead', these are olf timers)
specifically, fortran and cpp have 668s, cuda has 826
- there is a high outside-madevent ('python/bash'? time spent deleting the applications??) component
specifically, fortran has 1945-1910 i.e 35s, cuda has 969-853 i.e. 116s
I have stripped off the python/bash component to #1000 (for cuda but not only!). Instead here I keep only the non-ME madevent component (in cuda).
Yesterday I ran some very first tests of cuda DY+3j with (OLD) timers in PR #948.
The cuda profiles are clearly weird
This is for 500 events