Open slizzered opened 9 years ago
I started working on this. branch: https://github.com/slizzered/haseongpu/tree/issue51-openMP-hostcode
Not sure how necessary this actually is... might introduce code complexity without benefits. See also the commit message: ( https://github.com/slizzered/haseongpu/commit/91625acadf2c623667acb5b404a6138d12a7ff65 )
To gain maximal performance we need to reduce the runtime of our sequential code base. We have two possibilities to gain this reduction:
So, I think some investigation makes sense. But, you are right, it looks a bit weird if computation unrelated code is parallized with OpenMP :hibiscus:
Well, it looks not too weird to me. The problem is rather, that it does not bring any speedup, since the loops are pretty small/fast.
The MATLAB functions seem to be one of the more important problems (really slow...)
There might be some loops that can pose a bottleneck. They might be rather easily parallelized with OpenMP (cmake-file will need to be tweaked)