Closed sudomakeinstall closed 6 years ago
cpd doesn't have any OpenMP support itself, all OpenMP stuff is in fgt. Are you sure your test program is using GaussTransformFgt
and not GaussTransform
?
If you confirm that you are using GaussTransformFgt
and are still seeing incorrect behavior, the problem is most likely in the fgt library and not cpd. Let me know what you find.
(and sorry about the delay, thanks for your patience!)
@gadomski Thanks very much for the response--it looks like I must have missed one of my own steps, because after I deleted, rebuilt, and reinstalled all the packages (FGT, CPD, and my own) it parallelizes as expected. Sorry for the trouble, and thanks again for your work putting this together!
No worries, thank you!
All--
Thanks so much to everyone who has worked on this library! I'm having some difficulty parallelizing the code, however, and was hoping someone might be able to point me in the right direction. Here's what I've done so far:
I've compiled
fgt
with the following options:I've compiled
cpd
with the following options:I've compiled my own project with:
I've set
OMP_NUM_THREADS=$(nproc)
in my~/.bashrc
.I've checked (at the top of my test program) that
8
threads are visible toOpenMP
:Despite these steps, the program still only appears to be using one core at a time when I run a test program:
Any ideas what I might be doing wrong?
Best, and thanks,
--Davis
P.S. I'm using
g++
version 7.2.0 on Ubuntu 16.04.