Open pedramhaqiqi opened 2 months ago
Removing the reductions, to be done serially, given that we are not in a parallel region: 3.0x
We should specify the shared/private property of the used variables instead of letting OMP decide the default. Can you try adding that?
By putting each loop of the iterations in a simple
pragma omp for
with no shared parallel region and performing reductions on diff and page_rank_total achieves a 2.5x speed up from baseline.Precompute section (Gains about 50 iterations on avg)
Main page rank computation
Setting new page rank
Reduction for diff
Reduction for page rank