ponweist closed 10 years ago
Parallelization is now done in this commit.
Output files (*.dat) are identical to those produced by c9be08b2c09a868a9c4da6662cb6c62a7778b87f.
Scaling is good (as expected): runtime is down from 123s to 17s (about 7x faster) for the 16sm case running on 16 processes.
Trace files before: ... and after:
The following loop (kpath.F90) needs to be parallelized: