x10-lang / x10

Core implementation of X10 programming language including compiler, runtime, class libraries, sample programs and test suite
Eclipse Public License 1.0
71 stars 15 forks source link

Agree on the minimum number of steps before computing steps' statistics #18

Closed shamouda closed 8 years ago

shamouda commented 8 years ago

A failure may be noticed in some places few steps after other places, resulting in disagreement between places on the number of executed steps, and consequently a failure in allreduce calls that compute the min/max/sum of the stepTimes list. To fix this bug, places first agree on the minimum number of steps, then use it to size the allreduce buffers.