pmodels / mpich

Official MPICH Repository
http://www.mpich.org

ch4/{ofi,ucx}: Improved error checking for business card exchange #7201

Status: Closed (raffenet closed this 2 weeks ago)

raffenet commented 2 weeks ago

Pull Request Description

Business card exchange in roots-only mode involves communication over a special "init" communicator. Check for errors that may arise during the exchange and, when one occurs, skip the cleanup of the init comm so that the cleanup does not mask the source of the error. Fixes pmodels/mpich#7200.
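For readers unfamiliar with the pattern, here is a minimal sketch in plain C with no MPI dependency. It is not the code from this PR: every name in it (`init_comm_t`, `exchange_over_init_comm`, `destroy_init_comm`, `roots_only_exchange`, the `SKETCH_*` codes, and the `inject_failure` flag) is hypothetical and only illustrates the control flow of "check the exchange result, and on failure return the original error without tearing down the init comm".

```c
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical stand-in for the temporary "init" communicator. */
typedef struct {
    bool created;
    bool destroyed;
} init_comm_t;

/* Hypothetical error codes for this sketch only. */
enum { SKETCH_SUCCESS = 0, SKETCH_ERR_EXCHANGE = 1 };

/* Hypothetical exchange step. `inject_failure` simulates an error
 * during the business card exchange. */
int exchange_over_init_comm(const init_comm_t *comm, bool inject_failure)
{
    if (!comm->created)
        return SKETCH_ERR_EXCHANGE;
    return inject_failure ? SKETCH_ERR_EXCHANGE : SKETCH_SUCCESS;
}

/* Hypothetical cleanup of the init communicator. */
void destroy_init_comm(init_comm_t *comm)
{
    comm->destroyed = true;
}

/* Runs the exchange and returns its error code unchanged.
 * On failure the init comm is deliberately left alone, so that a
 * cleanup problem cannot hide the error that actually occurred. */
int roots_only_exchange(init_comm_t *comm, bool inject_failure)
{
    int err;

    comm->created = true;
    comm->destroyed = false;

    err = exchange_over_init_comm(comm, inject_failure);
    if (err != SKETCH_SUCCESS) {
        fprintf(stderr,
                "business card exchange failed (code %d); "
                "skipping init comm cleanup\n", err);
        return err;
    }

    /* Only the success path tears down the init comm. */
    destroy_init_comm(comm);
    return SKETCH_SUCCESS;
}
```

In this sketch, calling `roots_only_exchange(&comm, true)` returns `SKETCH_ERR_EXCHANGE` and leaves `comm.destroyed` false, while the success path returns `SKETCH_SUCCESS` and marks the communicator destroyed.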

raffenet commented 2 weeks ago

test:mpich/ch4/most

colleeneb commented 2 weeks ago

This is ok to try testing on Aurora now, correct?

raffenet commented 2 weeks ago

> This is ok to try testing on Aurora now, correct?

Yes, this branch is OK to test now.