ofiwg / libfabric

Open Fabric Interfaces
http://libfabric.org/
Other
527 stars 369 forks source link

prov/efa: Queue txes when handshake is enforced but not made #10115

Closed shijin-aws closed 1 week ago

shijin-aws commented 1 week ago

Currently, when a handshake is required but not made for a request to a given peer, we always return EAGAIN, which caused nasty user experience during the startup stage. This PR contains a series of commits that allows queuing the requests (when it's not for inject), and return 0 in such situation.

shijin-aws commented 1 week ago

Latest push fixed unit test failure and added new unit tests to cover the changes

zachdworkin commented 1 week ago

@shijin-aws PR had the ucx rdm_tagged_peek test failure which was removed from #10124. If you rebase and re-push it will fix it.