Closed krishung5 closed 3 months ago
@rmccorm4 That's a great point! I'm not sure about the details, so I'll let David or Matt comment. It's passing the PA pipeline, so I hope this doesn't break anything further.
Triton does not guarantee the order in which outputs are returned in the response. The order in which outputs appear in the response depends on the order in which the backend added them. The output name should be used as the identifier for retrieving results.
We were seeing the L0_long_running_stress test fail because an output did not match the expected output.
This seems to be because the order of `infer_data_.outputs_` and `infer_data_.expected_outputs_` might be different, so we need to make sure each output is validated against the expected output with the same name. Shout out to @rmccorm4, who quickly came up with the solution! This PR is based on https://github.com/triton-inference-server/client/pull/685
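For readers unfamiliar with the issue, here is a minimal sketch of name-based validation. It is not the code from this PR: `NamedOutput` and `OutputsMatchByName` are hypothetical names invented for illustration, and the real client types hold more than a name and a byte buffer. The idea is to index the returned outputs by name and look each expected output up in that index instead of comparing position by position.

```cpp
#include <cstdint>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for one named output tensor (not the real client type).
struct NamedOutput {
  std::string name;
  std::vector<uint8_t> data;
};

// Returns true when every expected output has a same-named actual output
// with identical bytes, regardless of the order of either vector.
bool
OutputsMatchByName(
    const std::vector<NamedOutput>& outputs,
    const std::vector<NamedOutput>& expected_outputs)
{
  // Index the actual outputs by name so lookup does not depend on position.
  std::unordered_map<std::string, const NamedOutput*> by_name;
  for (const auto& output : outputs) {
    by_name[output.name] = &output;
  }

  for (const auto& expected : expected_outputs) {
    const auto it = by_name.find(expected.name);
    // A missing name or differing bytes both count as a validation failure.
    if (it == by_name.end() || it->second->data != expected.data) {
      return false;
    }
  }
  return true;
}
```

With this approach, a response that returns `OUTPUT1` before `OUTPUT0` still validates against expected data listed as `OUTPUT0`, `OUTPUT1`, whereas an index-by-index comparison would report a mismatch.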