Hi, I'd like to know that for example, when enabling model concurrency = 2, does Tritonserver run 2 streams for processing requests? And verses that, if using dynamic batch, is there just 1 stream, and all requests are packed into one batch? And how is model instance related to them?
Hi, I'd like to know that for example, when enabling
model concurrency = 2
, does Tritonserver run 2 streams for processing requests? And verses that, if using dynamic batch, is there just 1 stream, and all requests are packed into one batch? And how is model instance related to them?