Originally posted by varunsh-xilinx September 13, 2022
If there are two workers with different batch sizes, can the server dynamically send requests to the right one based on some criteria.
Can a single worker accept a range of batch sizes?
The batcher's timeout is statically configured currently. If the worker is busy for example, the batcher may as well hold on to the batch and try to put more together to improve throughput.
Originally posted by varunsh-xilinx September 13, 2022