I am sending 100 data samples in a single request (1 request with data containing 100 examples)
When I set adaptive batching to these values:
max_batch_time = 0.25
max_batch_size = 8
then request gets processed in 3.5 seconds, but if do not set these two parameters (i.e. disable adaptive batching), then the same request takes 8 seconds.
This looks strange, as in my opinion adaptive batching should not have impact when making a single request.
Hi!
I am sending 100 data samples in a single request (1 request with
data
containing 100 examples)When I set adaptive batching to these values:
then request gets processed in 3.5 seconds, but if do not set these two parameters (i.e. disable adaptive batching), then the same request takes 8 seconds. This looks strange, as in my opinion adaptive batching should not have impact when making a single request.