I'm running demo/question-answering/triton_client.py from the examples directory. The script returns expected result with batch_size=1. However, if you make the batch_size > 1 in this line, it outputs only the result of the first element in the batch and other elements are ignored.
I saw #84 and #106 about the question-answering example and batch_size but I don't think they are related to this. The triton server does not yield in any errors.
I'm running
demo/question-answering/triton_client.py
from the examples directory. The script returns expected result withbatch_size=1
. However, if you make thebatch_size
> 1 in this line, it outputs only the result of the first element in the batch and other elements are ignored.I saw #84 and #106 about the question-answering example and batch_size but I don't think they are related to this. The triton server does not yield in any errors.
Am I missing something here?