triton-inference-server / server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.
https://docs.nvidia.com/deeplearning/triton-inference-server/user-guide/docs/index.html
BSD 3-Clause "New" or "Revised" License

fix: usage of ReadDataFromJson in array tensors #7624

Closed v-shobhit closed 1 month ago

v-shobhit commented 2 months ago

What does the PR do?

The generate and generate_stream endpoints fail when the TRT-LLM backend is queried directly with input tokens. This is because HTTPAPIServer::GenerateRequestClass::ExactMappingInput does not pass the correct size of an array input to ReadDataFromJson.

This PR also fixes https://github.com/triton-inference-server/tensorrtllm_backend/issues/369

Checklist

Commit Type:

Check the conventional commit type box here and add the label to the GitHub PR.

Related PRs:

Where should the reviewer start?

Test plan:

Added a new test case to L0_http job. Internal CI pipeline id: 18800660

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

pskiran1 commented 1 month ago

LGTM. Just to confirm: will the added test case fail without this http_server.cc change?

@GuanLuo, thanks for the note. The test case passed with both the old and the new code (with the http_server.cc change). I had to undo commit 229e5e85eeef481916d8bf67f24bcf9dfcb68b25, since I believe this bug does not occur for the String data type. Could you please review the latest code (CI) and approve it?

Now the test case fails on 24.09 with the old code and passes with the new code (including the http_server.cc change).