Open eero-t opened 3 days ago
Same error also with the Git HEAD version of everything.
TEI's limitation requires inputs to be less than 512 tokens. This issue occurs when the length of retrieved documents exceeds this limit. To address this, we can implement a workaround in the retrieval microservice to ensure that the length of retrieved documents is limited to under 512 tokens.
> TEI's limitation requires inputs to be less than 512 tokens. This issue occurs when the length of retrieved documents exceeds this limit.
Yes, that was clear from the error message.
> To address this, we can implement a workaround in the retrieval microservice to ensure that the length of retrieved documents is limited to under 512 tokens.
Thanks, I think that's acceptable, assuming it's really "... limited to the configured max number of tokens".
PS. Seeing "Internal Server Error" for trivial input variations like this indicates that testing for potential errors is not yet at the level I would expect in my own projects. :-)
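For illustration, the workaround discussed above could look roughly like the sketch below. It is not the actual retrieval microservice code: `truncate_docs_for_rerank`, `reserved_special` and the whitespace tokenizer are hypothetical stand-ins, and a real fix would need to count tokens with the reranking model's own tokenizer. The sketch assumes the query and the document are counted together against the limit, and takes the limit as a parameter rather than hard-coding 512.

```python
from typing import Callable, List


def truncate_docs_for_rerank(
    query: str,
    docs: List[str],
    max_tokens: int = 512,       # configured limit; 512 is the value from the error message
    reserved_special: int = 3,   # assumption: room for special/separator tokens
    tokenize: Callable[[str], List[str]] = str.split,      # stand-in tokenizer (whitespace)
    detokenize: Callable[[List[str]], str] = " ".join,     # stand-in inverse of tokenize
) -> List[str]:
    """Trim each document so that query + document stays strictly below max_tokens."""
    query_len = len(tokenize(query))
    # "less than max_tokens" is strict, hence the extra -1.
    budget = max_tokens - 1 - reserved_special - query_len
    if budget <= 0:
        raise ValueError("query alone leaves no room for a document within the token limit")

    trimmed = []
    for doc in docs:
        tokens = tokenize(doc)
        if len(tokens) <= budget:
            trimmed.append(doc)  # short enough: pass through unchanged
        else:
            trimmed.append(detokenize(tokens[:budget]))  # keep only the leading tokens
    return trimmed
```

Cutting off the tail of a document loses information, so splitting documents into smaller chunks at ingestion time may be the better long-term fix. Either way, the service should ideally return a descriptive client error instead of "Internal Server Error" when the limit is still exceeded.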
Built & ran v0.6 of Xeon ChatQnA, following these instructions: https://github.com/opea-project/GenAIExamples/blob/main/ChatQnA/kubernetes/manifests/README.md
After running the verification query, changed one character in the query message (2023 -> 2022):
$ curl http://${chatqna_svc_ip}:8888/v1/chatqna -H "Content-Type: application/json" -d '{"messages": "What is the revenue of Nike in 2022?"}'
And got:
Internal Server Error
ChatQnA service log shows:
Reranking service log:
tei-reranking:
2024-06-25T18:14:59.914279Z ERROR rerank:predict{inputs=("What is the revenue of Nike in 2022?", ... }: text_embeddings_core::infer: core/src/infer.rs:364: Input validation error: `inputs` must have less than 512 tokens. Given: 545