[ ] An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
[ ] My own task or dataset (give details below)
Reproduction
Build Image via Docker
Run triton server successfully.
Send request to triton server as the example.
curl -X POST localhost:9000/v2/models/ensemble/generate -d '{"text_input": "What is machine learning?", "max_tokens": 20, "bad_words": "", "stop_words": ""}'
Expected behavior
Triton server should process the request correctly.
actual behavior
[TensorRT-LLM][ERROR] Encountered an error in forward function: std::bad_cast
[TensorRT-LLM][ERROR] Encountered error for requestId 1804289384: Encountered an error in forward function: std::bad_cast
[TensorRT-LLM][WARNING] Step function failed, continuing.
System Info
Who can help?
@juney-nvidia @kaiyux
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Expected behavior
Triton server should process the request correctly.
actual behavior
additional notes
I find ERROR message while starting the server.