Open powerpistn opened 1 month ago
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 15 days."
You’ll have to rewrite the ensemble to accept these parameters. It’s not worth the effort.
Execute the following three steps to start the model: 1) python3 convert_checkpoint.py, 2) trtllm-build, 3) Use tritonserver to start the model. When requesting the localhost:8000/v2/models/ensemble/generate interface, the input is { "text_input": "Introduce yourself", "max_tokens": 2048 }, I want to use the following form: { "messages":[ {"role":"user", "content":"Introduce yourself" } ], "max_tokens":2048 }, is it available?