bentoml / OpenLLM

Run any open-source LLMs, such as Llama and Mistral, as an OpenAI-compatible API endpoint in the cloud.
https://bentoml.com
Apache License 2.0

bug: Incorrect return type for Dolly-v2 model #410

Closed · ABHISHEK03312 closed this issue 1 year ago

ABHISHEK03312 commented 1 year ago

Describe the bug

In the dolly_v2 configuration, the return statement looks up the key "generated_text" in the first element of the result. However, no such key exists, since the returned result is a plain string.

What seems to work is changing the return statement in `configuration_dolly_v2.py` from `return generation_result[0]['generated_text']` to `return generation_result[0]`.
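
To make the mismatch concrete, here is a minimal sketch; the literal values are illustrative, and only the `generation_result` variable and the `'generated_text'` key come from the code in question:

```python
# Shape the original return statement assumes: a list of dicts,
# as Hugging Face text-generation pipelines commonly return.
generation_result = [{"generated_text": "Hello world"}]
print(generation_result[0]["generated_text"])  # -> Hello world

# Shape actually observed here: a list of plain strings.
generation_result = ["Hello world"]
print(generation_result[0])  # -> Hello world (the proposed fix)

# Indexing a string with a key fails, which is the reported bug:
# generation_result[0]["generated_text"]
# TypeError: string indices must be integers
```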

To reproduce

No response

Logs

No response

Environment

bentoml, openllm, requests, python=3.10

System information (Optional)

No response

aarnphm commented 1 year ago

I have now unified the generation logic, and dolly-v2 should yield text the same way the gpt-neox model does.

For reference, I recommend using Mistral-based models from now on, with the following:

```python
import asyncio
import openllm

llm = openllm.LLM('HuggingFaceH4/zephyr-7b-alpha')

async def main():
    # generate() is a coroutine, so in a plain script it must be awaited
    # inside an event loop (top-level await works in Jupyter/IPython).
    print(await llm.generate("The time in San Francisco is"))

asyncio.run(main())
```
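
As a side note (not part of the original reply): per the project description, the same model can also be served as an OpenAI-compatible endpoint from the CLI, e.g. `openllm start HuggingFaceH4/zephyr-7b-alpha`; the exact command shape may vary between OpenLLM versions.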