I appreciate the great work achieved with Ollama, and I see a lot of potential uses.
However, in my testing I found that the generate and chat functions take far too long to respond when I set format="json" and stream=False. Speed is normal in non-JSON mode with streaming enabled. Please advise. Many thanks.