A new keep_alive parameter has been added to allow for how long the model will stay loaded into memory following the request.
Would it be possible to allow for this configuration to be passed via the extension? More than happy to contribute to this change, if pointed in the right direction.
With the latest release of Ollama v0.1.23: https://github.com/ollama/ollama/releases/tag/v0.1.23
A new
keep_alive
parameter has been added to allow for how long the model will stay loaded into memory following the request.Would it be possible to allow for this configuration to be passed via the extension? More than happy to contribute to this change, if pointed in the right direction.