I added a custom wrapper for fetch to control the headersTimeout value. The default value is 5 minutes, and slower instances of ollama can take longer than that, so the fetch fails. The following environment variable allows you to up it (the default is still 5 min).
I added a custom wrapper for fetch to control the
headersTimeout
value. The default value is 5 minutes, and slower instances of ollama can take longer than that, so the fetch fails. The following environment variable allows you to up it (the default is still 5 min).There's more info on this in issue 628. I built this locally and tested it against my ollama instance. With the default timeout:
Hard stop at 5 min. The inference job fails.
Here is the same job after upping the fetch timeout.
One other small tweak: The docker-compose.dev.yml file needed the env file declared under the worker service.
Thanks for making this useful app.