The timeout setting of api_server and runner is not working in bentoml.
Describe the bug
The timeout settings for api_server and the runners are not taking effect in BentoML. I'm using bentoml version 1.0.20.post11. The default configuration is as follows:
I changed the api_server timeout and the runners timeout, but the new values are not applied. When I write service.py as below, run bentoml serve, and test it, the service does not behave as expected.
I tried changing the default configuration: I wrote a configuration.yaml to override it, and I confirmed that the values changed by printing the config settings.
However, the timeouts still do not take effect. Please help me.
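One thing that can be ruled out is the override file not being visible to the process that runs bentoml serve. The following is a minimal sketch, assuming the override file is passed through the BENTOML_CONFIG environment variable. The helper name describe_config_override is hypothetical and not part of BentoML. It only checks that the variable is set and that the file exists; it does not read or validate the timeout values.

```python
import os
from pathlib import Path


def describe_config_override(environ=None):
    """Report whether BENTOML_CONFIG points at an existing file.

    Hypothetical helper for illustration only; it is not a BentoML API.
    """
    # Fall back to the real process environment when no mapping is given.
    env = os.environ if environ is None else environ
    path = env.get("BENTOML_CONFIG")
    if not path:
        return "BENTOML_CONFIG is not set; only the default configuration applies"
    if not Path(path).is_file():
        return f"BENTOML_CONFIG points to a missing file: {path}"
    return f"BENTOML_CONFIG override file found: {path}"


if __name__ == "__main__":
    # Run this in the same shell that is used for bentoml serve.
    print(describe_config_override())
```

Running it in the same shell as bentoml serve shows whether the server process would see the override file at all.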
To reproduce
No response
Expected behavior
No response
Environment
bentoml: 1.0.20.post11
python: 3.10.12