ahmetoner / whisper-asr-webservice

OpenAI Whisper ASR Webservice API
https://ahmetoner.github.io/whisper-asr-webservice
MIT License
1.99k stars, 357 forks

Flush VRAM when idle #216

Open ww2283 opened 5 months ago

ww2283 commented 5 months ago

At present, the model stays loaded in VRAM indefinitely. It would be helpful if VRAM were freed automatically after a configurable period of inactivity. Ollama already behaves this way: it unloads the model from VRAM after five minutes without requests.
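To make the request concrete, here is a rough sketch of the idle-timeout behaviour described above. It is not taken from this project's code: `IdleUnloader`, `load`, `idle_seconds`, and `on_unload` are hypothetical names chosen for illustration. The idea is to load the model lazily on the first request, restart a timer after each request, and drop the model reference once the timer expires without further use.

```python
import threading
from typing import Any, Callable, Optional


class IdleUnloader:
    """Hypothetical helper: loads a model lazily and drops it after an idle period."""

    def __init__(
        self,
        load: Callable[[], Any],
        idle_seconds: float,
        on_unload: Optional[Callable[[], None]] = None,
    ) -> None:
        self._load = load                  # assumed: returns a ready-to-use model
        self._idle_seconds = idle_seconds  # e.g. 300 for five minutes
        self._on_unload = on_unload        # assumed hook for extra cleanup
        self._lock = threading.Lock()
        self._model: Any = None
        self._timer: Optional[threading.Timer] = None
        self._generation = 0               # bumped on every use; stale timers compare against it

    @property
    def loaded(self) -> bool:
        with self._lock:
            return self._model is not None

    def use(self, fn: Callable[[Any], Any]) -> Any:
        """Run fn(model), loading the model first if it is not resident."""
        with self._lock:
            self._generation += 1
            if self._timer is not None:
                self._timer.cancel()
            if self._model is None:
                self._model = self._load()
            try:
                return fn(self._model)
            finally:
                # Restart the idle countdown after every request.
                self._timer = threading.Timer(
                    self._idle_seconds, self._unload, args=(self._generation,)
                )
                self._timer.daemon = True
                self._timer.start()

    def _unload(self, generation: int) -> None:
        with self._lock:
            # Skip if a newer request arrived after this timer was scheduled.
            if generation != self._generation or self._model is None:
                return
            self._model = None
            if self._on_unload is not None:
                self._on_unload()
```

With a PyTorch-backed model, I would expect the `on_unload` hook to call `gc.collect()` and `torch.cuda.empty_cache()` so the cached GPU memory is actually returned to the driver, but that part is an assumption about how the model is held. The trade-off is that the first request after an idle period pays the model load time again.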