ShannonAI / service-streamer

Boosting your Web Services of Deep Learning Applications.
Apache License 2.0

Support loading multiple models on a single GPU #58

Closed Windrill closed 4 years ago

Windrill commented 4 years ago

By default, a TensorFlow model reserves as much GPU memory as it can when it loads. If multiple models are loaded at the same time, they compete for that memory and none of them can start, so they must be loaded one by one. The "warm_up" option in the Streamer object can force models to load one by one.
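To illustrate the "load one by one" idea, here is a minimal sketch that starts workers sequentially and waits for each one to signal readiness before starting the next. It is not service-streamer's implementation: `start_workers_one_by_one`, `fake_load`, and the `timeout` parameter are hypothetical names, threads stand in for the worker processes, and a sleep stands in for loading a model onto the GPU.

```python
import threading
import time


def fake_load(seconds):
    """Hypothetical stand-in for loading a model onto the GPU."""
    def load():
        time.sleep(seconds)
    return load


def start_workers_one_by_one(loaders, timeout=30.0):
    """Start one worker per (name, load_fn) pair, strictly in sequence.

    The next worker is started only after the previous one has set its
    'ready' event, so no two loads ever overlap.
    Returns (threads, log), where log records start/ready events in order.
    """
    log = []
    lock = threading.Lock()
    threads = []

    def worker(name, load_fn, ready):
        with lock:
            log.append(("start", name))
        load_fn()  # assumption: this is where the model would be loaded
        with lock:
            log.append(("ready", name))
        ready.set()  # tell the parent this model is fully loaded
        # a real worker would enter its request loop here

    for name, load_fn in loaders:
        ready = threading.Event()
        thread = threading.Thread(
            target=worker, args=(name, load_fn, ready), daemon=True
        )
        thread.start()
        # Block until this worker has finished loading (or give up).
        if not ready.wait(timeout):
            raise TimeoutError(f"worker {name!r} was not ready after {timeout}s")
        threads.append(thread)

    return threads, log
```

Calling `start_workers_one_by_one([("model_a", fake_load(0.05)), ("model_b", fake_load(0.01))])` finishes loading `model_a` before `model_b` starts, even though `model_b` loads faster. The cost is that total startup time becomes the sum of the individual load times.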