datarobot / datarobot-user-models

User's custom models boilerplate

Test vLLM env in multi-process mode #1140

Open elatt opened 4 weeks ago

elatt commented 4 weeks ago

This repository is public. Do not post any private DataRobot or customer data here: code, datasets, model artifacts, etc.

Summary

Rationale

The GPU predictors would benefit from more workers: the underlying vLLM server can handle more concurrent requests, but DRUM currently limits how many reach it.
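To illustrate the bottleneck described above, here is a minimal sketch in plain Python. It does not use DRUM or vLLM APIs: `FakeBackend` and `peak_concurrency` are hypothetical names, the backend is a stand-in that simply sleeps, and threads stand in for front-end workers. It only shows that the number of front-end workers caps how many requests the backend ever sees at once.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


class FakeBackend:
    """Hypothetical stand-in for a server that can serve many requests at once."""

    def __init__(self):
        self._lock = threading.Lock()
        self._in_flight = 0
        self.peak_in_flight = 0

    def handle(self, prompt):
        # Record how many requests are being served simultaneously.
        with self._lock:
            self._in_flight += 1
            self.peak_in_flight = max(self.peak_in_flight, self._in_flight)
        time.sleep(0.05)  # simulated inference latency
        with self._lock:
            self._in_flight -= 1
        return prompt.upper()


def peak_concurrency(front_end_workers, num_requests=8):
    """Send requests through a front end with a fixed number of workers
    and return the highest concurrency the backend observed."""
    backend = FakeBackend()
    prompts = [f"req-{i}" for i in range(num_requests)]
    with ThreadPoolExecutor(max_workers=front_end_workers) as pool:
        list(pool.map(backend.handle, prompts))
    return backend.peak_in_flight


if __name__ == "__main__":
    print("1 worker :", peak_concurrency(1))
    print("4 workers:", peak_concurrency(4))
```

With a single worker the backend never sees more than one request in flight, no matter how much concurrency it could support; raising the worker count raises that ceiling.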