Open elatt opened 4 weeks ago
This repository is public. Do not post any private DataRobot or customer data here: code, datasets, model artifacts, etc.
Summary
Rationale
The GPU predictors would benefit from more workers: the underlying vLLM server can handle more concurrent requests, but DRUM currently prevents increasing the worker count.