Open shaunaa126 opened 3 months ago
Right now multi-model sessions are limited to only support full GPU offload. Any plans for supporting CPU offload so that we are able to run multiple models that uses VRAM across CPU and GPU?
Right now multi-model sessions are limited to only support full GPU offload. Any plans for supporting CPU offload so that we are able to run multiple models that uses VRAM across CPU and GPU?