flyteorg / flyte

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
https://flyte.org
Apache License 2.0
5.17k stars 550 forks source link

[Integration] NIM #5478

Open samhita-alla opened 2 weeks ago

samhita-alla commented 2 weeks ago

The NIM plugin should allow serving the model within a task as a sidecar service. This can be especially useful for batch inference, as the model can be spun up once in a local network and accessed directly with no network overhead. The plugin eliminates Docker hassles and the need to manually serve the model.