epam / ai-stacks

Home for Kubeflow Infrastructure as Code templates
Apache License 2.0

Azure ML Serving Stack: T5 Model Deployment #13

Closed by oginskis 1 year ago

oginskis commented 1 year ago

Create an inference endpoint that serves the T5 model. This could be a single component or multiple components. The following capabilities should be provided:

oginskis commented 1 year ago

Decided to use the KServe example MNIST model: https://github.com/epam/hub-kubeflow-stacks/commit/3337e9621354fc52f8d98db4c17c31a153738e26
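
For reference, a minimal sketch of how a prediction request to a KServe-served model could be assembled, assuming the model is exposed through the KServe V1 inference protocol (`POST /v1/models/<name>:predict` with an `instances` list). The host `mnist.example.com`, the model name `mnist`, and the flat 784-value input are illustrative assumptions, not values taken from the linked commit.

```python
import json


def build_predict_request(host, model_name, instances):
    """Build the URL and JSON body for a KServe V1 protocol predict call.

    host and model_name are hypothetical placeholders; substitute the
    values of the actual InferenceService.
    """
    url = f"http://{host}/v1/models/{model_name}:predict"
    body = json.dumps({"instances": instances})
    return url, body


# Assumed input: one flattened 28x28 image filled with zeros.
url, body = build_predict_request("mnist.example.com", "mnist", [[0.0] * 784])
print(url)
```

The returned URL and body can then be sent with any HTTP client; the actual input shape depends on how the example model was exported.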

oginskis commented 1 year ago

Will try to serve T5 when I am less busy with client projects.

oginskis commented 1 year ago

Outdated