kserve / modelmesh

Distributed Model Serving Framework
Apache License 2.0

how about using modelmesh to serve thousands of stable diffusion models #95

Closed Jack47 closed 9 months ago

Jack47 commented 1 year ago

I want to use ModelMesh to serve thousands of stable diffusion models. Any advice would be appreciated~

  1. I'm using Triton as the serving runtime. Inference time is about 3~10s.
  2. I'm using ensembles in Triton to handle business logic like auditing and watermarking; these may become standalone services in the future.
  3. Currently every model has its own k8s Service and Ingress rules.
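For reference, deploying one of those models under ModelMesh typically means creating an `InferenceService` with the ModelMesh deployment-mode annotation instead of a per-model Service and Ingress. A minimal sketch, assuming a MinIO storage secret key of `localMinIO` and hypothetical model/runtime names:

```yaml
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: sd-model-001                      # hypothetical model name
  annotations:
    serving.kserve.io/deploymentMode: ModelMesh
spec:
  predictor:
    model:
      modelFormat:
        name: onnx                        # assumption: adjust to what your Triton runtime accepts
      storage:
        key: localMinIO                   # key into the storage-config secret (assumption)
        path: sd-models/model-001         # hypothetical path in the bucket
```

ModelMesh then loads and unloads models across the shared runtime pods on demand, rather than keeping a dedicated deployment per model.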

Goals:

  1. achieve higher cluster resource utilization, especially for GPUs
  2. keep the latency of every inference request as low as possible

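One consequence of the setup above is that clients no longer need per-model endpoints: all models sit behind a single ModelMesh service, and the target model is selected by name in the KServe V2 REST path. A small sketch of building such a request, where the service host, model name, and input tensor name are all assumptions for illustration:

```python
import json

def build_v2_request(prompt: str) -> dict:
    """Build a KServe V2 inference request body for a text prompt."""
    return {
        "inputs": [
            {
                "name": "PROMPT",     # input tensor name (assumption)
                "shape": [1],
                "datatype": "BYTES",
                "data": [prompt],
            }
        ]
    }

def infer_url(base: str, model_name: str) -> str:
    # All models share one endpoint; the path picks the model.
    return f"{base}/v2/models/{model_name}/infer"

# Hypothetical service host and model name:
url = infer_url("http://modelmesh-serving:8008", "sd-model-001")
body = build_v2_request("a watercolor painting of a lighthouse")
print(url)
print(json.dumps(body))
```

Sending `body` as JSON via an HTTP POST to `url` (e.g. with `requests.post`) would then route the request to whichever runtime pod currently holds the model.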
ckadner commented 10 months ago

@Jack47 -- were you able to use ModelMesh-Serving for your stable diffusion models? Did you run into any specific issues?

Wikipedia thinks it should look like this :-)

(image attachment)

Jack47 commented 9 months ago

Currently we don't use ModelMesh. Thanks for your response.