os-climate / os_c_data_commons

Repository for Data Commons platform architecture overview, as well as developer and user documentation
Apache License 2.0
20 stars 10 forks source link

Deploy ODH Seldon #95

Open erikerlandson opened 2 years ago

erikerlandson commented 2 years ago

workloads like the corporate data stream ("nlp/pdf") can probably be modified so that they are deployable via Seldon s2i and model management. To explore this it will help to have ODH seldon deployed on the data commons cluster.

cc @MichaelClifford @ChristianMeyndt @HumairAK @redmikhail

erikerlandson commented 2 years ago

basic concept would be the seldon custom model mode: https://github.com/SeldonIO/seldon-core#deploy-your-custom-model-using-language-wrappers

In this scenario something like a pdf object would be pickled and passed over using jsonData format or binData format and the predict function unpacks it and runs the detection model, and returns the resulting output as return json instead of outputting a csv file.

see also this batched-pipeline example workflow: https://medium.com/analytics-vidhya/automated-data-pipeline-using-ceph-notifications-and-kserving-5e1e9b996661