opendatahub-io / caikit-tgis-serving

Apache License 2.0
17 stars 39 forks source link

Caikit-TGIS-Serving

Caikit-TGIS-Serving is a stack that allows data scientists to perform Large Language Model (LLM) inference.

The Caikit-TGIS-Serving stack consists of these components:

Architecture of the stack

KServe+Knative+Istio+Caikit_TGIS Diagram

Installation

The procedures for installing and deploying the Caikit-TGIS-Serving stack have been tested with Red Hat OpenShift Data Science self-managed on Red Hat OpenShift Service for AWS (ROSA) and OpenShift Dedicated clusters. They have not been tested with the OpenShift Data Science managed cloud service.

Prerequisites

Procedures

As of Red Hat OpenShift Data Science version 2.5.0, you can follow the official docs here for up-to-date installation instructions.

For RHODS<2.5.0 and ODH, there are two ways to install the KServe/Caikit/TGIS stack:

Demos

After you install the KServe/Caikit/TGIS stack, you can try these demos: