[Work in Progress] A reference architecture for performing Generative AI Operations (aka GenAIOps) using Kubernetes, with open source tools
For installation, follow the steps provided in the setup doc
GOKU uses a customizable Argo Workflows template to download models from Hugging Face and ingest them into MLFlow.
Distributed RAG Experimentation Framework (DREAM) presents a kubernetes native architecture and sample code to demonstrate how Retrieval Augmented Generation experiments, evaluation and tracking can be conducted in a distributed manner using Ray, LlamaIndex, Ragas, MLFlow and MinIO. Checkout the DREAM README for details
MLBakery is a subproject aimed at creating lightweight base images with AI/ML artefacts, especially small language models and embedding models, for easier portability. Some of the pre-built images are released as packages that can be found in the parent project's ghcr. Ideally, though, you will use the scripts to build your own images :) See the README for more details.
(WIP)
(WIP)
(WIP)
(WIP)