KubedAI / spark-rapids-on-kubernetes

Accelerating Data processing workloads on GPUs with Spark-RAPIDS
https://kubedai.github.io/spark-rapids-on-kubernetes/
Apache License 2.0
6 stars 1 forks source link

✨ Spark RAPIDS on Kubernetes πŸš€βš‘

🌟 Overview

Welcome to Spark RAPIDS on Kubernetes, where we accelerate data processing workloads using the power of NVIDIA GPUs and Apache Spark, all running in a Kubernetes environment. πŸš€

By leveraging the RAPIDS cuDF library, this project showcases how you can drastically speed up Spark jobs with GPU accelerationβ€”while saving on costsβ€”all without changing your existing Spark code. This is ideal for scaling AI, machine learning, and data processing workloads.

πŸ“¦ Repo Features

πŸ’Ό Why Spark RAPIDS?

βš™οΈ Spark RAPIDS Features

πŸ“‚ Project Structure

Here’s what you'll find in the project:

.
β”œβ”€β”€ README.md          # Project overview and instructions
β”œβ”€β”€ benchmarks         # Bechmarks scripts
β”œβ”€β”€ docker             # Sample Dockerfiles
β”œβ”€β”€ examples           # Spark RAPIDS job examples
β”œβ”€β”€ infra              # Terraform scripts for cluster setup
β”œβ”€β”€ scripts            # Utility scripts for deployment
β”œβ”€β”€ website            # Docusaurus documentation website
β”œβ”€β”€ ADOPTERS.md        # List of project adopters
β”œβ”€β”€ CONTRIBUTING.md    # Guidelines for contributing to the project
β”œβ”€β”€ LICENSE            # License information

πŸ“š Learn More

For detailed guides, examples, and best practices on using Spark RAPIDS, check out:

To learn more and dive into detailed examples, check out NVIDIA's documentation and explore the spark-rapids-examples.

πŸš€ Getting Started

Ready to dive in? Follow the installation guide to set up your Kubernetes environment with Spark RAPIDS, Karpenter, and the Spark Operator.

Let’s accelerate your data processing workflows and save on costs with GPU-accelerated Spark jobs! πŸ’‘πŸ’»

🀝 Support

This project is free to use, and we'd love to see contributions from the community! If you have any questions, feel free to raise an issue on GitHub or provide feedback.