RBIR
stands for Rewrite Bigdata in Rust. RBIR aims to create a big data ecosystem using Rust.
This project declares our manifesto and serves as a collection of RBIR projects and posts for anyone interested in joining this journey.
A high-performance accelerator for Apache Spark, built on top of the powerful Apache DataFusion query engine.
A high-performance, distributed, cloud native time-series database.
A distributed stream processing engine written in Rust, designed to efficiently perform stateful computations on streams of data.
The Blaze accelerator for Apache Spark leverages native vectorized execution to accelerate query processing.
A distributed query engine for large-scale data processing in Python and is implemented in Rust.
An open-source cloud data warehouse that serves as a cost-effective alternative to Snowflake
Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.
An analytics DBMS for distributed data.
An open-source, cloud-native, unified time series database for metrics, logs and events with SQL/PromQL supported.
An open-source database for vector-search built with persistent storage, which greatly simplifies retrieval, filtering and management of embeddings.
An Elasticsearch alternative built on Postgres.
Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo
A Postgres-compatible SQL database engineered to provide the simplest and most cost-efficient approach for processing, analyzing, and managing real-time event streaming data
A cloud native embedded storage engine built on object storage.
Distributed transactional key-value database, originally created to complement TiDB
The leading open source time series database for metrics, events, and real-time analytics.
Native Rust implementation of Apache Arrow
Rust implementation of Apache Avro
An extensible query engine written in Rust that uses Apache Arrow as its in-memory format.
Rust implementation of Apache Hudi
Rust implementation of Apache Iceberg
A unified data access layer, empowering users to seamlessly and efficiently retrieve data from diverse storage services.
Rust implementation of Apache ORC
Rust implementation of Apache Paimon
Rust implementation of Apache Parquet