RBIR
stands for Rewrite Bigdata in Rust. RBIR aims to create a big data ecosystem using Rust.
This project declares our manifesto and serves as a collection of RBIR projects and posts for anyone interested in joining this journey.
- tikv: Distributed transactional key-value database.
- databend: a rust data warehouse, alternative to Snowflake.
- quickwit: a rust search engine, alternative to Elasticsearch.
- risingwave: a rust streaming processing engine.
- Apache DataFusion: a fast, extensible query engine built in rust.
- influxdb: scalable datastore for metrics, events, and real-time analytics.
- greptimedb: time series database for metrics, logs and events.
- Apache HoraeDB (incubating): a high-performance, distributed, cloud native time-series database.
- paradedb: Postgres for Search and Analytics
- glaredb: An analytics DBMS for distributed data
- fluvio: Lean and mean distributed stream processing system
- lancedb: Developer-friendly, database for multimodal AI
- slatedb: A cloud native embedded storage engine built on object storage
- daft: Distributed DataFrame for Python designed for the cloud, powered by Rust
- arrow-rs: Rust implementation of Apache Arrow.
- iceberg-rust: Rust implementation of Apache Iceberg.
- paimon-rust: Rust implementation of Apache Paimon.
- hudi-rs: Rust implementation of Apache Hudi.
- parquet-rs: Rust implementation of Apache Parquet.
- avro-rust: Rust implementation of Apache Avro.
- orc-rs: Rust implementation of Apache ORC.
- Apache OpenDAL provides python, nodejs, java, go bindings.
- Apache Iceberg is now working on building rust core for pyiceberg.
- Apache Paimon is going to build paimon-py by its rust core.
- Apache DataFusion Comet is a high-performance accelerator for Apache Spark.
- blaze: The Blaze accelerator for Apache Spark leverages native vectorized execution to accelerate query processing.
- Apache Uniffle is working on it's rust shuffle server at incubator-uniffle/rust/experimental/server.