Starred repositories
Metadata driven Databricks Delta Live Tables framework for bronze/silver pipelines
Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
An extremely fast Python package and project manager, written in Rust.
🦀 Small exercises to get you used to reading and writing Rust code!
A curated list of Rust code and resources.
A native Rust library for Delta Lake, with bindings into Python
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
An orchestration platform for the development, production, and observation of data assets.
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
DuckDB is an analytical in-process SQL database management system
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data