Data
The Data Product Descriptor Specification (DPDS) Repository
Open Data Product Specification 3.0
A curated list of data engineering tools for software developers
An Awesome List of Open-Source Data Engineering Projects
A curated list of awesome ETL frameworks, libraries, and software.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Open, Multi-modal Catalog for Data & AI
An orchestration platform for the development, production, and observation of data assets.
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
Apache Superset is a Data Visualization and Data Exploration Platform
Apache Spark - A unified analytics engine for large-scale data processing
The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.