Skip to content
View aikin's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report aikin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Data

18 repositories

Apache Iceberg

Java 6,971 2,400 Updated Mar 4, 2025

Energy Efficient Data Platform

Rust 18 1 Updated Dec 2, 2024

Artifacts of the EKGF Data Product Workgroup (DPROD)

HTML 23 8 Updated Feb 7, 2025

The Data Product Descriptor Specification (DPDS) Repository

HTML 77 3 Updated Jan 14, 2025

Open Data Product Specification 3.0

SCSS 9 4 Updated Nov 28, 2024

A curated list of data engineering tools for software developers

7,135 1,283 Updated Feb 17, 2025

An Awesome List of Open-Source Data Engineering Projects

2,361 399 Updated Oct 4, 2024

A curated list of awesome ETL frameworks, libraries, and software.

3,359 347 Updated Jul 23, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,841 1,782 Updated Mar 4, 2025

Open, Multi-modal Catalog for Data & AI

Python 2,698 445 Updated Mar 4, 2025

An orchestration platform for the development, production, and observation of data assets.

Python 12,635 1,604 Updated Mar 4, 2025

Apache Flink

Java 24,581 13,533 Updated Mar 4, 2025

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 15,057 3,640 Updated Mar 4, 2025

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Python 1,264 165 Updated Feb 16, 2025

the portable Python dataframe library

Python 5,559 616 Updated Mar 4, 2025

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 64,780 14,607 Updated Mar 4, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,658 28,526 Updated Mar 4, 2025

The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

TypeScript 78,364 7,857 Updated Mar 4, 2025