laofan13

Follow

🎯

Focusing

laofan laofan13

🎯

Focusing

Follow

18 followers · 115 following

shanghai
10:16 (UTC -12:00)

Stars

bigData

BigData System

19 repositories

apache / zookeeper

Apache ZooKeeper

Java 12,317 7,256 Updated Dec 21, 2024

oxnr / awesome-bigdata

A curated list of awesome big data frameworks, ressources and other awesomeness.

13,341 2,561 Updated May 7, 2024

apache / hudi

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,528 2,439 Updated Dec 25, 2024

delta-io / delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,705 1,735 Updated Dec 21, 2024

apache / iceberg

Apache Iceberg

Java 6,696 2,293 Updated Dec 27, 2024

apache / hive

Apache Hive

Java 5,597 4,698 Updated Dec 23, 2024

leesf / hudi-resources

汇总Apache Hudi相关资料

541 158 Updated Dec 22, 2024

apache / kafka

Mirror of Apache Kafka

Java 29,145 14,053 Updated Dec 27, 2024

apache / spark

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,266 28,409 Updated Dec 27, 2024

prestodb / presto

The official home of the Presto distributed SQL query engine for big data

Java 16,136 5,401 Updated Dec 27, 2024

apache / druid

Apache Druid: a high performance real-time analytics database.

Java 13,567 3,714 Updated Dec 25, 2024

StarRocks / starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,453 1,876 Updated Dec 27, 2024

apache / arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 14,781 3,579 Updated Dec 27, 2024

facebookincubator / velox

A composable and fully extensible C++ execution engine library for data management systems.

C++ 3,565 1,182 Updated Dec 27, 2024

apache / calcite

Apache Calcite

Java 4,668 2,388 Updated Dec 27, 2024

JerryLead / SparkInternals

Notes talking about the design and implementation of Apache Spark

5,295 1,839 Updated Apr 2, 2024

apache / incubator-gluten

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,236 446 Updated Dec 27, 2024

kwai / blaze

Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.

Rust 1,349 128 Updated Dec 26, 2024

apache / lucene

Apache Lucene open-source search software

Java 2,778 1,054 Updated Dec 26, 2024