Skip to content
View laofan13's full-sized avatar
🎯
Focusing
🎯
Focusing
  • shanghai
  • 10:16 (UTC -12:00)

Block or report laofan13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

bigData

BigData System
19 repositories

Apache ZooKeeper

Java 12,317 7,256 Updated Dec 21, 2024

A curated list of awesome big data frameworks, ressources and other awesomeness.

13,341 2,561 Updated May 7, 2024

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,528 2,439 Updated Dec 25, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,705 1,735 Updated Dec 21, 2024

Apache Iceberg

Java 6,696 2,293 Updated Dec 27, 2024

Apache Hive

Java 5,597 4,698 Updated Dec 23, 2024

汇总Apache Hudi相关资料

541 158 Updated Dec 22, 2024

Mirror of Apache Kafka

Java 29,145 14,053 Updated Dec 27, 2024

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,266 28,409 Updated Dec 27, 2024

The official home of the Presto distributed SQL query engine for big data

Java 16,136 5,401 Updated Dec 27, 2024

Apache Druid: a high performance real-time analytics database.

Java 13,567 3,714 Updated Dec 25, 2024

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,453 1,876 Updated Dec 27, 2024

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 14,781 3,579 Updated Dec 27, 2024

A composable and fully extensible C++ execution engine library for data management systems.

C++ 3,565 1,182 Updated Dec 27, 2024

Apache Calcite

Java 4,668 2,388 Updated Dec 27, 2024

Notes talking about the design and implementation of Apache Spark

5,295 1,839 Updated Apr 2, 2024

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,236 446 Updated Dec 27, 2024

Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.

Rust 1,349 128 Updated Dec 26, 2024

Apache Lucene open-source search software

Java 2,778 1,054 Updated Dec 26, 2024