kaohaonan6666

kaohaonan kaohaonan6666

on the way!

hangzhou

Stars

StarRocks / starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,386 1,869 Updated Dec 17, 2024

juicedata / juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 11,045 978 Updated Dec 17, 2024

analysys / presto-hbase-connector

presto hbase connector 组件基于Presto Connector接口规范实现，用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector，我们的性能要快10到100倍以上。

Java 240 100 Updated Jan 2, 2023

Alluxio / alluxio

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 6,884 2,940 Updated Nov 27, 2024

DLLXW / data-science-competition

该仓库用于记录作者本人参加的各大数据科学竞赛的获奖方案源码以及一些新比赛的原创baseline. 主要涵盖：kaggle, 阿里天池，华为云大赛校园赛，百度aistudio，和鲸社区，datafountain等

Python 1,328 473 Updated Apr 21, 2023

smacke / jaydio

A Java library to perform direct I/O in Linux, bypassing file page cache.

Java 315 69 Updated Oct 4, 2022

occlum / occlum

Occlum is a memory-safe, multi-process library OS for Intel SGX

Rust 1,418 235 Updated Dec 14, 2024

intel-analytics / BigDL-2.x

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Jupyter Notebook 2,667 732 Updated Nov 1, 2024

RedisGraph / JRedisGraph

Java API for RedisGraph

Java 59 20 Updated Nov 27, 2023

apache / beam

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 7,912 4,276 Updated Dec 17, 2024

apache / dolphinscheduler

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

Java 13,145 4,672 Updated Dec 16, 2024

pingcap / awesome-database-learning

A list of learning materials to understand databases internals

9,516 1,099 Updated Aug 29, 2024

apache / bahir-flink

Mirror of Apache Bahir Flink

Java 789 430 Updated Oct 30, 2023

antlr / antlr4

ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.

Java 17,352 3,304 Updated Nov 22, 2024

Netflix / curator

ZooKeeper client wrapper and rich ZooKeeper framework

Java 2,150 434 Updated Mar 24, 2023

apache / zookeeper

Apache ZooKeeper

Java 12,293 7,256 Updated Dec 7, 2024

GoogleCloudPlatform / flink-on-k8s-operator

[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.

Go 659 265 Updated Sep 2, 2022

apache / hadoop

Apache Hadoop

Java 14,830 8,901 Updated Dec 17, 2024

alibaba / Sentinel

A powerful flow control component enabling reliability, resilience and monitoring for microservices. (面向云原生微服务的高可用流控防护组件)

Java 22,491 8,052 Updated Oct 24, 2024

pravega / flink-connectors

Apache Flink connectors for Pravega.

Java 96 68 Updated Mar 4, 2024

airbytehq / airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 16,475 4,195 Updated Dec 17, 2024