Skip to content
View kaohaonan6666's full-sized avatar
  • hangzhou

Block or report kaohaonan6666

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,386 1,869 Updated Dec 17, 2024

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 11,046 978 Updated Dec 17, 2024

presto hbase connector 组件基于Presto Connector接口规范实现,用来给Presto增加查询HBase的功能。相比其他开源版本的HBase Connector,我们的性能要快10到100倍以上。

Java 240 100 Updated Jan 2, 2023

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 6,884 2,940 Updated Nov 27, 2024

该仓库用于记录作者本人参加的各大数据科学竞赛的获奖方案源码以及一些新比赛的原创baseline. 主要涵盖:kaggle, 阿里天池,华为云大赛校园赛,百度aistudio,和鲸社区,datafountain等

Python 1,328 473 Updated Apr 21, 2023

A Java library to perform direct I/O in Linux, bypassing file page cache.

Java 315 69 Updated Oct 4, 2022

Occlum is a memory-safe, multi-process library OS for Intel SGX

Rust 1,418 235 Updated Dec 14, 2024

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Jupyter Notebook 2,667 732 Updated Nov 1, 2024

Java API for RedisGraph

Java 59 20 Updated Nov 27, 2023

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 7,912 4,277 Updated Dec 17, 2024

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

Java 13,145 4,672 Updated Dec 16, 2024

A list of learning materials to understand databases internals

9,516 1,099 Updated Aug 29, 2024

Mirror of Apache Bahir Flink

Java 789 430 Updated Oct 30, 2023

ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.

Java 17,352 3,304 Updated Nov 22, 2024

ZooKeeper client wrapper and rich ZooKeeper framework

Java 2,150 434 Updated Mar 24, 2023

Apache ZooKeeper

Java 12,293 7,256 Updated Dec 7, 2024

[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.

Go 659 265 Updated Sep 2, 2022

Apache Hadoop

Java 14,830 8,901 Updated Dec 17, 2024

A powerful flow control component enabling reliability, resilience and monitoring for microservices. (面向云原生微服务的高可用流控防护组件)

Java 22,491 8,052 Updated Oct 24, 2024

Apache Flink connectors for Pravega.

Java 96 68 Updated Mar 4, 2024

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 16,476 4,195 Updated Dec 17, 2024

flink sql to oracle

Java 32 12 Updated Sep 13, 2020

subscribe from dts data store

Java 63 76 Updated Jun 17, 2022

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Go 2,817 1,383 Updated Dec 17, 2024

😎 A curated list of amazingly awesome Flink and Flink ecosystem resources

775 112 Updated Jun 8, 2023

Benchmarks for Apache Flink

Java 172 86 Updated Nov 4, 2024

Benchmarks for queries over continuous data streams.

Java 320 104 Updated Nov 21, 2024

Time Series Benchmark Suite, a tool for comparing and evaluating databases for time series data

Go 1,314 306 Updated Aug 6, 2024

DataX是阿里云DataWorks数据集成的开源版本。

Java 16,092 5,475 Updated Oct 17, 2024
Next