Skip to content
View Zoket's full-sized avatar

Block or report Zoket

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

bigdata

19 repositories

Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.

Java 887 298 Updated Dec 13, 2024

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

Java 3,201 1,177 Updated Dec 13, 2024

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

Rust 47,931 1,876 Updated Dec 12, 2024

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 2,509 983 Updated Dec 15, 2024

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

Java 1,047 223 Updated Nov 26, 2024

The next generation of cloud-native big data management expert , Aims to help users rapidly build stable, efficient, and scalable cloud-native platforms for big data.

Java 1,140 397 Updated Aug 21, 2024

An Open Standard for lineage metadata collection

Java 1,798 311 Updated Dec 15, 2024

Collect, aggregate, and visualize a data ecosystem's metadata

Java 1,800 322 Updated Dec 14, 2024

基于 antlr4 的多种数据库SQL解析器,获取SQL中元数据,可用于数据平台产品中的多个场景:ddl语句提取元数据、sql 权限校验、表级血缘、sql语法校验等场景。支持spark、flink、gauss、starrocks、Oracle、MYSQL、Postgresql,sqlserver,、db2等

ANTLR 307 104 Updated Dec 12, 2024

Open data platform based on Kubernetes. Scaleph supports SeaTunnel、Flink and Doris backended by SeaTunnel on Flink engine、Flink Kubernetes Operator and Doris operator.

Java 370 102 Updated Dec 9, 2024

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java 1,128 360 Updated Dec 13, 2024

The Metadata Platform for your Data and AI Stack

Java 10,037 2,973 Updated Dec 15, 2024

Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.

Java 545 163 Updated Dec 7, 2024

Big data computing platform based on Spark <至轻云-超轻量级大数据计算平台/数据中台>

Java 126 37 Updated Dec 14, 2024

大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。

Shell 1,509 366 Updated Nov 14, 2024

Apache Paimon Rust The rust implementation of Apache Paimon.

Rust 107 32 Updated Oct 1, 2024

Lakekeeper: A Rust native Iceberg REST Catalog

Rust 297 20 Updated Dec 13, 2024

运行时动态注册切换数据源,自动生成SQL(DDL/DML/DQL),读写元数据,对比数据库结构差异。适配100+关系/非关系数据库。 常用于动态场景的底层支持,如:数据中台、可视化、低代码后台、工作流、自定义表单、异构数据库迁移同步、物联网车联网数据处理、数据清洗、运行时自定义报表/查询条件/数据结构、爬虫数据解析等

Java 213 34 Updated Dec 11, 2024

A collection of RBIR projects and posts for anyone interested in joining this journey.

Rust 198 7 Updated Dec 15, 2024