Zoket

Zoket

3 followers · 6 following

Stars

bigdata

19 repositories

apache / amoro

Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.

Java 887 298 Updated Dec 13, 2024

DataLinkDC / dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

Java 3,201 1,177 Updated Dec 13, 2024

meilisearch / meilisearch

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

Rust 47,931 1,876 Updated Dec 12, 2024

apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 2,509 983 Updated Dec 15, 2024

datavane / tis

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

Java 1,047 223 Updated Nov 26, 2024

datavane / datasophon

The next generation of cloud-native big data management expert , Aims to help users rapidly build stable, efficient, and scalable cloud-native platforms for big data.

Java 1,140 397 Updated Aug 21, 2024

OpenLineage / OpenLineage

An Open Standard for lineage metadata collection

Java 1,798 311 Updated Dec 15, 2024

MarquezProject / marquez

Collect, aggregate, and visualize a data ecosystem's metadata

Java 1,800 322 Updated Dec 14, 2024

melin / superior-sql-parser

基于 antlr4 的多种数据库SQL解析器，获取SQL中元数据，可用于数据平台产品中的多个场景：ddl语句提取元数据、sql 权限校验、表级血缘、sql语法校验等场景。支持spark、flink、gauss、starrocks、Oracle、MYSQL、Postgresql，sqlserver,、db2等

ANTLR 307 104 Updated Dec 12, 2024

flowerfine / scaleph

Open data platform based on Kubernetes. Scaleph supports SeaTunnel、Flink and Doris backended by SeaTunnel on Flink engine、Flink Kubernetes Operator and Doris operator.

Java 370 102 Updated Dec 9, 2024

apache / gravitino

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java 1,128 360 Updated Dec 13, 2024

datahub-project / datahub

The Metadata Platform for your Data and AI Stack

Java 10,037 2,973 Updated Dec 15, 2024

datavane / datavines

Know your data better！Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.

Java 545 163 Updated Dec 7, 2024

isxcode / spark-yun

Big data computing platform based on Spark <至轻云-超轻量级大数据计算平台/数据中台>

Java 126 37 Updated Dec 14, 2024

collabH / bigdata-growth

大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。

Shell 1,509 366 Updated Nov 14, 2024

apache / paimon-rust

Apache Paimon Rust The rust implementation of Apache Paimon.

Rust 107 32 Updated Oct 1, 2024

lakekeeper / lakekeeper

Lakekeeper: A Rust native Iceberg REST Catalog

Rust 297 20 Updated Dec 13, 2024

anylineorg / anyline

运行时动态注册切换数据源，自动生成SQL(DDL/DML/DQL)，读写元数据，对比数据库结构差异。适配100+关系/非关系数据库。常用于动态场景的底层支持,如:数据中台、可视化、低代码后台、工作流、自定义表单、异构数据库迁移同步、物联网车联网数据处理、数据清洗、运行时自定义报表/查询条件/数据结构、爬虫数据解析等

Java 213 34 Updated Dec 11, 2024

rewrite-bigdata-in-rust / RBIR

A collection of RBIR projects and posts for anyone interested in joining this journey.

Rust 198 7 Updated Dec 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly