Skip to content
View alldatafounder's full-sized avatar

Block or report alldatafounder

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 75,531 11,015 Updated Feb 28, 2025

Apache Iceberg

Java 6,957 2,394 Updated Feb 28, 2025

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 2,659 1,032 Updated Mar 1, 2025

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,667 2,460 Updated Feb 28, 2025

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

Java 8,319 1,914 Updated Feb 27, 2025

ClickHouse® is a real-time analytics database management system

C++ 39,257 7,137 Updated Mar 1, 2025

The official home of the Presto distributed SQL query engine for big data

Java 16,221 5,422 Updated Mar 1, 2025

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,591 1,926 Updated Mar 1, 2025

Apache Doris is an easy-to-use, high performance and unified analytics database.

Java 13,239 3,392 Updated Mar 1, 2025

Flink CDC is a streaming data integration tool

Java 5,973 2,018 Updated Feb 22, 2025

A data integration framework

Java 4,031 1,699 Updated Nov 28, 2024

DataX是阿里云DataWorks数据集成的开源版本。

Java 16,242 5,512 Updated Feb 18, 2025

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

Java 1,092 234 Updated Feb 27, 2025

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 6,165 1,164 Updated Mar 1, 2025

SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.

Java 3,038 559 Updated Mar 1, 2025

Datart is a next generation Data Visualization Open Platform

TypeScript 2,056 605 Updated Feb 10, 2025

The Metadata Platform for your Data and AI Stack

Java 10,349 3,058 Updated Mar 1, 2025

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

Java 13,290 4,719 Updated Feb 24, 2025

Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.

Java 592 177 Updated Feb 20, 2025

CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source big data platform. This allows you to reduce your focus on und…

FreeMarker 449 110 Updated Feb 20, 2025

The next generation of cloud-native big data management expert , Aims to help users rapidly build stable, efficient, and scalable cloud-native platforms for big data.

Java 1,182 405 Updated Aug 21, 2024

Make stream processing easier! Easy-to-use streaming application development framework and operation platform.

Java 4,010 1,026 Updated Feb 28, 2025

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

Java 3,330 1,209 Updated Mar 1, 2025

🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。采购商业版、加入技术社区:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo

Java 2,633 879 Updated Feb 27, 2025