Skip to content
View VincentSleepless's full-sized avatar
🎯
Focusing
🎯
Focusing
  • 众安在线财险有限公司
  • Shanghai

Block or report VincentSleepless

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Python 1 Updated Oct 30, 2024

Grammars written for ANTLR v4; expectation that the grammars are free of actions.

ANTLR 10,290 3,723 Updated Dec 22, 2024

Examples and guides for using the OpenAI API

MDX 60,773 9,693 Updated Dec 22, 2024

Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

Java 6,188 1,849 Updated Dec 23, 2024

CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source big data platform. This allows you to reduce your focus on und…

FreeMarker 449 108 Updated Dec 18, 2024

腾讯高性能分布式图计算框架Plato

C++ 1,902 331 Updated Aug 14, 2021

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 10,624 3,044 Updated Dec 23, 2024

DataGear数据可视化分析平台,自由制作任何您想要的数据看板

Java 1,474 340 Updated Dec 23, 2024

𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

Rust 8,003 753 Updated Dec 23, 2024

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 11,070 980 Updated Dec 23, 2024

The next generation of cloud-native big data management expert , Aims to help users rapidly build stable, efficient, and scalable cloud-native platforms for big data.

Java 1,146 401 Updated Aug 21, 2024

flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去…

Java 14,596 3,920 Updated May 25, 2024

The Lineage Analysis system for FlinkSQL supports advanced syntax such as Watermark, UDTF, CEP, Windowing TVFs, and CTAS.

Java 373 168 Updated May 10, 2024

Apache InLong - a one-stop, full-scenario integration framework for massive data

Java 1,407 534 Updated Dec 23, 2024

Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display

Java 1,341 335 Updated Aug 7, 2024

BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data ever…

Java 1,629 334 Updated Jan 1, 2024

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

Java 8,161 1,857 Updated Dec 23, 2024

蓝鲸作业平台(Job)是一套运维基础操作管理系统,具备海量任务并发处理能力。除了支持脚本执行、文件分发、定时任务等一系列基础运维场景以外,还支持通过流程调度能力将零碎的单个任务组装成一个自动化作业流程;而每个作业都可做为一个原子节点,提供给上层或周边系统/平台使用,实现调度自动化。

Java 1 Updated Jan 18, 2023

数据血缘

Java 57 26 Updated Dec 9, 2024

基于 antlr4 的多种数据库SQL解析器,获取SQL中元数据,可用于数据平台产品中的多个场景:ddl语句提取元数据、sql 权限校验、表级血缘、sql语法校验等场景。支持spark、flink、gauss、starrocks、Oracle、MYSQL、Postgresql,sqlserver,、db2等

ANTLR 308 105 Updated Dec 12, 2024

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Python 4,460 961 Updated Dec 19, 2024
Java 1,617 283 Updated Dec 17, 2024

Apache Iceberg

Java 6,680 2,290 Updated Dec 23, 2024

解析SQL,获取字段、表级别的血缘关系。转换成血缘模型,在图数据库neo4j上呈现。

Java 173 80 Updated Nov 17, 2020

Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark

Java 1,353 858 Updated Aug 22, 2023
Java 6 4 Updated Sep 25, 2020

Apache Atlas

Java 1,853 850 Updated Dec 21, 2024

实现yarn客户端,datax-on-yarn可以让datax在yarn master上运行

Java 16 12 Updated Nov 14, 2023

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

Python 34,164 10,258 Updated Dec 18, 2024

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

Java 1,053 224 Updated Dec 22, 2024
Next