  • 众安在线财险有限公司
  • Shanghai

Starred repositories

81 results for source starred repositories

Grammars written for ANTLR v4; expectation that the grammars are free of actions.

ANTLR 10,325 3,723 Updated Jan 5, 2025

Examples and guides for using the OpenAI API

MDX 61,099 9,775 Updated Jan 10, 2025

Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

Java 6,207 1,859 Updated Jan 10, 2025

CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source big data platform. This allows you to reduce your focus on und…

FreeMarker 452 108 Updated Jan 10, 2025

Plato, Tencent's high-performance distributed graph computing framework

C++ 1,902 330 Updated Aug 14, 2021

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 10,687 3,060 Updated Jan 11, 2025

DataGear, a data visualization and analysis platform for freely building any data dashboard you want

Java 1,480 341 Updated Jan 11, 2025

Data, Analytics & AI. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

Rust 8,066 754 Updated Jan 11, 2025

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 11,138 988 Updated Jan 10, 2025

The next generation of cloud-native big data management expert, aiming to help users rapidly build stable, efficient, and scalable cloud-native platforms for big data.

Java 1,157 403 Updated Aug 21, 2024

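Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, as well as large-scale production Flink project cases (PVUV, log storage, tens of billions of records in real time…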

Java 14,614 3,922 Updated May 25, 2024

The Lineage Analysis system for FlinkSQL supports advanced syntax such as Watermark, UDTF, CEP, Windowing TVFs, and CTAS.

Java 373 168 Updated May 10, 2024

Apache InLong - a one-stop, full-scenario integration framework for massive data

Java 1,408 534 Updated Jan 10, 2025

Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display

Java 1,346 334 Updated Aug 7, 2024

BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data ever…

Java 1,633 335 Updated Jan 1, 2024

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

Java 8,213 1,875 Updated Jan 10, 2025

Parses column-level data lineage from SQL

Java 61 28 Updated Jan 9, 2025

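A multi-database SQL parser based on antlr4 that extracts metadata from SQL and can be used in many data platform scenarios: extracting metadata from DDL statements, SQL permission checks, table-level lineage, SQL syntax validation, and more. Supports Spark, Flink, Gauss, StarRocks, Oracle, MySQL, PostgreSQL, SQL Server, DB2, and others.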

ANTLR 317 108 Updated Jan 9, 2025

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Python 4,468 961 Updated Jan 8, 2025
Java 1,620 285 Updated Dec 17, 2024

Apache Iceberg

Java 6,748 2,306 Updated Jan 10, 2025

Parses SQL to extract column-level and table-level lineage, converts it into a lineage model, and presents it in the neo4j graph database.

Java 173 80 Updated Nov 17, 2020

Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark

Java 1,349 856 Updated Aug 22, 2023

Apache Atlas

Java 1,864 851 Updated Jan 9, 2025

Implements a YARN client; datax-on-yarn lets DataX run on the YARN master

Java 17 12 Updated Nov 14, 2023

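Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and keyphrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing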

Python 34,260 10,294 Updated Dec 29, 2024

Supports agile DataOps based on Flink, DataX, Flink-CDC, and Chunjun, with a Web UI

Java 1,063 228 Updated Jan 10, 2025

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 16,856 4,239 Updated Jan 11, 2025

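DataLink is a distributed, scalable data exchange platform that supports real-time incremental synchronization and offline full synchronization between heterogeneous data sources.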

Java 1,089 413 Updated Dec 6, 2022

Apache DolphinScheduler is the modern data orchestration platform. Agile creation of high-performance workflows with low code.

Java 13,248 4,692 Updated Jan 10, 2025