ETL
轻量级任务批量调度框架,实现任务编排拓扑分析,适合多任务处理场景,比如ETL,数据质量分析等
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
An orchestration platform for the development, production, and observation of data assets.
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
Dagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Pentaho Data Integration ( ETL ) a.k.a Kettle
Apache Atlas - Open Metadata Management and Governance capabilities across the Hadoop platform and beyond
大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块
DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、数据源信息加密等。
Data develop engine 数据研发引擎,用可视化的组件编排后台数据处理逻辑,配合消息触发、定时任务和restful接口等多种调度机制,从而能够代替传统的大部分后台接口研发。
Real-time ETL developed by Flink, data from MySQL to Greenplum. Use canal to parse the MySQL binlog, put it into kafka, use Flink to consume kafka and assemble the data into Greenplum, and more dat…
通过画流程图的方式动态的生成数据处理程序对数据进行处理。 使用Java作为调度引擎,集成python端作为数据处理模块的一个dataops
A microservices-based Streaming and Batch data processing in Cloud Foundry and Kubernetes
Wormhole is a SPaaS (Stream Processing as a Service) Platform
Open data platform based on Kubernetes. Scaleph supports SeaTunnel、Flink and Doris backended by SeaTunnel on Flink engine、Flink Kubernetes Operator and Doris operator.
为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data ever…
基于当前互联网最热的springboot微服务架构,采用丰富的vue、iview等前端组件打造的kettle调度监控服务平台,解决了企业实际数据抽取业务场景中,无法实现kettle web端配置、调用、监控的痛点
基于当前互联网最热的springboot微服务架构,采用丰富的vue、iview等前端组件打造的kettle调度监控服务平台,解决了企业实际数据抽取业务场景中,无法实现kettle web端配置、调用、监控的痛点
实现yarn客户端,datax-on-yarn可以让datax在yarn master上运行
基于阿里Datax改版web datax ,支持管理平台与restful风格API
DataX分布式集群与负载均衡、任务执行/统计,基于DataX的通用数据同步微服务,一个Restful接口搞定所有通用数据同步
webSpoon is a web-based graphical designer for Pentaho Data Integration with the same look & feel as Spoon