Build Spark Batch/Streaming/MLlib Application by SQL

jiyulongxu/streamingpro

StreamingPro is a fast, expressive, and convenient system running on Spark with streaming, batch, interactive query, and machine learning support.

StreamingPro makes it easier to build Spark applications without writing code, by means of:

  • JSON configuration files combined with reusable modules, which give users a declarative way to build Spark applications.
  • SQL-based data processing.
  • Script support.

StreamingPro is not only a complete, out-of-the-box application but also an extensible, programmable framework for Spark, since you can develop your own compositors (a.k.a. modules).
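
The idea of a JSON-declared workflow assembled from pluggable compositors can be sketched roughly as follows. This is a minimal illustration only, not StreamingPro's actual configuration schema or API: the JSON layout, the `demo.source` and `demo.sql` names, the `REGISTRY` dictionary, and the `run_workflow` helper are all hypothetical, and Python's built-in `sqlite3` stands in for Spark SQL so the example runs without a cluster.

```python
import json
import sqlite3
from typing import Any, Callable, Dict, List

# Hypothetical types: a row is a plain dict; a compositor takes the rows
# produced so far plus its own params and returns the next list of rows.
Row = Dict[str, Any]
Compositor = Callable[[List[Row], Dict[str, Any]], List[Row]]

# Hypothetical registry mapping names used in the JSON file to implementations.
REGISTRY: Dict[str, Compositor] = {}


def compositor(name: str) -> Callable[[Compositor], Compositor]:
    """Register a function under the name the config file refers to."""
    def register(fn: Compositor) -> Compositor:
        REGISTRY[name] = fn
        return fn
    return register


@compositor("demo.source")
def source(_rows: List[Row], params: Dict[str, Any]) -> List[Row]:
    # Emits rows listed inline in the config (a stand-in for a real input).
    return [dict(r) for r in params["rows"]]


@compositor("demo.sql")
def sql(rows: List[Row], params: Dict[str, Any]) -> List[Row]:
    # Loads the incoming rows into an in-memory table, then runs the
    # configured SQL statement against it (sqlite3 here, not Spark SQL).
    if not rows:
        return []
    table, columns = params["table"], list(rows[0].keys())
    conn = sqlite3.connect(":memory:")
    conn.row_factory = sqlite3.Row
    conn.execute(f"CREATE TABLE {table} ({', '.join(columns)})")
    marks = ", ".join("?" for _ in columns)
    conn.executemany(
        f"INSERT INTO {table} VALUES ({marks})",
        [tuple(r[c] for c in columns) for r in rows],
    )
    result = [dict(r) for r in conn.execute(params["sql"])]
    conn.close()
    return result


def run_workflow(config_text: str, job_name: str) -> List[Row]:
    """Run the named job by chaining its compositors in declared order."""
    job = json.loads(config_text)[job_name]
    rows: List[Row] = []
    for step in job["compositor"]:
        rows = REGISTRY[step["name"]](rows, step.get("params", {}))
    return rows


# Hypothetical config: the whole job is data, no job-specific code.
CONFIG = """
{
  "demo-job": {
    "desc": "keep pages with at least 2 hits",
    "compositor": [
      {"name": "demo.source",
       "params": {"rows": [{"page": "a", "hits": 1},
                           {"page": "b", "hits": 3},
                           {"page": "c", "hits": 2}]}},
      {"name": "demo.sql",
       "params": {"table": "events",
                  "sql": "SELECT page, hits FROM events WHERE hits >= 2 ORDER BY page"}}
    ]
  }
}
"""

if __name__ == "__main__":
    print(run_workflow(CONFIG, "demo-job"))
    # prints [{'page': 'b', 'hits': 3}, {'page': 'c', 'hits': 2}]
```

In this sketch, changing the job means editing the JSON, while adding a capability means registering one more function under a new name, which mirrors the split between declarative workflows and custom compositors described above.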

Features

  • Pure Spark Streaming (or plain Spark) program
  • No coding required, only declarative workflows
  • REST API for interactive querying
  • Support for SQL-oriented workflows
  • Data continuously streamed in and processed in near real time
  • Dynamic CRUD of workflows at runtime via the REST API
  • Flexible workflows (input, output, parsers, etc.)
  • High performance
  • Scalable

Download

Download page: https://pan.baidu.com/s/1i4POWvV

Documents

More articles (in Chinese): http://www.jianshu.com/c/759bc22b9e15

Architecture

If no picture is shown, please click me.

If github is too slow to view, please click me.

Declarative workflows for building Spark Streaming applications

If no picture is shown, please click me.

Implementation

If no picture is shown, please click me.
