GitHub - avitorovic/squall at 72ffc720a736b6866b244da95aed6be08ebf2778

![Squall logo](https://raw.githubusercontent.com/epfldata/squall/master/resources/graphics/logo.jpg)

# Squall

Squall is an online query processing engine built on top of Storm. Just as Hive provides SQL syntax on top of Hadoop for batch processing, Squall executes SQL queries on top of Storm for online processing. Squall supports a wide class of SQL analytics, ranging from simple aggregations to more advanced UDF join predicates and adaptive rebalancing of load. It is being actively developed by several contributors from the EPFL DATA lab and is under continuous development.

## Example

Consider the following SQL query:

```sql
SELECT C_MKTSEGMENT, COUNT(O_ORDERKEY)
FROM CUSTOMER JOIN ORDERS ON C_CUSTKEY = O_CUSTKEY
GROUP BY C_MKTSEGMENT
```

Through the Squall API, the online distributed query plan (full code) can be simply formulated as follows:

```java
// CUSTOMER source: keep columns 0 and 6 (the customer key and market segment used by the query),
// then hash on column 0 of the projected tuple, i.e. the join key.
ProjectOperator projectionCustomer = new ProjectOperator(new int[]{0, 6});
ArrayList<Integer> hashCustomer = new ArrayList<Integer>(Arrays.asList(0));
DataSourceComponent relationCustomer =
    new DataSourceComponent("CUSTOMER", dataPath + "customer" + extension, _queryPlan)
        .addOperator(projectionCustomer)
        .setHashIndexes(hashCustomer);

// ORDERS source: keep column 1 (the customer key) and hash on it.
ProjectOperator projectionOrders = new ProjectOperator(new int[]{1});
ArrayList<Integer> hashOrders = new ArrayList<Integer>(Arrays.asList(0));
DataSourceComponent relationOrders =
    new DataSourceComponent("ORDERS", dataPath + "orders" + extension, _queryPlan)
        .addOperator(projectionOrders)
        .setHashIndexes(hashOrders);

// Equi-join of the two sources; the join output is hashed on column 1,
// the same column the aggregation groups by.
ArrayList<Integer> hashIndexes = new ArrayList<Integer>(Arrays.asList(1));
EquiJoinComponent CUSTOMER_ORDERSjoin =
    new EquiJoinComponent(relationCustomer, relationOrders, _queryPlan)
        .setHashIndexes(hashIndexes);

// COUNT aggregation grouped by column 1 of the join output.
AggregateCountOperator agg = new AggregateCountOperator().setGroupByColumns(Arrays.asList(1));
OperatorComponent oc = new OperatorComponent(CUSTOMER_ORDERSjoin, "COUNTAGG", _queryPlan).setOperator(agg);
```
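To make the intended result concrete, here is a minimal single-process sketch of what the query computes when tuples arrive one at a time. It uses only plain Java collections and is not Squall code: the class `OnlineJoinCountSketch` and its methods are hypothetical names chosen for illustration, and it assumes each `C_CUSTKEY` arrives at most once and that `O_ORDERKEY` is never null.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration, not part of the Squall API.
class OnlineJoinCountSketch {
    // C_CUSTKEY -> C_MKTSEGMENT for the CUSTOMER tuples seen so far
    private final Map<Integer, String> customers = new HashMap<>();
    // O_CUSTKEY -> number of ORDERS tuples seen so far for that key
    private final Map<Integer, Integer> orderCounts = new HashMap<>();
    // C_MKTSEGMENT -> running COUNT(O_ORDERKEY), sorted for stable printing
    private final Map<String, Integer> counts = new TreeMap<>();

    // A CUSTOMER tuple arrives: store it and join it with orders seen earlier.
    // Assumes each customer key arrives at most once.
    void onCustomer(int custKey, String mktSegment) {
        customers.put(custKey, mktSegment);
        int earlierOrders = orderCounts.getOrDefault(custKey, 0);
        if (earlierOrders > 0) {
            counts.merge(mktSegment, earlierOrders, Integer::sum);
        }
    }

    // An ORDERS tuple arrives: store it and join it with the customer, if already seen.
    void onOrder(int orderKey, int custKey) {
        orderCounts.merge(custKey, 1, Integer::sum);
        String mktSegment = customers.get(custKey);
        if (mktSegment != null) {
            counts.merge(mktSegment, 1, Integer::sum);
        }
    }

    // Current answer of the query over everything seen so far.
    Map<String, Integer> result() {
        return counts;
    }

    public static void main(String[] args) {
        OnlineJoinCountSketch query = new OnlineJoinCountSketch();
        query.onOrder(100, 1);              // order arrives before its customer
        query.onCustomer(1, "BUILDING");
        query.onOrder(101, 1);
        query.onCustomer(2, "MACHINERY");
        query.onOrder(102, 2);
        query.onOrder(103, 3);              // no matching customer yet, so no output row
        System.out.println(query.result()); // prints {BUILDING=2, MACHINERY=1}
    }
}
```

Both inputs are kept in memory so that a tuple from either side can be matched against what the other side has already delivered, and the counts are updated in place rather than recomputed. A distributed engine would additionally partition this state by key across workers, which this sketch does not attempt.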

## Documentation

Detailed documentation can be found on the Squall wiki.

## Contributing to Squall

We'd love to have your help in making Squall better. If you're interested, please send us your suggestions and get your name added to the Contributors list.

## License

Squall is licensed under the Apache License v2.0.
