COOL is an online cohort analytical processing system that supports various types of data analytics, including cube queries, iceberg queries, and cohort queries. The objective of COOL is to provide high-performance (near real-time) analytical responses for the emerging data warehouse domain.
Simply run `mvn package` to build the project.
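For reference, a typical build from the repository root looks like the following (the `-DskipTests` flag is a standard Maven option mentioned only as a convenience; it is not taken from this project's documentation):

```
# Build all modules; the loader jar is typically produced under a target/ directory
mvn package

# Optionally skip unit tests to speed up the build
mvn package -DskipTests
```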
Required sources:
- dataset: a csv file with a "," delimiter (normally dumped from a database table)
- dimension file: a csv file with a "," delimiter. Each line has two fields: the first is the name of a column in the dataset, and the second is a value of that column. Each distinct value of each column in the dataset shall appear exactly once in this dimension file (see the example snippet after this list).
- schema file: a yaml file describing the schema of the dataset (e.g. table.yaml)
- cube schema: a yaml file specifying the dimension and measure fields (optional)
- query file: a yaml file specifying the parameters for running the query server. Currently, it is only required to specify the location of the runtime directory (detailed in Step 2).
We have provided an example of each of the three yaml documents in the sogamo directory.
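To illustrate the relationship between the dataset and the dimension file, suppose the dataset has a column named `player` with distinct values `fred` and `alice`, and a column named `country` with distinct values `Australia` and `Japan` (these column names and values are invented for this sketch and are not taken from the sogamo example). The corresponding dimension file would then contain:

```
player,fred
player,alice
country,Australia
country,Japan
```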
Before query processing, we need to compact the dataset with the following command:
```
java -jar cohana-loader.jar /path/to/table.yaml /path/to/dimension.csv /path/to/dataset.csv /path/to/output/directory 65536
```
where the five arguments are as follows:
- the schema file, table.yaml (the third required source)
- the dimension file (the second required source)
- the dataset file (the first required source)
- the output directory for the compacted dataset
- the chunk size (65536 in this example)
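As a concrete sketch, the loader could be invoked on the sogamo example like this (the file names and paths below are assumptions for illustration; substitute the actual names found in the sogamo directory):

```
# Hypothetical paths; check the sogamo directory for the actual file names
java -jar cohana-loader.jar \
    sogamo/table.yaml \
    sogamo/dimension.csv \
    sogamo/dataset.csv \
    sogamo/compacted \
    65536
```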
We have provided an example of cohort query processing in CohortLoader.java.