Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
anomaly-detection-nyc-taxi.ipynb		anomaly-detection-nyc-taxi.ipynb

README.md

Anomaly Detection

This is a simple example of unsupervised anomaly detection using Analytics Zoo Keras-Style API. We use RNN to predict following data values based on previous sequence (in order) and measure the distance between predicted values and actual values. If the distance is above some threshold, we report those values as anomaly.

Environment

Python 3.5/3.6 (pandas 0.22.0)
Apache Spark 2.x (This version needs to be same with the version you use to build Analytics Zoo)

Install or download Analytics Zoo

Follow the instructions here to install analytics-zoo via pip or download the prebuilt package.

Run Jupyter after pip install

export SPARK_DRIVER_MEMORY=2g
jupyter notebook --notebook-dir=./ --ip=* --no-browser

Run Jupyter with prebuilt package

Run export SPARK_HOME=the root directory of Spark.
Run export ANALYTICS_ZOO_HOME=the folder where you extract the downloaded Analytics Zoo zip package
Run $ANALYTICS_ZOO_HOME/bin/data/NAB/nyc_taxi/get_nyc_taxi.sh to download the dataset. (It can also be downloaded from its github).
Run the following bash command to start the jupyter notebook. Change parameter settings as you need, ie MASTER = local[physcial_core_number].

MASTER=local[4]
${ANALYTICS_ZOO_HOME}/bin/jupyter-with-zoo.sh \
    --master ${MASTER} \
    --driver-cores 4  \
    --driver-memory 2g  \
    --total-executor-cores 4  \
    --executor-cores 4  \
    --executor-memory 2g

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

anomaly-detection

anomaly-detection

README.md

Anomaly Detection

Environment

Install or download Analytics Zoo

Run Jupyter after pip install

Run Jupyter with prebuilt package

Files

anomaly-detection

Directory actions

More options

Directory actions

More options

Latest commit

History

anomaly-detection

Folders and files

parent directory

README.md

Anomaly Detection

Environment

Install or download Analytics Zoo

Run Jupyter after pip install

Run Jupyter with prebuilt package