Skip to content
forked from pranab/beymani

Hadoop and Storm based outlier analysis implementations for cyber security and fraud detection

Notifications You must be signed in to change notification settings

maheshsv/beymani

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

Beymani consists of set of Hadoop and Storm based tools for outlier and anamoly detection, which can be used for fraud detection, intrusion detection. All the implementations will be ported to Spark.

Philosophy

  • Simple to use
  • Input output in CSV format
  • Metadata defined in simple JSON file
  • Extremely configurable with tons of configuration knobs

Blogs

The following blogs of mine are good source of details of beymani

Algorithms

  • Multi variate instance distribution model
  • Multi variate sequence or multi gram distribution model
  • Average instance Distance
  • Relative instance Density
  • Markov chain with sequence data
  • Instance clustering
  • Sequence clustering

Getting started

Project's resource directory has various tutorial documents for the use cases described in the blogs.

Build

For Hadoop 1

  • mvn clean install

For Hadoop 2 (non yarn)

  • git checkout nuovo
  • mvn clean install

For Hadoop 2 (yarn)

  • git checkout nuovo
  • mvn clean install -P yarn

Help

Please feel free to email me at [email protected]

Contribution

Contributors are welcome. Please email me at [email protected]

About

Hadoop and Storm based outlier analysis implementations for cyber security and fraud detection

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 86.4%
  • Shell 5.4%
  • Ruby 4.0%
  • Scala 2.9%
  • Python 1.3%