This example demonstrates how to analyze & visualize New York City traffic incident data using the ELK stack, i.e. Elasticsearch, Logstash and Kibana. The NYPD Motor Vehicle Collision data analyzed in this example is from the NYC Open Data initiative.
Feel free to read the #byodemos: New York city traffic incidents blog post for additional commentary on this analysis. A couple of notes on the blog. The screenshots in the blog post were created with an older version of Kibana. So, don't be alarmed if your Kibana UI looks a little different. Secondly, the good folks at NYC Open Data are great at updating their dataset with latest information. So the visualization and metrics that you see might not match the ones highlighted in the blog post. But, that is the fun part of exploring a living & dynamic dataset, isn't it?
Example has been tested in following versions:
- Elasticsearch 1.7.0
- Logstash 1.5.2
- Kibana 4.1.0
-
Follow the Installation & Setup Guide to install and test the ELK stack (you can skip this step if you already have a working installation of the ELK stack)
-
Run Elasticsearch & Kibana
<path_to_elasticsearch_root_dir>/bin/elasticsearch <path_to_kibana_root_dir>/bin/kibana
-
Check that Elasticsearch and Kibana are up and running.
- Open
localhost:9200
in web browser -- should return status code 200 - Open
localhost:5601
in web browser -- should display Kibana UI.
Note: By default, Elasticsearch runs on port 9200, and Kibana run on ports 5601. If you changed the default ports, change the above calls to use appropriate ports.
- Open
-
Download Data:
Download the CSV version of the NYPD Motor Vehicle Collision dataset from the NYC Open Data Portal. In this example, we are renaming the downloaded CSV file to
nyc_collision_data.csv
.
mkdir nyc_collision
cd nyc_collision
wget https://data.cityofnewyork.us/api/views/h9gi-nx95/rows.csv?accessType=DOWNLOAD -O nyc_collision_data.csv
-
Download Example Files:
Download the following files to the folder containing the downloaded
nyc_collision_data.csv file
:nyc_collision_logstash.conf
- Logstash config for ingesting data into Elasticsearchnyc_collision_template.json
- template for custom mapping of fieldsnyc_collision_kibana.json
- config file to load prebuilt Kibana dashboard
Unfortunately, Github does not provide a convenient one-click option to download entire contents of a subfolder in a repo. You can either (a) download or clone the entire examples repo and navigate to
elk_nyc_accidents
subfolder, or (b) individually download the above files. The code below makes option (b) a little easier:wget https://raw.githubusercontent.com/elastic/examples/master/ELK_nyc_traffic_accidents/nyc_collision_logstash.conf wget https://raw.githubusercontent.com/elastic/examples/master/ELK_nyc_traffic_accidents/nyc_collision_template.json wget https://raw.githubusercontent.com/elastic/examples/master/ELK_nyc_traffic_accidents/nyc_collision_kibana.json
-
Execute the following command to
nyc_collision_data.csv
data into Elasticsearch.cat nyc_collision_data.csv | <path_to_logstash_root_dir>/bin/logstash -f nyc_collision_logstash.conf
-
Verify that data is successfully indexed into Elasticsearch
Running
http://localhost:9200/nyc_visionzero/_count
should return positivecount
value
Note: Included nyc_collision_logstash.conf
configuration file assumes that you are running Elasticsearch on the same host as Logstash and have not changed the defaults. Modify the host
and cluster
settings in the output { elasticsearch { ... } }
section of apache_logstash.conf, if needed.
- Access Kibana by going to
http://localhost:5601
in a web browser - Connect Kibana to the
nyc_visionzero
index in Elasticsearch (autocreated in step 1)- Click the Settings tab >> Indices tab >> Create New. Specify
nyc_visionzero
as the index pattern name and click Create to define the index pattern. (Leave the Use event times to create index names box unchecked)
- Click the Settings tab >> Indices tab >> Create New. Specify
- Load sample dashboard into Kibana
- Click the Settings tab >> Objects tab >> Import, and select
nyc_collision_kibana.json
- Click the Settings tab >> Objects tab >> Import, and select
- Open dashboard
- Click on Dashboard tab and open
NYC Motor Vehicles Collision
dashboard
- Click on Dashboard tab and open
Voila! You should see the following dashboard. Happy Data Exploration!
If you run into issues running this example or have suggestions to improve it, please use Github issues to let us know. Have an easy fix, submit a pull request. We will try our best to respond in a timely manner!
Have you created interesting examples using the ELK stack? Looking for a way to share your amazing work with the community? We would love to include your awesome work here. For more information on how to contribute, check out the Contribution section!