This repository contains files necessary for getting a PySpark notebook up and running on a VirtualBox machine. The notebook will be running at localhost:8888.
- Install VirtualBox for your system: https://www.virtualbox.org/wiki/Downloads
- Install Vagrant for your system: https://www.vagrantup.com/downloads.html
- Install Git for your system: https://git-scm.com/downloads
- Include Git Bash in your installation.
- Clone this repository to your system.
- In the Git Bash terminal, navigate to the location where this repo was cloned.
- Run
vagrant up
at the command line.- This will download/install the VM and the relevant software.
- Once installed, run
vagrant ssh
to ssh into the Vagrant VM. - Open your preferred web browser and navigate to localhost:8888.
- Enjoy your Jupyter Notebook!
- Ubuntu 16.04.2
- Spark 2.0.2
- Java 1.8
- Anaconda 2