This repository has a Jupyter Notebook as the main way to interact with Spark 2.0. Additionally I have created some examples that we will be going over during the training.
To be able to run this container:
1) Download the latest version of docker. If you are running windows 8 I believe you need to download Docker Toolbox
2) When docker is installed open a terminal window (like PowerShell in windows) and issue the Docker Pull Command in the upper right hand corner of the screen. Components should start downloading.
3) Once the download is complete,in your terminal issue
"docker run -d --name spark_train -p 8888:8888 jmleon/spark-training " (without the quotes)
4) Issue "docker-machine ip default". if there is an ip address returned, then open a browser and navigate to "<the ip that was returned>:8888". If there was no ip returned navigate to "localhost:8888". The Jupyter notebook should load.