Public | Automated Build

Last pushed: 2 years ago
Short Description
Ubuntu 15.04 LTS based spark Images. Use `docker-compose` to create spark standalone cluster.
Full Description

Data Science Toolbox with Docker


The spark docker container is based on ubuntu:15.04 image and forked script from

The current setting is:

  1. Spark 1.5.2
  2. Hadoop 2.6.2

Change the ENV on top of the Dockerfile to rebuild with other version.

single local mode

Launch the docker container with current working directory mouted as /data by entrypoint

spark-shell by

docker run --rm -it -p 4040:4040 -v $(pwd):/data rickyking/spark bin/spark-shell

Or pyspark by

docker run --rm -it -p 4040:4040 -v $(pwd):/data rickyking/spark bin/pyspark

spark standalone cluster mode

This requires docker-compose, the prepared yaml file is in spark\docker-compose.yml. Launch by docker-compose up.

Docker Pull Command