debian:jessie based Spark and Zeppelin Docker container.
This image is large and opinionated. It contains:
by default the credentials file can be found here , it is also mapped to the docker image under
So please update your credentials file to interact with your BigQuery
To start Zeppelin pull the
latest image and run the container:
docker pull samelamin/docker-zeppelin docker run --rm -p 8080:8080 samelamin/docker-zeppelin
Zeppelin will be running at
You can use docker-compose to easily run Zeppelin in more complex configurations. See this project's
./examples directory for examples of using Zeppelin with
docker-compose to :
- read and write from local data files
- read and write documents in ElasticSearch
onbuild container is still a part of this project, but I have no plans to keep it updated. See the
onbuild directory to view its
To use it, create a new
Dockerfile based on
samelamin/docker-zeppelin:onbuild and supply a new, executable
install.sh file in the same directory. It will override the base one via Docker's ONBUILD instruction.
The steps, expressed here as a script, can be as simple as:
#!/bin/bash cat > ./Dockerfile <<DOCKERFILE FROM samelamin/docker-zeppelin:onbuild ENV ZEPPELIN_MEM="-Xmx1024m" DOCKERFILE cat > ./install.sh <<INSTALL git pull mvn clean package -DskipTests \ -Pspark-1.5 \ -Dspark.version=1.5.2 \ -Phadoop-2.2 \ -Dhadoop.version=2.0.0-cdh4.2.0 \ -Pyarn INSTALL docker build -t my_zeppelin .
Copyright 2016 samelamin.
Licensed under the Apache License, Version 2.0: http://www.apache.org/licenses/LICENSE-2.0