Public Repository

Last pushed: 3 years ago
Short Description
Get Up and Running in 3'ish Easy Steps
Full Description

Dockerfile as follows:

FROM ubuntu:14.04

RUN apt-get update

RUN apt-get install -y default-jdk

ENV JAVA_HOME=/usr/lib/jvm/default-java

RUN apt-get install -y git

RUN apt-get install -y maven

RUN git clone git://git.apache.org/samza-hello-samza.git hello-samza

RUN apt-get install -y curl

RUN cd hello-samza && bin/grid bootstrap

FROM http://samza.apache.org/startup/hello-samza/0.9/

1.) start the container with:

docker run --rm --name hello-samza --net host -it -p 8088:8088 anaerobic/hello-samza bash

2.) build the "environment" and "deploy" the hello-samza tarball with:

cd hello-samza

bin/grid bootstrap

mvn clean package

mkdir -p deploy/samza

tar -xvf ./target/hello-samza-0.9.0-dist.tar.gz -C deploy/samza

3.) Run the jobs (and check their outputs) with:

deploy/samza/bin/run-job.sh --config-factory=org.apache.samza.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-feed.properties

deploy/kafka/bin/kafka-console-consumer.sh  --zookeeper localhost:2181 --topic wikipedia-raw

deploy/samza/bin/run-job.sh --config-factory=org.apache.samza.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-parser.properties

deploy/samza/bin/run-job.sh --config-factory=org.apache.samza.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-stats.properties

deploy/kafka/bin/kafka-console-consumer.sh  --zookeeper localhost:2181 --topic wikipedia-edits

deploy/kafka/bin/kafka-console-consumer.sh  --zookeeper localhost:2181 --topic wikipedia-stats
Docker Pull Command
Owner
anaerobic