Public | Automated Build

Last pushed: 8 months ago
Short Description
Dockerized Hadoop based on
Full Description

Apache Hadoop 2.7.1 in a Docker container

Hadoop in a pseudo distributed mode in a Docker container, forked from sequenceiq. The added values are:

  • Bug fix
  • Use openjdk
  • Tez installation

Start a container

Make sure that SELinux is disabled on the host. If you are using boot2docker you don't need to do anything.

docker run -P -it ouyi/hadoop-docker /etc/ -bash


You can run one of the stock examples:

# run the mapreduce
bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.1.jar grep input output 'dfs[a-z.]+'

# check the output
bin/hadoop fs -cat output/{*}
Docker Pull Command
Source Repository