loum/hadoop-hive

By loum

Updated over 3 years ago

Quick and easy way to get Hive running in Hadoop pseudo distributed mode on Docker

Image
2

373

Supported tags and respective Dockerfile links

Apache Hadoop version 3.3.1|Apache Hive 3.1.2

Apache Hadoop version 3.2.2|Apache Hive 3.1.2

Quick reference

Quick start

Start the container

docker run --rm -d --name hadoop-hive\
 --env YARN_SITE__YARN_NODEMANAGER_AUX_SERVICES=mapreduce_shuffle\
 --env YARN_SITE__YARN_LOG_AGGREGATION_ENABLE=true\
 --publish 9000:9000\
 --publish 8088:8088\
 --publish 8042:8042\
 --publish 9870:9870\
 --publish 10000:10000\
 --publish 10002:10002\
 --publish 19888:19888\
 loum/hadoop-hive:latest

Hadoop ResourceManager is available at http://localhost:8088

HiveServer2 web UI is available at http://localhost:10002

Start shell

docker exec -ti hadoop-hive bash

Check the Hadoop Command Reference

Start beeline

docker exec -ti hadoop-hive bash -c \
 "HADOOP_HOME=/opt/hadoop /opt/hive/bin/beeline -u jdbc:hive2://localhost:10000"

Check the HiveServer2 Clients for more beeline commands.

Terminate container

docker stop hadoop-hive

Docker Pull Command

docker pull loum/hadoop-hive