Public Repository

Last pushed: 2 years ago
Short Description
Spark 1.10 + Hadoop on Centos
Full Description

Spark 1.10 + Hadoop on Centos
Wordcount test:

1.) Load file to local filesystem (wget...)
2.) Move it HDFS
3.) Run wordcount "./bin/spark-submit test.py nameoftheinput.txt nameoftheoutput

test.py takes file names as arguments.

Docker Pull Command
Owner
immo

Comments (0)