Public Repository

Last pushed: a year ago
Short Description
Implemented K Means Clustering
Full Description

There are three python files in /src and the data is in /data directory of the container.
To run the container and the particular file do the following:

docker pull singi/assignment5
docker run singi/assignment5 /opt/spark/bin/spark-submit /src/twoDClustering.py /data/random2D.txt 3
docker run singi/assignment5 /opt/spark/bin/spark-submit /src/threeDClustering.py /data/random3D.txt 3
docker run singi/assignment5 /opt/spark/bin/spark-submit /src/fourDClustering.py /data/random4D.txt 3
docker run singi/assignment5 /opt/spark/bin/spark-submit /src/twoDClustering.py /data/mickeySmallOutput.txt 3
docker run singi/assignment5 /opt/spark/bin/spark-submit /src/twoDClustering.py /data/mickeyBigOutput.txt 3

Docker Pull Command
Owner
singi

Comments (0)