Public Repository

Last pushed: 6 months ago
Short Description
Container with preinstalled Kafka, Spark streaming (PySpark), and Cassandra.
Full Description

This Dockerfile sets up a complete streaming environment for experimenting with Kafka, Spark streaming (PySpark), and Cassandra. It installs

  • Kafka 0.10.2.1
  • Spark 2.1.1 for Scala 2.11
  • Cassandra 3.7

It additionnally installs

  • Anaconda 2.4.4 Python distribution
  • Jupyter notebook for Python

See https://github.com/Yannael/kafka-sparkstreaming-cassandra for details.

Docker Pull Command
Owner
yannael