Public Repository

Last pushed: a year ago
Short Description
This image provides a data science environment in rstudio with spark, h2o and the tidyverse.
Full Description

Image intended to experiment with spark+R using Rstudio and sparklyr.
Machine learning using h2o sparkling water.

ubuntu, rstudio, java installs along with the following R packages :
sparklyr
dplyr
stringr
devtools

apart from the packages h2o depends on.

run container using

docker run -d -p <portID>:8787 --name <name> -v <local volume with data> pbhogale/sparklyr_rstudio

where the local volume with data will be mapped to /home/rstudio/data in the container.

to refresh the spark installation, run

library(sparklyr)
spark_install()

after running container.

Docker Pull Command
Owner
pbhogale

Comments (0)