Data Science Toolbox for the Johns Hopkins University Data Science Specialization

Data Scientist's Toolbox

The data-scientists-toolbox repository makes it easy to create a Docker container for use during the Data Science Specialization.

Download and install Docker for Apple Mac OS X, GNU/Linux, or Microsoft Windows by following the instructions on the Docker website.
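
Before continuing, it can help to confirm that the Docker client is actually on the PATH. The following is a minimal sketch, not part of this repository; the helper names 'require_cmd' and 'check_docker' are hypothetical.

```shell
#!/bin/sh
# require_cmd (hypothetical helper): succeed only if the named command
# can be found on the PATH.
require_cmd() {
    command -v "$1" >/dev/null 2>&1
}

# check_docker (hypothetical helper): report whether the Docker client is
# installed, without starting any container.
check_docker() {
    if require_cmd docker; then
        echo "docker found: $(command -v docker)"
    else
        echo "docker not found; install it before continuing" >&2
        return 1
    fi
}
```

Calling 'check_docker' prints the location of the client, or returns a non-zero status if Docker is missing.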

Retrieve the data-scientists-toolbox build repository to create the container. Git, R, RStudio Server, and vim are installed as part of the image. Additional R packages can be added after the container is created.

$ git clone https://github.com/gdhorne/data-scientists-toolbox

A script named 'container.sh' manages the entire lifecycle of a container created from the image. Microsoft Windows users are advised to install Git Bash rather than the standard Git software because it provides a *nix-like command line environment.

Create the container, optionally mapping a host file system share for storage. The file system share name '/home/me/datascience' is user selectable and host file system dependent. If no local file system share is desired, simply omit the fourth argument '/home/me/datascience'. The container instance name, 'toolbox' in these instructions, is user selectable at time of creation.

$ ./container.sh create toolbox gdhorne/data-scientists-toolbox \
      /home/me/datascience

Example host share paths by platform:

Apple Mac OS X: /Users/username/directory
GNU/Linux: /home/username/directory
Microsoft Windows: /c/Users/directory (unverified)
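
The platform-dependent share path and the optional fourth argument can be sketched together as follows. This is an illustration only: the helper names 'host_share_path' and 'create_command' are hypothetical, the kernel names are those reported by 'uname -s', and the Windows form simply repeats the unverified pattern from the list above.

```shell
#!/bin/sh
# host_share_path (hypothetical helper): print a host share path for the
# given kernel name (as reported by `uname -s`), user name, and directory.
host_share_path() {
    kernel=$1
    user=$2
    dir=$3
    case "$kernel" in
        Darwin)       printf '/Users/%s/%s\n' "$user" "$dir" ;;
        Linux)        printf '/home/%s/%s\n' "$user" "$dir" ;;
        # Git Bash on Windows; path form copied from the list above, unverified.
        MINGW*|MSYS*) printf '/c/Users/%s\n' "$dir" ;;
        *)            return 1 ;;
    esac
}

# create_command (hypothetical helper): print the create command, leaving
# out the fourth argument when no share path is given.
create_command() {
    name=$1
    image=$2
    share=${3:-}
    if [ -n "$share" ]; then
        printf './container.sh create %s %s %s\n' "$name" "$image" "$share"
    else
        printf './container.sh create %s %s\n' "$name" "$image"
    fi
}
```

For example, 'create_command toolbox gdhorne/data-scientists-toolbox "$(host_share_path "$(uname -s)" "$USER" datascience)"' prints the command for the current platform so it can be reviewed before being run.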

Verify the container 'toolbox' has been successfully created and is running.

$ ./container.sh status
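
The output format of './container.sh status' is not described here, so as a cross-check one could ask Docker directly. The sketch below assumes the Docker CLI is available and that the container was named 'toolbox'; the helper names 'name_in_list' and 'is_running' are hypothetical.

```shell
#!/bin/sh
# name_in_list (hypothetical helper): read container names from standard
# input, one per line, and succeed only if one of them equals $1 exactly.
name_in_list() {
    grep -Fqx -- "$1"
}

# is_running (hypothetical helper): list the names of running containers
# with the Docker CLI and look for an exact match.
is_running() {
    docker ps --format '{{.Names}}' | name_in_list "$1"
}
```

A typical call would be 'is_running toolbox && echo "toolbox is running"'. Matching whole lines avoids a false positive from a container with a similar name such as 'toolbox2'.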

Stop the container 'toolbox'.

$ ./container.sh stop toolbox

Start the container 'toolbox'.

$ ./container.sh start toolbox
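
The stop and start commands above can be chained into a restart. This is a small sketch rather than a feature of 'container.sh'; the function name 'restart_container' and the 'CONTAINER_SH' variable are hypothetical.

```shell
#!/bin/sh
# Path to the lifecycle script; set CONTAINER_SH beforehand if it lives
# somewhere other than the current directory.
CONTAINER_SH=${CONTAINER_SH:-./container.sh}

# restart_container (hypothetical helper): stop the named container and
# start it again only if the stop succeeded.
restart_container() {
    "$CONTAINER_SH" stop "$1" && "$CONTAINER_SH" start "$1"
}
```

Running 'restart_container toolbox' from the repository directory issues the two commands in order.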

To learn more about the container lifecycle management features supported by 'container.sh', type:

$ ./container.sh --help

Applications

After creating the container these applications are accessible within a web browser.

Git:        Accessible via WeTTY and via RStudio integration

R:          Accessible via WeTTY

RStudio:    http://127.0.0.1:8787

            UserID: dst 
            Password: science

WeTTY:      http://127.0.0.1:8000

            UserID: dst 
            Password: science

            To enable the terminal/console management utility 
            type 'screen' and press ENTER.
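
To confirm from the host that both web applications are answering, something like the following could be used. It is a sketch that assumes 'curl' is installed on the host; the helper names 'app_url' and 'check_app' are hypothetical, and the URLs are the ones listed above.

```shell
#!/bin/sh
# app_url (hypothetical helper): map an application name to the URL
# listed above.
app_url() {
    case "$1" in
        rstudio) echo 'http://127.0.0.1:8787' ;;
        wetty)   echo 'http://127.0.0.1:8000' ;;
        *)       return 1 ;;
    esac
}

# check_app (hypothetical helper): print the HTTP status code reported by
# curl for the application. curl exits non-zero if nothing answers within
# five seconds.
check_app() {
    url=$(app_url "$1") || return 1
    curl --silent --output /dev/null --max-time 5 \
        --write-out '%{http_code}\n' "$url"
}
```

For instance, 'check_app rstudio' and 'check_app wetty' can be run once the container has started.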

Alternatively, the data-scientists-toolbox image provides a traditional command line interface, without WeTTY, to applications such as Git, R, and vim. For convenience, the terminal/console management utility 'screen' is installed and starts automatically.

$ ./container.sh attach toolbox

Press ENTER if the container's shell prompt does not appear. To exit the container and leave it running, press CTRL+P then CTRL+Q; this is the preferred method. To exit the container and stop it, type 'exit'.

Owner
gdhorne