An orchestration platform for Docker containers running data mining algorithms.
This project exposes a web interface to execute on demand data mining algorithms defined in Docker containers and implemented using any tool or language (R, Python, Java and more are supported).
It relies on a runtime environment containing Mesos and [Chronos])(https://mesos.github.io/chronos/) to control and execute the Docker containers over a cluster.
Usage
docker run --rm --env [list of environment variables] --link woken hbpmip/woken:2.3.9
where the environment variables are:
CLUSTER_IP: Name of this server advertised in the Akka cluster
CLUSTER_PORT: Port of this server advertised in the Akka cluster
CLUSTER_NAME: Name of Woken cluster, default to 'woken'
WOKEN_PORT_8088_TCP_ADDR: Address of Woken master server
WOKEN_PORT_8088_TCP_PORT: Port of Woken master server, default to 8088
DOCKER_BRIDGE_NETWORK: Name of the Docker bridge network. Default to 'bridge'
NETWORK_INTERFACE: IP address for listening to incoming HTTP connections. Default to '0.0.0.0'
WEB_SERVICES_PORT: Port for the HTTP server in Docker container. Default to 8087
WEB_SERVICES_SECURE: If yes, HTTPS with a custom certificate will be used. Default to no.
WEB_SERVICES_USER: Name used to protected the web servers protected with HTTP basic authentication. Default to 'admin'
WEB_SERVICES_PASSWORD: Password used to protected the web servers protected with HTTP basic authentication.
LOG_LEVEL: Level for logs on standard output, default to WARNING
LOG_CONFIG: on/off - log configuration on start, default to off
VALIDATION_MIN_SERVERS: minimum number of servers with the 'validation' functionality in the cluster, default to 0
SCORING_MIN_SERVERS: minimum number of servers with the 'scoring' functionality in the cluster, default to 0
KAMON_ENABLED: enable monitoring with Kamon, default to no
ZIPKIN_ENABLED: enable reporting traces to Zipkin, default to no. Requires Kamon enabled.
ZIPKIN_IP: IP address to Zipkin server. Requires Kamon and Zipkin enabled.
ZIPKIN_PORT: Port to Zipkin server. Requires Kamon and Zipkin enabled.
PROMETHEUS_ENABLED: enable reporting metrics to Prometheus, default to no. Requires Kamon enabled.
PROMETHEUS_IP: IP address to Prometheus server. Requires Kamon and Prometheus enabled.
PROMETHEUS_PORT: Port to Prometheus server. Requires Kamon and Prometheus enabled.
SIGAR_SYSTEM_METRICS: Enable collection of metrics of the system using Sigar native library, default to no. Requires Kamon enabled.
JVM_SYSTEM_METRICS: Enable collection of metrics of the JVM using JMX, default to no. Requires Kamon enabled.
MINING_LIMIT: Maximum number of concurrent mining operations. Default to 100
EXPERIMENT_LIMIT: Maximum number of concurrent experiments. Default to 100