Public Repository

Last pushed: 2 years ago
Short Description
CIP to parse documents such as PDF or DOC for key/value properties.
Full Description

Introduction

The purpose of this CIP is to process an incoming binary document by parsing it and extracting a set of key/value properties before publishing the parsed document for further processing by other CIPs.

Dockerfile

# Docker container for ciao-docs-parser CIP
# Version 0.9
FROM ubuntu:14.04
MAINTAINER HSCIC
RUN apt-get update
RUN apt-get install -y openjdk-7-jdk wget unzip
# Get ciao-docs-parser zipped binary
RUN wget http://hcdn-build1.cloudapp.net/view/CIAO/job/ciao-docs-parser/ws/ciao-docs-parser/target/ciao-docs-parser-1.0.0-SNAPSHOT-bin.zip
# Unzip into /opt/ciao-docs-parser
RUN unzip ciao-docs-parser-1.0.0-SNAPSHOT-bin.zip -d /opt/ciao-docs-parser
# Standard ciao configuration file folder
VOLUME ["/root/.ciao"]
# Input file folder
VOLUME ["/opt/ciao-docs-parser/input"]
# Error file folder
VOLUME ["/opt/ciao-docs-parser/error"]
# In-progress file folder
VOLUME ["/opt/ciao-docs-parser/in-progress"]
# Jolokia Agent port
Expose 8778
# Hazelcast port
Expose 5701
# Entry point
ENTRYPOINT ["/opt/ciao-docs-parser/ciao-docs-parser-1.0.0-SNAPSHOT/run.sh"]

Run

sudo docker run --name="ciao-docs-parser-01" -d -v /home/makelly/ciao-docs-parser/.ciao:/root/.ciao -v /home/makelly/ciao-docs-parser/input:/root/input  -v /home/makelly/ciao-docs-parser/error:/root/error -v /home/makelly/ciao-docs-parser/in-progress:/root/in-progress hscic/ciao-docs-parser
Docker Pull Command
Owner
hscic

Comments (0)