Public Repository

Last pushed: 2 years ago
Short Description
PDF Parse Demo based on Cloudera Quickstart image.
Full Description

This image pulls public PDF documents from S3 and loads them into HBase for text mining.
The foundation for the image is described in the Cloudera's Blog:

Docker Pull Command