Public Repository

Last pushed: 2 years ago
Short Description
Base image for containers running scrapy crawler.
Full Description

Base image for containers running scrapy crawler with content type detection and content extraction. Image is based on python 2.7 image with following additional packages:

  • java8
  • python packages: pyyaml simplejson justext tika Scrapy nltk numpy pymongo==2.8
Docker Pull Command
Owner
vizpai

Comments (0)