aggipp/news-please-cc
10K+
Image to crawl from common crawl news crawl (CCNC) using news-please.
If you are using this container, which is based on news-please, please cite our paper (ResearchGate, Mendeley):
@InProceedings{Hamborg2017,
author = {Hamborg, Felix and Meuschke, Norman and Breitinger, Corinna and Gipp, Bela},
title = {news-please: A Generic News Crawler and Extractor},
year = {2017},
booktitle = {Proceedings of the 15th International Symposium of Information Science},
location = {Berlin},
editor = {Gaede, Maria and Trkulja, Violeta and Petra, Vivien},
pages = {218--223},
month = {March}
}
You can find more information on this and other news projects on our website.
docker pull aggipp/news-please-cc