weihanli/puppeteer
Puppeteer is a Node library which provides a high-level API to control Chrome or Chromium over the DevTools Protocol. Puppeteer runs headless by default, but can be configured to run full (non-headless) Chrome or Chromium.
See https://github.com/GoogleChrome/puppeteer
write your own docker file based on this image
FROM weihanli/puppeteer:latest
COPY --chown=pptruser:pptruser ./src .
docker compose
sample docker compose file
version: '2'
services:
puppeteer-crawler:
image: weihanli/puppeteer:latest
ports:
- 10240:3000
volumes:
- ./src:/app
the default command execute is yarn start
see the docker file here
docker pull weihanli/puppeteer