Public | Automated Build

Last pushed: 2 years ago
Short Description
Short description is empty for this repo.
Full Description

Docker OpenWayback

A Docker container image for Open Wayback.

Following command will run a container with default Wayback configuration:

$ docker run --rm -it -v /tmp/warc-files:/wayback/warcs -p 8080:8080 ibnesayeed/wayback:2.1.0

The above command assumes that data files *.[w]arc[.gz] are kept in /tmp/warc-files directory of the host machine. Change the location with absolute path of the data directory as necessary, but make sure that it is mounted in the /wayback/warcs volume of the docker container from where it will be picked up automatically by the BDB indexer.

Internally the container will run the application on port 8080, but it can be bound to a different port of the host machine by changing 8080:8080 to SOMEHOSTPORT:8080.

If everything goes well, you should be able to access your collection. It may take a while before all the data files are indexed.

$ curl -i http://0.0.0.0:8080/wayback/[URI-R]

Customization

Following environment variables have been made available to override some basic configurations of the wayback.xml file without replacing it with a new custom file. It is possible to replace configuration files by mounting files from the host system into /usr/local/tomcat/webapps/ROOT/WEB-INF volume of the container at run time.

  • WAYBACK_URL_SCHEME
  • WAYBACK_URL_HOST
  • WAYBACK_URL_PORT
  • WAYBACK_URL_PREFIX

Following is an example that will set the host of Wayback to example.com. Pass multiple environment variables by adding multiple -e command line options.

$ docker run --rm -it -e "WAYBACK_URL_HOST=example.com" -v /tmp/warc-files:/wayback/warcs -p 8080:8080 ibnesayeed/wayback:2.1.0
Docker Pull Command
Owner
ibnesayeed
Source Repository

Comments (0)