ukwa

The UK Web Archive

Community Organization

The British Library

London

Displaying 1 to 25 of 67 repositories

ukwa/heritrix

10K+

1

By ukwa

Updated 5 months ago
Dockerized build of our current production Heritrix
Data Science
Machine Learning & AI
Web Analytics
ukwa/webarchive-discovery

10K+

0

By ukwa

Updated 7 months ago
Databases & Storage
Monitoring & Observability
Web Analytics
ukwa/webarchive-discovery-solr

10K+

0

By ukwa

Updated 7 months ago
A Dockerised version of Solr, including a core based on the webarchive-discovery schema.
Data Science
Databases & Storage
Web Analytics
ukwa/clamd

50K+

0

By ukwa

Updated 9 months ago
ClamD in a docker container.
Integration & Delivery
Monitoring & Observability
Security
ukwa/airflow

1.2K

0

By ukwa

Updated 10 months ago
Apache Airflow container with some additional dependencies
Data Science
Integration & Delivery
Monitoring & Observability
ukwa/robot-framework

294

0

By ukwa

Updated 10 months ago
Robot Framework environment for UKWA
Languages & Frameworks
Integration & Delivery
ukwa/python-w3act

651

0

By ukwa

Updated 10 months ago
Python scripts for working with the W3ACT service
Content Management System
Security
ukwa/ukwa-ui

100K+

0

By ukwa

Updated a year ago
Docker deployment of our collections front-end.
Content Management System
Web Servers
ukwa/ukwa-notebook-apps

846

0

By ukwa

Updated a year ago
Internal reporting apps based on Jupyter notebooks run with Voila.
Data Science
Languages & Frameworks
Machine Learning & AI
ukwa/w3act

10K+

0

By ukwa

Updated a year ago
WWW Annotation and Curation Tool for web archiving.
Content Management System
Databases & Storage
Integration & Delivery
ukwa/superset

129

0

By ukwa

Updated a year ago
A Dockerised Apache Superset including Solr support.
Data Science
Web Analytics
ukwa/crawl-db

40

0

By ukwa

Updated a year ago
ukwa/crawl-streams

2.4K

0

By ukwa

Updated a year ago
Tools for working with our crawl event streams
Data Science
Monitoring & Observability
Web Analytics
ukwa/ukwa-reports

51

0

By ukwa

Updated a year ago
ukwa/ukwa-manage

2.4K

0

By ukwa

Updated a year ago
UKWA crawl lifecycle management tasks.
Content Management System
Data Science
Databases & Storage
ukwa/ukwa-pywb

100K+

0

By ukwa

Updated a year ago
UKWA's version of pywb, with UKWA-specific code and configuration.
Content Management System
Data Science
Web Servers
ukwa/hapy

218

0

By ukwa

Updated a year ago
Dockerized version of our Python command line client for Heritrix3 API operations.
Languages & Frameworks
Integration & Delivery
Web Analytics
ukwa/epub-streamer

969

0

By ukwa

Updated 2 years ago
ukwa/ukwa-site

476

0

By ukwa

Updated 2 years ago
ukwa/ukwa-access-api

2.1K

0

By ukwa

Updated 2 years ago
Service for discovering and interacting with APIs that allow access to UKWA content.
ukwa/webrender-puppeteer

10K+

0

By ukwa

Updated 3 years ago
Web page rendering utility based on Google's Puppeteer
Languages & Frameworks
Integration & Delivery
Web Analytics
ukwa/warc-server

1.7K

0

By ukwa

Updated 3 years ago
Routes WARC requests to the right place
Data Science
Web Servers
Web Analytics