<img src="assets/pythia_logo.png" width="400" alt="pythia logo" />
Novelty detection in text corpora
Pythia is Lab41's exploration of approaches to novel content detection. We are interested in making it easier to tell when a document coming into a corpus has something new to say.
We welcome your contributions (see our contributor guidelines) and attention.
docker build -t lab41/pythia . # runs tests and builds project image
Run a quick experiment
docker run -it lab41/pythia experiments/experiments.py with 'XGB=True' 'BOW_APPEND=True' 'BOW_PRODUCT=True'
Our code is written in Python 3. envs/make_envs.sh will install the necessary dependencies on a Debian/Ubuntu system with Anaconda installed.