Public | Automated Build

Last pushed: 2 years ago
Short Description
metaPGAP - metagenomic Pan Genome Analysis Pipeline for generating core genome from multiple strains
Full Description


metaPGAP is a pipeline for building core genome using PAN genome approach. This pipeline takes complete or draft genome assemblies and perform annotation using Prokka. The Prokka predictions then used to build core genome using get_homologues tool. For core genes phylogeny Mafft and RaXML used.

Full list of required softwares and dependencies:

metaPGAP steps:

(1). Download data from
(2). Genome Annotation using Prokka
(3). PAN genome analysis using get_homologues: BDBH, COG and OMCL
(4). Multiple sequence alignment of CORE genes
(5). Phylogenetic analysis of CORE genes using RaXML
(6). Visualization of phylogenetic tree using Newick tools

metaPGAP requirements:

Python (
BioPython (1.5 or higher) with NumPy (
Prokka (
get_homologues (
Mafft (
Perl (
Newick tools (

General Use


Docker PULL Command:

 docker pull mitulpatel/metapgap

Docker RUN Command:

  docker run -v `pwd`:/metaPGAP -w /metaPGAP mitulpatel/metapgap
Docker Pull Command
Source Repository