NGI-RNAseq is a bioinformatics analysis pipeline used for RNA sequencing data.
It pre-processes raw data from FastQ inputs (FastQC, Trim Galore!), aligns the reads (STAR or HiSAT2), generates gene counts (featureCounts, StringTie) and performs extensive quality-control on the results (RSeQC, dupRadar, Preseq, edgeR, MultiQC). See the output documentation for more details of the results.
The pipeline is built using Nextflow, a bioinformatics workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker / singularity containers making installation trivial and results highly reproducible.
The NGI-RNAseq pipeline comes with documentation about the pipeline, found in the
- Pipeline configuration
- Running the pipeline
- Output and how to interpret the results
These scripts were written at the National Genomics Infrastructure, part of SciLifeLab in Stockholm, Sweden.
The pipeline was developed by Phil Ewels (@ewels) and Rickard Hammarén (@Hammarn). Docker and AWS integration was led by Denis Moreno (@Galithil) and Phil Ewels (@ewels).
NGI-RNAseq is now used by a number of core sequencing and bioinformatics facilities. Some of these are listed below. If you use this pipeline too, please let us know in an issue and we will add you to the list.
<td><img src="https://raw.githubusercontent.com/SciLifeLab/NGI-RNAseq/master/docs/images/NGI_logo.png" width="200"></td>
<td>National Genomics Infrastructure (NGI), Sweden</td>
<td><img src="https://raw.githubusercontent.com/SciLifeLab/NGI-RNAseq/master/docs/images/QBiC_logo.png" width="200"></td>
<td>Quantitative Biology Center (QBiC), Germany</td>