- MultiQC: Summarize analysis results for multiple tools and samples in a single report
- Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
- Bioinformatics (2016)
- doi: 10.1093/bioinformatics/btw354
- PMID: 27312411
-
Statistics from results generated using --quantMode GeneCounts. The three tabs show counts for unstranded RNA-seq, counts for the 1st read strand aligned with RNA and counts for the 2nd read strand aligned with RNA.
-
-
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
HISAT2
-
HISAT2 is a fast and sensitive alignment program for mapping NGS reads (both DNA and RNA) against a reference genome or population of reference genomes.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
Tophat
-
Tophat is a fast splice junction mapper for RNA-Seq reads. It aligns RNA-Seq reads to mammalian-sized genomes.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
Bowtie 2
-
Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences.
-
-
-
-
-
-
-
-
-
This plot shows the number of reads aligning to the reference in different ways.
-
-
-
-
There are 3 possible types of alignment:
-SE Mapped uniquely: Read has only one occurence in the reference genome.
-SE Multimapped: Read has multiple occurence.
-* SE No aligned: Read has no occurence.
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
Cutadapt
-
Cutadapt is a tool to find and remove adapter sequences, primers, poly-Atails and other types of unwanted sequence from your high-throughput sequencing reads.
-
-
-
-
-
-
-
This plot shows the number of reads with certain lengths of adapter trimmed.
- Obs/Exp shows the raw counts divided by the number expected due to sequencing errors. A defined peak
- may be related to adapter length. See the
- cutadapt documentation
- for more information on how these numbers are generated.
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Plot Table Data
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Regex Help
-
-
-
Toolbox search strings can behave as regular expressions (regexes). Click a button below to see an example of it in action. Try modifying them yourself in the text box.
- MultiQC: Summarize analysis results for multiple tools and samples in a single report
- Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
- Bioinformatics (2016)
- doi: 10.1093/bioinformatics/btw354
- PMID: 27312411
-
Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.
-
-
-
-
-
-
-
-
Sort
-
Visible
-
Group
-
Column
-
Description
-
ID
-
Scale
-
-
-
-
-
-
||
-
-
-
-
QUAST
-
N50 (Kbp)
-
N50 is the contig length such that using longer or equal length contigs produces half (50%) of the bases of the assembly.
-
N50
-
None
-
-
-
||
-
-
-
-
QUAST
-
N75 (Kbp)
-
N75 is the contig length such that using longer or equal length contigs produces 75% of the bases of the assembly
-
N75
-
None
-
-
-
||
-
-
-
-
QUAST
-
L50 (K)
-
L50 is the number of contigs larger than N50, i.e. the minimum number of contigs comprising 50% of the total assembly length.
-
L50
-
None
-
-
-
||
-
-
-
-
QUAST
-
L75 (K)
-
L75 is the number of contigs larger than N75, i.e. the minimum number of contigs comprising 75% of the total assembly length.
-
L75
-
None
-
-
-
||
-
-
-
-
QUAST
-
Largest contig (Kbp)
-
The size of the largest contig of the assembly
-
Largest contig
-
None
-
-
-
||
-
-
-
-
QUAST
-
Length (Mbp)
-
The total number of bases in the assembly.
-
Total length
-
None
-
-
-
||
-
-
-
-
QUAST
-
Misassemblies
-
The number of positions in the assembled contigs where the left flanking sequence aligns over 1 kbp away from the right flanking sequence on the reference (relocation) or they overlap on more than 1 kbp (relocation) or flanking sequences align on different strands (inversion) or different chromosomes (translocation).
-
# misassemblies
-
None
-
-
-
||
-
-
-
-
QUAST
-
Mismatches/100kbp
-
The number of mismatches per 100 kbp
-
# mismatches per 100 kbp
-
None
-
-
-
||
-
-
-
-
QUAST
-
Indels/100kbp
-
The number of indels per 100 kbp
-
# indels per 100 kbp
-
None
-
-
-
||
-
-
-
-
QUAST
-
Genes
-
# Genes
-
# genes
-
gene_count
-
-
-
||
-
-
-
-
QUAST
-
Genes (Partial)
-
# Genes (Partial)
-
# genes_partial
-
gene_count
-
-
-
||
-
-
-
-
QUAST
-
Genome Fraction
-
The total number of aligned bases in the reference, divided by the genome size.
-
Genome fraction (%)
-
None
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
- Number of Contigs
-
-
-
-
This plot shows the number of contigs found for each assembly, broken
- down by length.
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
RSeQC
-
RSeQC package provides a number of useful modules that can comprehensively evaluate high throughput RNA-seq data.
-
-
-
-
-
-
-
- Read GC Content
-
-
-
-
read_GC calculates a histogram of read GC content.
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
BUSCO
-
BUSCO assesses genome assembly and annotation completeness with Benchmarking Universal Single-Copy Orthologs.
-
-
-
-
-
-
-
- Lineage: fungi_odb9
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
deepTools
-
deepTools is a suite of tools to process and analyze deep sequencing data.
-
-
-
-
-
-
-
- Filtering metrics
-
-
-
-
Estimated percentages of alignments filtered independently for each setting in estimateReadFiltering
Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.
-
-
-
-
-
-
-
-
Sort
-
Visible
-
Group
-
Column
-
Description
-
ID
-
Scale
-
-
-
-
-
-
||
-
-
-
-
deepTools bamPEFragmentSize
-
# Sampled
-
Number of fragments sampled
-
Frag. Sampled
-
None
-
-
-
||
-
-
-
-
deepTools bamPEFragmentSize
-
Min
-
Minimum fragment length
-
Frag. Len. Min.
-
frag_length
-
-
-
||
-
-
-
-
deepTools bamPEFragmentSize
-
1st Quartile
-
1st quartile fragment length
-
Frag. Len. 1st. Qu.
-
frag_length
-
-
-
||
-
-
-
-
deepTools bamPEFragmentSize
-
Mean
-
Mean fragment length
-
Frag. Len. Mean
-
frag_length
-
-
-
||
-
-
-
-
deepTools bamPEFragmentSize
-
Median
-
Median fragment length
-
Frag. Len. Median
-
frag_length
-
-
-
||
-
-
-
-
deepTools bamPEFragmentSize
-
3rd Quartile
-
3rd quartile fragment length
-
Frag. Len. 3rd Qu.
-
frag_length
-
-
-
||
-
-
-
-
deepTools bamPEFragmentSize
-
Max
-
Maximum fragment length
-
Frag. Len. Max
-
frag_length
-
-
-
||
-
-
-
-
deepTools bamPEFragmentSize
-
Std. Dev.
-
Fragment length standard deviation
-
Frag. Len. Std.
-
frag_length
-
-
-
||
-
-
-
-
deepTools bamPEFragmentSize
-
MAD
-
Fragment length median absolute deviation
-
Frag. Med. Abs. Dev.
-
frag_length
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
- Read/fragment length distribution
-
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Signal enrichment per feature
-
-
-
-
Signal enrichment per feature according to plotEnrichment
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Fingerprint
-
-
-
-
Signal fingerprint according to plotFingerprint
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
SnpEff
-
SnpEff is a genetic variant annotation and effect prediction toolbox. It annotates and predicts the effects of variants on genes (such as amino acid changes).
-
-
-
-
-
-
-
- Variants by Genomic Region
-
-
-
-
-
-
The stacked bar plot shows locations of detected variants in the genome and the number of variants for each location.
-
-
-
-
The upstream and downstream interval size to detect these genomic regions is 5000bp by default.
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Variant Effects by Impact
-
-
-
-
-
-
The stacked bar plot shows the putative impact of detected variants and the number of variants for each impact.
-
-
-
-
There are four levels of impacts predicted by SnpEff:
High: High impact (like stop codon)
Moderate: Middle impact (like same type of amino acid substitution)
Low: Low impact (ie silence mutation)
Modifier: No impact
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Variant Effects by Class
-
-
-
-
-
-
The stacked bar plot shows the effect of variants at protein level and the number of variants for each effect type.
-
-
-
-
This plot shows the effect of variants on the translation of the mRNA as protein. There are three possible cases:
Silent: The amino acid does not change.
Missense: The amino acid is different.
Nonsense: The variant generates a stop codon.
-
-
-
Error - was not able to plot data.
-
-
-
-
-
-
-
-
-
-
- Variant Qualities
-
-
-
-
-
-
The line plot shows the quantity as function of the variant quality score.
-
-
-
-
The quality score corresponds to the QUAL column of the VCF file. This score is set by the variant caller.
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
GATK
-
GATK is a toolkit offering a wide variety of tools with a primary focus on variant discovery and genotyping.
-
-
-
-
-
-
-
- Observed Quality Scores
-
-
-
-
-
-
This plot shows the distribution of base quality scores in each sample before and after base quality score recalibration (BQSR). Applying BQSR should broaden the distribution of base quality scores.
Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.
-
-
-
-
-
-
-
-
Sort
-
Visible
-
Group
-
Column
-
Description
-
ID
-
Scale
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
HTSeq Count
-
HTSeq Count is part of the HTSeq Python package - it takes a file with aligned sequencing reads, plus a list of genomic features and counts how many reads map to each feature.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
Bcftools
-
Bcftools contains utilities for variant calling and manipulating VCFs and BCFs.
-
-
-
-
-
-
-
- Variant Substitution Types
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Indel Distribution
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Variant depths
-
-
-
-
Read depth support distribution for called variants
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
featureCounts
-
Subread featureCounts is a highly efficient general-purpose read summarization program that counts mapped reads for genomic features such as genes, exons, promoter, gene bodies, genomic bins and chromosomal locations.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
Picard
-
Picard is a set of Java command line tools for manipulating high-throughput sequencing data.
-
-
-
-
-
-
-
- Alignment Summary
-
-
-
-
Plase note that Picard's read counts are divided by two for paired-end data.
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Base Distribution
-
-
-
-
Plot shows the distribution of bases by cycle.
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- GC Coverage Bias
-
-
-
-
This plot shows bias in coverage across regions of the genome with varying GC content. A perfect library would be a flat line at y = 1.
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Insert Size
-
-
-
-
Plot shows the number of reads at a given insert size. Reads with different orientations are summed.
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Mark Duplicates
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- RnaSeqMetrics Assignment
-
-
-
-
Number of bases in primary alignments that align to regions in the reference genome.
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Gene Coverage
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
Prokka
-
Prokka is a software tool for the rapid annotation of prokaryotic genomes.
-
-
-
-
-
-
-
-
-
This barplot shows the distribution of different types of features found in each contig.
-
-
-
-
Prokka can detect different features:
-
-
CDS
-
rRNA
-
tmRNA
-
tRNA
-
miscRNA
-
signal peptides
-
CRISPR arrays
-
-
This barplot shows you the distribution of these different types of features found in each contig.
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
Samblaster
-
Samblaster is a tool to mark duplicates and extract discordant and split reads from sam files.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
Samtools
-
Samtools is a suite of programs for interacting with high-throughput sequencing data.
-
-
-
-
-
-
-
- Percent Mapped
-
-
-
-
-
-
Alignment metrics from samtools stats; mapped vs. unmapped reads.
-
-
-
-
For a set of samples that have come from the same multiplexed library,
-similar numbers of reads for each sample are expected. Large differences in numbers might
-indicate issues during the library preparation process. Whilst large differences in read
-numbers may be controlled for in downstream processings (e.g. read count normalisation),
-you may wish to consider whether the read depths achieved have fallen below recommended
-levels depending on the applications.
-
Low alignment rates could indicate contamination of samples (e.g. adapter sequences),
-low sequencing quality or other artefacts. These can be further investigated in the
-sequence level QC (e.g. from FastQC).
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Alignment metrics
-
-
-
-
This module parses the output from samtools stats. All numbers in millions.
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
- Samtools Flagstat
-
-
-
-
This module parses the output from samtools flagstat. All numbers in millions.
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
Bamtools
-
Bamtools provides both a programmer's API and an end-user's toolkit for handling BAM files.
-
-
-
-
-
-
-
- Bamtools Stats
-
-
-
-
-
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
VCFTools
-
VCFTools is a program for working with and reporting on VCF files.
-
-
-
-
-
-
-
- TsTv by Qual
-
-
-
-
-
-
Plot of TSTV-BY-QUAL - the transition to transversion ratio as a function of SNP quality from the output of vcftools TsTv-by-qual.
-
-
-
-
Transition is a purine-to-purine or pyrimidine-to-pyrimidine point mutations.
-Transversion is a purine-to-pyrimidine or pyrimidine-to-purine point mutation.
-Quality here is the Phred-scaled quality score as given in the QUAL column of VCF.
-Note: only bi-allelic SNPs are used (multi-allelic sites and INDELs are skipped.)
-Refer to Vcftools's manual (https://vcftools.github.io/man_latest.html) on --TsTv-by-qual
-
-
-
loading..
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Plot Table Data
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Regex Help
-
-
-
Toolbox search strings can behave as regular expressions (regexes). Click a button below to see an example of it in action. Try modifying them yourself in the text box.
- MultiQC: Summarize analysis results for multiple tools and samples in a single report
- Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
- Bioinformatics (2016)
- doi: 10.1093/bioinformatics/btw354
- PMID: 27312411
-
Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.
-
-
-
-
-
-
-
-
Sort
-
Visible
-
Group
-
Column
-
Description
-
ID
-
Scale
-
-
-
-
-
-
||
-
-
-
-
Cutadapt
-
% Trimmed
-
% Total Base Pairs trimmed
-
percent_trimmed
-
None
-
-
-
||
-
-
-
-
Flexbar
-
% bp Trimmed
-
% Total Base Pairs removed
-
removed_bases_pct
-
None
-
-
-
||
-
-
-
-
Trimmomatic
-
% Dropped
-
% Dropped reads
-
dropped_pct
-
None
-
-
-
||
-
-
-
-
SortMeRNA
-
% rRNA
-
Percentage of reads matched to a SortMeRNA database
-
rRNA_pct
-
None
-
-
-
||
-
-
-
-
FastQC
-
% Dups
-
% Duplicate Reads
-
percent_duplicates
-
None
-
-
-
||
-
-
-
-
FastQC
-
% GC
-
Average % GC Content
-
percent_gc
-
None
-
-
-
||
-
-
-
-
FastQC
-
Length
-
Average Sequence Length (bp)
-
avg_sequence_length
-
None
-
-
-
||
-
-
-
-
FastQC
-
% Failed
-
Percentage of modules failed in FastQC report (includes those not plotted here)
-
percent_fails
-
None
-
-
-
||
-
-
-
-
FastQC
-
M Seqs
-
Total Sequences (millions)
-
total_sequences
-
read_count
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Cutadapt
-
Cutadapt is a tool to find and remove adapter sequences, primers, poly-Atails and other types of unwanted sequence from your high-throughput sequencing reads.
-
-
-
-
-
-
-
This plot shows the number of reads with certain lengths of adapter trimmed.
- Obs/Exp shows the raw counts divided by the number expected due to sequencing errors. A defined peak
- may be related to adapter length. See the
- cutadapt documentation
- for more information on how these numbers are generated.
2 samples had less than 1% of reads made up of overrepresented sequences
-
-
-
-
-
-
-
-
-
-
- Adapter Content
-
-
-
-
The cumulative percentage count of the proportion of your library which has seen each of the adapter sequences at each position. See the FastQC help. Only samples with ≥ 0.1% adapter contamination are shown.
-
-
-
No samples found with any adapter contamination > 0.1%
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Plot Table Data
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Regex Help
-
-
-
Toolbox search strings can behave as regular expressions (regexes). Click a button below to see an example of it in action. Try modifying them yourself in the text box.