# HG changeset patch # User iuc # Date 1621678398 0 # Node ID eee2da03d1f532557b32050302a6c119c9ec04ee # Parent 3b1e7c170b10afe01883f962fbe85f32e60b52f6 "planemo upload for repository https://github.com/COMBAT-TB/tb_variant_filter commit ef1b4164979078a6624c5b81fa156755258fb5f8" diff -r 3b1e7c170b10 -r eee2da03d1f5 tb_variant_filter.xml --- a/tb_variant_filter.xml Thu Oct 10 17:59:02 2019 -0400 +++ b/tb_variant_filter.xml Sat May 22 10:13:18 2021 +0000 @@ -1,7 +1,7 @@ - + M. tuberculosis H37Rv VCF filter - 0.1.3 + 0.3.0 tb_variant_filter @@ -57,6 +57,7 @@ + @@ -125,6 +126,17 @@ + + + + + + + + + + + `_ - 2. `TBProfiler `_ list of antibiotic resistant genes - 3. `MTBseq `_ list of antibiotic resistant genes - 4. `UVP `_ list of repetitive loci in M. tuberculosis genome + 1. Refined Low Confidence (RLC) regions from `Marin et al 2021 `_ + 2. PE/PPE genes from `Fishbein et al 2015 `_ + 3. `TBProfiler `_ list of antibiotic resistant genes + 4. `MTBseq `_ list of antibiotic resistant genes + 5. `UVP `_ list of repetitive loci in M. tuberculosis genome 2. Filter by window around indels. Masks out variants within a certain distance (by default 5 bases) of an insertion or deletion site. 3. Filter by percentage of alternate allele bases. Mask out variants with less than a minimum percentage (by default 90%) alternative alleles. 4. Filter by depth of aligned reads. 5. Filter out all variants that are not SNV (single nucleotide variants). +For region filtering, the default choice is to use the PE/PPE and UVP regions to mask out variants. `Marin et al 2021 `_ +from Prof Maha Farhat's lab make a persuasive argument that their smaller list of Refined Low Confidence (RLC) regions is a better argument but this work has not yet been peer +review so it is included as an option that is not currently the default. + When used together the effects of the filters are added (i.e. a variant is masked out if it is masked by any of the filters). ]]>