annotate EMS_VariantDensityMapping.xml @ 2:8fe7a6efbc22

Uploaded
author gregory-minevich
date Tue, 27 Mar 2012 11:29:01 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
2
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
1 <tool id="ems_variant_density_mapping" name="CloudMap: EMS Variant Density Mapping">
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
2 <description>Map a mutation by linkage to regions of high mutation density using WGS data</description>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
3 <command interpreter="python">EMS_VariantDensityMapping.py --snp_vcf $snp_vcf --ylim $ylim --hist_color $hist_color --standardize $standardize --ems $ems --output $output </command>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
4 <inputs>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
5 <param name="snp_vcf" type="data" format="vcf" label="VCF of SNPs" help="Takes a VCF file of WGS SNPs present in a C.elegans mutant strain that has been backcrossed to its (pre-mutagenesis) starting strain"/>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
6 <param name="ylim" size = "15" type="integer" value="200" label="Y-axis upper limit"/>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
7 <param name="hist_color" size = "15" type="text" value="darkgray" label="Color for 1Mb bins" help="See below for list of supported colors"/>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
8 <param name="standardize" type="boolean" truevalue="true" falsevalue="false" checked="false" label="Standardize X-axis" help="Histogram plots from separate chromosomes will have uniform X-axis spacing for comparison"/>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
9 <param name="ems" type="boolean" truevalue="true" falsevalue="false" checked="false" label="Filter for most common EMS-induced variants (G/C—>A/T)"/>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
10 </inputs>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
11 <outputs>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
12 <data name="output" type="text" format="pdf" />
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
13 </outputs>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
14 <requirements>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
15 <requirement type="python-module">sys</requirement>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
16 <requirement type="python-module">optparse</requirement>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
17 <requirement type="python-module">csv</requirement>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
18 <requirement type="python-module">re</requirement>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
19 <requirement type="python-module">decimal</requirement>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
20 <requirement type="python-module">rpy</requirement>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
21 </requirements>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
22 <tests>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
23 <param name="snp_vcf" value="" />
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
24 <output name="output" file="" />
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
25 </tests>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
26 <help>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
27 **What it does:**
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
28
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
29 This tool is part of the CloudMap pipeline for analysis of mutant genome sequences. For further details, please see `Gregory Minevich, Danny Park, Richard J. Poole and Oliver Hobert. CloudMap: A Cloud-based Pipeline for Analysis of Mutant Genome Sequences. (2012 In Preparation)`__
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
30
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
31 .. __: http://biochemistry.hs.columbia.edu/labs/hobert/literature.html
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
32
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
33 Following the approach detailed in Zuryn et al., Genetics 2010, this tool plots histograms of variant density in a mutant C.elegans strain that has been backcrossed to its (pre-mutagenesis) starting strain. Common (i.e. non-phenotype causing) variants present in multiple WGS strains **with the same background** should first be subtracted using the GATK tool *Select Variants*.
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
34
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
35 Sample output where LG III shows linkage to the causal mutation is shown below. In this example, common variants from another strain have been subtracted and remaining variants have been filtered for most common EMS-induced mutations i.e. G/C --> A/T):
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
36
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
37 .. image:: http://biochemistry.hs.columbia.edu/labs/hobert/CloudMap/EMS_Variant_Density_750px.png
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
38
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
39
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
40
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
41
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
42
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
43 The experimental approach is detailed in Figure 1a from Zuryn et al., Genetics 2010:
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
44
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
45 .. image:: http://biochemistry.hs.columbia.edu/labs/hobert/CloudMap/Zuryn_2010_Genetics_Fig1a.pdf
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
46
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
47
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
48 Subtracting common (non-phenotype causing) variants from more whole genome sequenced strains (using GATK Tools *Select Variants*) will result in less noise and a tighter mapping region. Additional backcrosses will also result in a smaller mapping region.
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
49
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
50 ------
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
51
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
52 **Settings:**
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
53
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
54 .. class:: infomark
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
55
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
56 Supported colors for data points and loess regression line:
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
57
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
58 http://www.stat.columbia.edu/~tzheng/files/Rcolor.pdf
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
59
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
60 http://research.stowers-institute.org/efg/R/Color/Chart/ColorChart.pdf
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
61
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
62
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
63
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
64
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
65 .. class:: warningmark
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
66
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
67 This tool requires that the statistical programming environment R has been installed on the system hosting Galaxy (http://www.r-project.org/). If you are accessing this tool on Galaxy via the Cloud, this does not apply to you.
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
68
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
69 ------
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
70
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
71 **Citation:**
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
72
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
73 This tool is part of the CloudMap package from the Hobert Lab. If you use this tool, please cite `Gregory Minevich, Danny Park, Richard J. Poole and Oliver Hobert. CloudMap: A Cloud-based Pipeline for Analysis of Mutant Genome Sequences. (2012 In Preparation)`__
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
74
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
75 .. __: http://biochemistry.hs.columbia.edu/labs/hobert/literature.html
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
76
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
77 Correspondence to gm2123@columbia.edu (G.M.) or or38@columbia.edu (O.H.)
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
78
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
79 </help>
8fe7a6efbc22 Uploaded
gregory-minevich
parents:
diff changeset
80 </tool>