annotate EMS_VariantDensityMapping.xml @ 9:58a3878549ef draft

Uploaded
author gregory-minevich
date Mon, 25 Jun 2012 16:09:09 -0400
<tool id="ems_variant_density_mapping" name="CloudMap: EMS Variant Density Mapping">
<description>Map a mutation by linkage to regions of high mutation density using WGS data</description>
<command interpreter="python">EMS_VariantDensityMapping.py --snp_vcf $snp_vcf --ylim $ylim --hist_color "$hist_color" --standardize $standardize --ems $ems --output $output</command>
<inputs>
<param name="snp_vcf" type="data" format="vcf" label="VCF of SNPs" help="Takes a VCF file of WGS SNPs present in a C. elegans mutant strain that has been backcrossed to its (pre-mutagenesis) starting strain"/>
<param name="ylim" size="15" type="integer" value="200" label="Y-axis upper limit"/>
<param name="hist_color" size="15" type="text" value="darkgray" label="Color for 1Mb bins" help="See below for list of supported colors"/>
<param name="standardize" type="boolean" truevalue="true" falsevalue="false" checked="true" label="Standardize X-axis" help="Frequency plots from separate chromosomes will have uniform X-axis spacing for comparison"/>
<param name="ems" type="boolean" truevalue="true" falsevalue="false" checked="true" label="Filter for most common EMS-induced variants (G/C --> A/T)"/>
</inputs>
<outputs>
<data name="output" type="text" format="pdf" />
</outputs>
<requirements>
<requirement type="python-module">sys</requirement>
<requirement type="python-module">optparse</requirement>
<requirement type="python-module">csv</requirement>
<requirement type="python-module">re</requirement>
<requirement type="python-module">decimal</requirement>
<requirement type="python-module">rpy</requirement>
</requirements>
<tests>
<param name="snp_vcf" value="" />
<output name="output" file="" />
</tests>
<help>
**What it does:**

This tool is part of the CloudMap pipeline for analysis of mutant genome sequences. For further details, please see `Gregory Minevich, Danny Park, Richard J. Poole, Daniel Blankenberg, Anton Nekrutenko, and Oliver Hobert. CloudMap: A Cloud-based Pipeline for Analysis of Mutant Genome Sequences. (2012 In Preparation)`__

.. __: http://biochemistry.hs.columbia.edu/labs/hobert/literature.html

CloudMap workflows, shared histories and reference datasets are available at the `CloudMap Galaxy page`__

.. __: https://test.g2.bx.psu.edu/u/gal40/p/cloudmap

Following the approach detailed in Zuryn et al., Genetics 2010, this tool plots histograms of variant density in a mutant C. elegans strain that has been backcrossed to its (pre-mutagenesis) starting strain. Common (i.e. non-phenotype-causing) variants present in multiple WGS strains **with the same background** should first be subtracted using the GATK tool *Select Variants*.

Sample output where LG III shows linkage to the causal mutation is shown below. In this example, common variants from another strain have been subtracted and the remaining variants have been filtered for the most common EMS-induced mutations (i.e. G/C --> A/T):

.. image:: http://biochemistry.hs.columbia.edu/labs/hobert/CloudMap/EMS_Variant_Density_750px.png

The experimental approach is detailed in Figure 1a from Zuryn et al., Genetics 2010:

.. image:: http://biochemistry.hs.columbia.edu/labs/hobert/CloudMap/Zuryn_2010_Genetics_Fig1a.pdf

Subtracting common (non-phenotype-causing) variants from additional whole-genome-sequenced strains (using the GATK tool *Select Variants*) will result in less noise and a tighter mapping region. Additional backcrosses will also result in a smaller mapping region.

------

**Settings:**

.. class:: infomark

Supported colors for the 1Mb histogram bins:

http://www.stat.columbia.edu/~tzheng/files/Rcolor.pdf

http://research.stowers-institute.org/efg/R/Color/Chart/ColorChart.pdf

.. class:: warningmark

This tool requires that the statistical programming environment R has been installed on the system hosting Galaxy (http://www.r-project.org/). If you are accessing this tool on Galaxy via the Cloud, this does not apply to you.

------

**Citation:**

This tool is part of the CloudMap package from the Hobert Lab. If you use this tool, please cite `Gregory Minevich, Danny Park, Richard J. Poole, Daniel Blankenberg, Anton Nekrutenko, and Oliver Hobert. CloudMap: A Cloud-based Pipeline for Analysis of Mutant Genome Sequences. (2012 In Preparation)`__

.. __: http://biochemistry.hs.columbia.edu/labs/hobert/literature.html

Correspondence to gm2123@columbia.edu (G.M.) or or38@columbia.edu (O.H.)

</help>
</tool>
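The EMS filter and 1 Mb binning described in the help text can be sketched in Python. This is only an illustration under stated assumptions, not the code in EMS_VariantDensityMapping.py: the function name `bin_variants`, the constants, and the reading of "G/C --> A/T" as exactly the single-base changes G to A and C to T are all hypothetical.

```python
from collections import Counter

# Assumption: 1 Mb bins, taken from the "Color for 1Mb bins" label.
BIN_SIZE = 1_000_000

# Assumption: the EMS filter keeps only G-to-A and C-to-T single-base changes.
EMS_CHANGES = {("G", "A"), ("C", "T")}


def bin_variants(vcf_lines, ems_only=True, bin_size=BIN_SIZE):
    """Count variants per (chromosome, bin index) from tab-separated VCF lines.

    Hypothetical helper: skips header lines, optionally keeps only the
    assumed EMS changes, and assigns each variant to a bin by its 1-based
    position.
    """
    counts = Counter()
    for line in vcf_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and VCF meta/header lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 5:
            continue  # not a complete VCF data record
        chrom, pos = fields[0], int(fields[1])
        ref, alt = fields[3].upper(), fields[4].upper()
        # Multi-allelic ALT values such as "A,T" never match and are dropped
        # when the filter is on.
        if ems_only and (ref, alt) not in EMS_CHANGES:
            continue
        counts[(chrom, (pos - 1) // bin_size)] += 1
    return counts


example = [
    "##fileformat=VCFv4.1",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "III\t100\t.\tG\tA\t50\tPASS\t.",
    "III\t1000000\t.\tC\tT\t50\tPASS\t.",
    "III\t1000001\t.\tC\tT\t50\tPASS\t.",
    "III\t200\t.\tA\tG\t50\tPASS\t.",
]
ems_counts = bin_variants(example)
all_counts = bin_variants(example, ems_only=False)
```

A chromosome linked to the causal mutation would be expected to show bins with markedly higher counts than the rest of the genome; plotting the per-bin counts as a histogram per chromosome gives the kind of figure shown in the sample output.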