comparison offspring_heterozygosity_pedigree.xml @ 31:a631c2f6d913

Update to Miller Lab devshed revision 3c4110ffacc3
author Richard Burhans <burhans@bx.psu.edu>
date Fri, 20 Sep 2013 13:25:27 -0400
parents
children
comparison
equal deleted inserted replaced
30:4188853b940b 31:a631c2f6d913
1 <tool id="gd_offspring_heterozygosity_pedigree" name="Founders sequenced" version="1.0.0">
2 <description>: Offspring estimated heterozygosity from a pedigree with sequenced founders</description>
3
4 <command interpreter="python">
5 #import json
6 #import base64
7 #import zlib
8 #set $ind_names = $input.dataset.metadata.individual_names
9 #set $ind_colms = $input.dataset.metadata.individual_columns
10 #set $ind_dict = dict(zip($ind_names, $ind_colms))
11 #set $ind_json = json.dumps($ind_dict, separators=(',',':'))
12 #set $ind_comp = zlib.compress($ind_json, 9)
13 #set $ind_arg = base64.b64encode($ind_comp)
14 offspring_heterozygosity_pedigree.py '$input' '$input.ext' '$pedigree' '$ind_arg' '$founders' '$b1_input' '$b2_input' '$output'
15 </command>
16
17 <inputs>
18 <param name="input" type="data" format="gd_snp,gd_genotype" label="SNP dataset" />
19 <param name="pedigree" type="data" format="txt" label="Pedigree dataset" />
20 <param name="founders" type="data" format="gd_indivs" label="Founders dataset" />
21 <param name="b1_input" type="data" format="txt" label="First breeders dataset" />
22 <param name="b2_input" type="data" format="txt" label="Second breeders dataset" />
23 </inputs>
24
25 <outputs>
26 <data name="output" format="txt" />
27 </outputs>
28
29 <requirements>
30 <requirement type="package" version="0.1">gd_c_tools</requirement>
31 </requirements>
32
33 <!--
34 <tests>
35 </tests>
36 -->
37
38 <help>
39
40 **Dataset formats**
41
42 The input datasets are in gd_snp_, gd_genotype_, text_, and gd_indivs_ formats.
43 The output dataset is in text_ format.
44
45 .. _gd_snp: ./static/formatHelp.html#gd_snp
46 .. _gd_genotype: ./static/formatHelp.html#gd_genotype
47 .. _gd_indivs: ./static/formatHelp.html#gd_indivs
48 .. _text: ./static/formatHelp.html#text
49
50 -----
51
52 **What it does**
53
54 The user provides a Galaxy SNP table (gd_snp or gd_genotype format) that
55 includes the founders of a pedigree, as well as two sets of individuals.
56 The pedigree is specified by a text file with one row per individual,
57 containing (1) the individual's name, (2) the name of one of the
58 individual's parents, which must have occurred at the start of a previous
59 line, and (3) the name of the individual's other parent, which occurred at
60 the start of a previous line. For a pedigree founder, both parent names
61 are replaced by &quot;-&quot;. The founders are specified by a table in
62 gd_indivs format, e.g., as produced by &quot;Specify individuals&quot;
63 tool. Every founder must have genotypes supplied in the SNP table,
64 and both parents need to be given as &quot;-&quot; in the pedigree.
65 Conversely, every pedigree individual whose parents are &quot;-&quot;
66 must be named as a founder.
67
68 The user also provides two files that specify a set of names of
69 individuals. The first word on each line names an individual (one
70 line per individual); any subsequent information on a line is ignored.
71 The name of each individual must appear at the start of a line in the
72 pedigree.
73
74 For each pair of individuals, one from each specified set, the program
75 computes the expected heterozygosity of any offspring of the pair,
76 i.e., the probability that the offspring has distinct nucleotides at
77 a randomly chosen autosomal SNP. A SNP is ignorned if one or both
78 potential parents have an ancestor with undefined genotype (designated
79 as -1 in the SNP table).
80 </help>
81 </tool>