comparison prepare_population_structure.xml @ 17:a3af29edcce2

Uploaded Miller Lab Devshed version a51c894f5bed
author miller-lab
date Fri, 28 Sep 2012 11:57:18 -0400
parents 8ae67e9fb6ff
children f04f40a36cc8
comparison
equal deleted inserted replaced
16:be0e2223c531 17:a3af29edcce2
1 <tool id="gd_prepare_population_structure" name="Prepare Input" version="1.0.0">
2 <description>: Filter and convert to the format needed for these tools</description>
3
4 <command interpreter="python">
5 prepare_population_structure.py "$input" "$min_reads" "$min_qual" "$min_spacing" "$output" "$output.files_path"
6 #if $individuals.choice == '0'
7 "all_individuals"
8 #else if $individuals.choice == '1'
9 #for $population in $individuals.populations
10 #set $pop_arg = 'population:%s:%s' % (str($population.p_input), str($population.p_input.name))
11 "$pop_arg"
12 #end for
13 #end if
14 #for $individual, $individual_col in zip($input.dataset.metadata.individual_names, $input.dataset.metadata.individual_columns)
15 #set $arg = 'individual:%s:%s' % ($individual_col, $individual)
16 "$arg"
17 #end for
18 </command>
19
20 <inputs>
21 <param name="input" type="data" format="gd_snp" label="SNP dataset" />
22 <param name="min_reads" type="integer" min="0" value="0" label="Minimum reads covering a SNP, per individual" />
23 <param name="min_qual" type="integer" min="0" value="0" label="Minimum quality value, per individual" />
24 <param name="min_spacing" type="integer" min="0" value="0" label="Minimum spacing between SNPs on the same scaffold" />
25 <conditional name="individuals">
26 <param name="choice" type="select" label="Individuals">
27 <option value="0" selected="true">All</option>
28 <option value="1">Choose</option>
29 </param>
30 <when value="0" />
31 <when value="1">
32 <repeat name="populations" title="Population" min="1">
33 <param name="p_input" type="data" format="gd_indivs" label="Individuals" />
34 </repeat>
35 </when>
36 </conditional>
37 </inputs>
38
39 <outputs>
40 <data name="output" format="gd_ped">
41 <actions>
42 <action type="metadata" name="base_name" default="admix" />
43 </actions>
44 </data>
45 </outputs>
46
47 <tests>
48 <test>
49 <param name="input" value="test_in/sample.gd_snp" ftype="gd_snp" />
50 <param name="min_reads" value="3" />
51 <param name="min_qual" value="30" />
52 <param name="min_spacing" value="0" />
53 <param name="choice" value="0" />
54 <output name="output" file="test_out/prepare_population_structure/prepare_population_structure.html" ftype="html" compare="diff" lines_diff="2">
55 <extra_files type="file" name="admix.map" value="test_out/prepare_population_structure/admix.map" />
56 <extra_files type="file" name="admix.ped" value="test_out/prepare_population_structure/admix.ped" />
57 </output>
58 </test>
59 </tests>
60
61 <help>
62
63 **Dataset formats**
64
65 The input datasets are in gd_snp_ and gd_indivs_ formats. It is important
66 for the Individuals datasets to have unique names; rename them if
67 necessary to make them unique. These names are used by the later tools in
68 the graphical displays.
69 The output dataset is gd_ped_. (`Dataset missing?`_)
70
71 .. _gd_snp: ./static/formatHelp.html#gd_snp
72 .. _gd_indivs: ./static/formatHelp.html#gd_indivs
73 .. _gd_ped: ./static/formatHelp.html#gd_ped
74 .. _Dataset missing?: ./static/formatHelp.html
75
76 -----
77
78 **What it does**
79
80 The tool converts a gd_snp dataset into two tables, called "admix.map" and
81 "admix.ped", needed for estimating the population structure. The user
82 can read or download those files, or simply pass this tool's output on to
83 other programs. The user imposes conditions on which SNPs to consider,
84 such as the minimum coverage and/or quality value for every individual,
85 or the distance to the closest SNP in the same contig (as named in the
86 first column of the SNP table). A useful piece of information produced
87 by the tool is the number of SNPs meeting those conditions, which can
88 be found by clicking on the eye icon in the history panel after the program
89 runs.
90
91 -----
92
93 **Example**
94
95 - input::
96
97 Contig161_chr1_4641264_4641879 115 C T 73.5 chr1 4641382 C 6 0 2 45 8 0 2 51 15 0 2 72 5 0 2 42 6 0 2 45 10 0 2 57 Y 54 0.323 0
98 Contig48_chr1_10150253_10151311 11 A G 94.3 chr1 10150264 A 1 0 2 30 1 0 2 30 1 0 2 30 3 0 2 36 1 0 2 30 1 0 2 30 Y 22 +99. 0
99 Contig20_chr1_21313469_21313570 66 C T 54.0 chr1 21313534 C 4 0 2 39 4 0 2 39 5 0 2 42 4 0 2 39 4 0 2 39 5 0 2 42 N 1 +99. 0
100 etc.
101
102 - output map file::
103
104 1 snp1 0 2
105 1 snp3 0 4
106 1 snp4 0 5
107 1 snp5 0 6
108 1 snp6 0 7
109 1 snp7 0 8
110 1 snp8 0 9
111 1 snp9 0 10
112
113 - output ped file::
114
115 PB1 1 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
116
117 </help>
118 </tool>