annotate evaluate_population_numbers.xml @ 12:4b6590dd7250

Uploaded
author miller-lab
date Wed, 12 Sep 2012 17:10:26 -0400
<tool id="gd_evaluate_population_numbers" name="Evaluate" version="1.0.0">
  <description>possible numbers of populations</description>

  <command interpreter="bash">
    evaluate_population_numbers.bash "${input.extra_files_path}/admix.ped" "$output" "$max_populations"
  </command>

  <inputs>
    <param name="input" type="data" format="gd_ped" label="Dataset" />
    <param name="max_populations" type="integer" min="1" value="5" label="Maximum number of populations" />
  </inputs>

  <outputs>
    <data name="output" format="txt" />
  </outputs>

  <!--
  <tests>
    <test>
      <param name="input" value="fake" ftype="gd_ped" >
        <metadata name="base_name" value="admix" />
        <composite_data value="test_out/prepare_population_structure/prepare_population_structure.html" />
        <composite_data value="test_out/prepare_population_structure/admix.ped" />
        <composite_data value="test_out/prepare_population_structure/admix.map" />
        <edit_attributes type="name" value="fake" />
      </param>
      <param name="max_populations" value="2" />

      <output name="output" file="test_out/evaluate_population_numbers/evaluate_population_numbers.txt" />
    </test>
  </tests>
  -->

  <help>
**What it does**

The user selects a dataset generated by the Galaxy tool to "prepare
to look for population structure". For each possible number K of ancestral
populations, from 1 up to a user-specified maximum, this tool produces a value
that indicates how well the data can be explained as genotypes from individuals
derived from K ancestral populations. These values are computed by a 5-fold
cross-validation procedure, so a good choice for K will exhibit a low
cross-validation error compared with other potential settings for K.

**Acknowledgments**

We use the program "Admixture", downloaded from

http://www.genetics.ucla.edu/software/admixture/

and described in the paper "Fast model-based estimation of ancestry in
unrelated individuals" by David H. Alexander, John Novembre and Kenneth Lange,
Genome Research 19 (2009), pp. 1655-1664. Admixture is called with the "--cv"
flag to produce these values.
  </help>
</tool>
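
The help text says that a good K shows a low cross-validation error relative to the other settings. The sketch below illustrates one way to read such values and pick that K. It assumes the text being parsed contains lines of the form `CV error (K=3): 0.47835`, which is how Admixture reports results when run with `--cv`; whether this tool's `txt` output keeps that exact form is an assumption, and the helper names `parse_cv_errors` and `best_k` are hypothetical, not part of the tool.

```python
import re

# Matches lines such as "CV error (K=3): 0.47835" (assumed output form).
CV_LINE = re.compile(r"CV error \(K=(\d+)\): ([0-9.eE+-]+)")


def parse_cv_errors(text):
    """Return {K: cross-validation error} for every CV line found in text."""
    errors = {}
    for match in CV_LINE.finditer(text):
        errors[int(match.group(1))] = float(match.group(2))
    return errors


def best_k(errors):
    """Return the K with the lowest error; the smallest K wins on ties."""
    if not errors:
        raise ValueError("no cross-validation errors found")
    # Iterating over sorted keys makes min() return the smallest K on a tie.
    return min(sorted(errors), key=lambda k: errors[k])


# Illustrative numbers only, not results from any real dataset.
sample_output = """\
CV error (K=1): 0.55248
CV error (K=2): 0.48190
CV error (K=3): 0.47835
CV error (K=4): 0.48214
"""
errors = parse_cv_errors(sample_output)
print(best_k(errors))
```

Taking the strict minimum is only one reading of "low compared with other settings"; when several values of K give nearly the same error, it may be worth inspecting the whole curve rather than relying on a single number.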