annotate TO_GALAXY/tools/ARTS/galaxy_arts_score.xml @ 1:2086dd919b31 draft

Uploaded
author mmaiensc
date Wed, 13 Nov 2013 16:28:55 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
1
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
1 <tool id="ARTSscore" name="ARTS Score">
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
2 <description>compute the score for a study randomization</description>
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
3 <command interpreter="perl">ARTS.pl -i $input -p $batch -c "$column" -cc $conts -cd $dates -cb $bins -mmi -v l > $out </command>
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
4 <inputs>
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
5 <param name="input" type="data" format="tabular" label="Input traits per sample" help="Ensure input is formatted as tabular"/>
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
6 <param name="batch" type="data_column" data_ref="input" multiple="False" numerical="False" label="Batch column to use" help="Select which column corresponds to the batching you want to score." />
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
7 <param name="column" type="data_column" data_ref="input" multiple="True" numerical="False" label="Trait columns" help="Multi-select list - hold the appropriate key while clicking to select multiple columns." />
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
8 <param name="conts" type="data_column" data_ref="input" multiple="True" numerical="False" optional="True" label="Continuous- and date-valued columns for binning (if any)" help="Multi-select list. Values should be numbers." />
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
9 <param name="dates" type="data_column" data_ref="input" multiple="True" numerical="False" optional="True" label="Date-valued columns for binning (if any)" help="Multi-select list. Dates should be M/D/Y, where M, D, and Y are all integers (e.g., 7/9/1985)." />
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
10 <param name="bins" type="text" size="40" label="Bin sizes (for continuously-valued columns)" value="5" optional="False" help="Set to a single number, or a comma-delimited list. If given as a list, will be used in same order as continuous columns."/>
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
11 </inputs>
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
12 <outputs>
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
13 <data format="tabular" name="out" />
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
14 </outputs>
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
15 <help>
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
16
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
17 **Purpose**
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
18
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
19 This tool computes the score for a completed study randomization (e.g., by ARTS) for a selected number of traits over the samples in your data, and a particular column giving the batch assignments. The output here is identical to the stdout obtained from a standard ARTS run.
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
20
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
21 -----
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
22
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
23 **Input traits per sample**
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
24
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
25 - A list of traits associated with each sample, including a header line giving the name of each type of trait, and a batch column. For example::
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
26
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
27 ID Sex Age Sample date Diseased Batch
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
28 Sample1 M 15 6/7/2011 Y 1
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
29 Sample2 M 25 8/5/2012 Y 2
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
30 Sample3 F 23 1/30/2012 N 1
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
31 Sample4 F 45 4/1/2013 N 1
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
32 Sample5 M 52 3/21/2011 Y 2
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
33 Sample6 F 37 3/12/2013 N 2
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
34 Sample7 M 31 7/17/2011 N 2
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
35
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
36 -----
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
37
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
38 **Batch column to use**
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
39
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
40 - Which column indicate the batch assignment. In the example above, this would be c6 (batch).
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
41
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
42 -----
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
43
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
44 **Traits to randomize**
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
45
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
46 - Which traits should be randomized. On Macs, hold command to multi-select. You do not need to select all columns (it would be silly, for example, to randomize over sample ID).
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
47
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
48 - Note missing values for traits will be treated as an additional trait value (i.e., empty).
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
49
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
50 - For the example above, we would select c2, c3, c4, and c5 (Sex, Age, Sample date, and Diseased). Not all traits need be selected, just the relevant ones (we may not care about Sample date, for example).
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
51
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
52 -----
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
53
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
54 **Continuous- and date-valued columns (optional)**
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
55
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
56 - Use if you have columns with continuous values (e.g., age, blood pressure) or dates. They will be discretized prior to running.
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
57
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
58 - For the example above, we would select c3 and c4 (Age, Sample date).
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
59
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
60 -----
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
61
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
62 **Date-valued columns (optional)**
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
63
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
64 - Use if any of the columns selected as continuous are dates (MUST be formatted M/D/Y, where month is a number, for example 7/9/1985).
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
65
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
66 - For the example above, we would select c4 (Sample date).
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
67
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
68 -----
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
69
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
70 **Bin sizes**
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
71
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
72 - This only relates to any columns selected as continuous, and determines how many discrete bins the data will be split up in to.
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
73
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
74 - You can set it to a single number, and all columns will use that number of bins. Or you can set it to a list of numbers to specify a different number of bins for each column.
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
75
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
76 - For the example above, where we selected c3 and c4 as continuous, we could set::
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
77
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
78 Bin sizes=5,6
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
79
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
80 - which would split the Age column (c3) into 5 bins, and the Sample date column (c4) into 6 bins.
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
81
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
82
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
83 </help>
2086dd919b31 Uploaded
mmaiensc
parents:
diff changeset
84 </tool>