annotate ARTS/galaxy_arts_score.xml @ 0:3723b54935cb draft

Uploaded
author mmaiensc
date Wed, 13 Nov 2013 16:13:17 -0500
rev   line source
0
3723b54935cb Uploaded
mmaiensc
parents:
diff changeset
1 <tool id="ARTSscore" name="ARTS Score">
2 <description>compute the score for a study randomization</description>
3 <command interpreter="perl">ARTS.pl -i $input -p $batch -c "$column" -cc $conts -cd $dates -cb $bins -mmi -v l > $out </command>
4 <inputs>
5 <param name="input" type="data" format="tabular" label="Input traits per sample" help="Ensure input is formatted as tabular"/>
6 <param name="batch" type="data_column" data_ref="input" multiple="False" numerical="False" label="Batch column to use" help="Select which column corresponds to the batching you want to score." />
7 <param name="column" type="data_column" data_ref="input" multiple="True" numerical="False" label="Trait columns" help="Multi-select list - hold the appropriate key while clicking to select multiple columns." />
8 <param name="conts" type="data_column" data_ref="input" multiple="True" numerical="False" optional="True" label="Continuous- and date-valued columns for binning (if any)" help="Multi-select list. Values should be numbers, or dates if the column is also selected as date-valued below." />
9 <param name="dates" type="data_column" data_ref="input" multiple="True" numerical="False" optional="True" label="Date-valued columns for binning (if any)" help="Multi-select list. Dates should be M/D/Y, where M, D, and Y are all integers (e.g., 7/9/1985)." />
10 <param name="bins" type="text" size="40" label="Bin sizes (for continuously-valued columns)" value="5" optional="False" help="Set to a single number, or a comma-delimited list. If given as a list, the values are applied in the same order as the continuous columns."/>
11 </inputs>
12 <outputs>
13 <data format="tabular" name="out" />
14 </outputs>
15 <help>
16
17 **Purpose**
18
19 This tool computes the score of a completed study randomization (e.g., one produced by ARTS) over a selected set of traits for the samples in your data, given a column containing the batch assignments. The output is identical to the stdout of a standard ARTS run.
20
21 -----
22
23 **Input traits per sample**
24
25 - A tabular list of the traits associated with each sample, with a header line naming each trait and a column giving the batch assignments. For example::
26
27     ID       Sex  Age  Sample date  Diseased  Batch
28     Sample1  M    15   6/7/2011     Y         1
29     Sample2  M    25   8/5/2012     Y         2
30     Sample3  F    23   1/30/2012    N         1
31     Sample4  F    45   4/1/2013     N         1
32     Sample5  M    52   3/21/2011    Y         2
33     Sample6  F    37   3/12/2013    N         2
34     Sample7  M    31   7/17/2011    N         2
35
36 -----
37
38 **Batch column to use**
39
40 - Which column indicates the batch assignment. In the example above, this would be c6 (Batch).
41
42 -----
43
44 **Trait columns**
45
46 - Which traits should be included in the score. On Macs, hold Command to multi-select. You do not need to select all columns (it would make no sense, for example, to score over sample ID).
47
48 - Note that missing values for a trait are treated as an additional trait value (i.e., empty).
49
50 - For the example above, we would select c2, c3, c4, and c5 (Sex, Age, Sample date, and Diseased). Not all traits need be selected, just the relevant ones (we may not care about Sample date, for example).
51
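To illustrate how an empty cell counts as its own trait value, the sketch below tallies one trait column against the batch column. It is not taken from ARTS.pl: the function name ``trait_by_batch_counts``, the ``(empty)`` label, and the zero-based column indices are assumptions made for this example only.

```python
import csv
import io
from collections import Counter


def trait_by_batch_counts(tabular_text, trait_col, batch_col):
    """Count (trait value, batch) pairs; empty cells become '(empty)'.

    Hypothetical helper. Column indices are zero-based here, unlike
    Galaxy's c1, c2, ... labels.
    """
    reader = csv.reader(io.StringIO(tabular_text), delimiter="\t")
    next(reader)  # skip the header line
    counts = Counter()
    for row in reader:
        # An empty cell is kept as a separate category, not dropped.
        value = row[trait_col].strip() or "(empty)"
        counts[(value, row[batch_col].strip())] += 1
    return counts


# Three samples; Sample2 has no value in the Sex column.
example = (
    "ID\tSex\tBatch\n"
    "Sample1\tM\t1\n"
    "Sample2\t\t2\n"
    "Sample3\tF\t1\n"
)
counts = trait_by_batch_counts(example, trait_col=1, batch_col=2)
for (value, batch), n in sorted(counts.items()):
    print(value, batch, n)
```

The sample with the missing Sex value still contributes one count, under the ``(empty)`` category in batch 2.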
52 -----
53
54 **Continuous- and date-valued columns (optional)**
55
56 - Use if you have columns with continuous values (e.g., age, blood pressure) or dates. They will be discretized into bins before scoring.
57
58 - For the example above, we would select c3 and c4 (Age, Sample date).
59
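The help text does not say which binning rule ARTS applies, so the following is only a sketch of one common choice, equal-width bins, applied to the Age column from the example. The function name ``equal_width_bins`` is hypothetical, and ARTS.pl may discretize differently.

```python
def equal_width_bins(values, n_bins):
    """Map each number to a bin index in 0..n_bins-1 (equal-width bins).

    Assumption: equal-width binning; not confirmed to be what ARTS.pl does.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]  # all values identical: a single bin
    width = (hi - lo) / n_bins
    # min() keeps the largest value in the last bin rather than bin n_bins.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


ages = [15, 25, 23, 45, 52, 37, 31]  # Age column (c3) from the example
print(equal_width_bins(ages, 5))
```

After discretization, each bin index can be treated like any other categorical trait value.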
60 -----
61
62 **Date-valued columns (optional)**
63
64 - Use if any of the columns selected as continuous are dates. Dates MUST be formatted M/D/Y, with the month given as a number (for example, 7/9/1985).
65
66 - For the example above, we would select c4 (Sample date).
67
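As a rough illustration of why dates can be binned like continuous values, the sketch below parses M/D/Y strings and converts them to day counts. How ARTS.pl converts dates internally is not documented here; the helper name ``parse_mdy`` and the use of ordinal day numbers are assumptions.

```python
from datetime import date


def parse_mdy(text):
    """Parse 'M/D/Y' (all integers, e.g. 7/9/1985) into a datetime.date."""
    month, day, year = (int(part) for part in text.strip().split("/"))
    return date(year, month, day)


# First three Sample date values (c4) from the example.
dates = ["6/7/2011", "8/5/2012", "1/30/2012"]
# Ordinal day numbers are plain integers, so they order and bin like numbers.
ordinals = [parse_mdy(d).toordinal() for d in dates]
print(ordinals)
```

A value such as ``1985-07-09`` or ``July 9, 1985`` would fail to parse in this sketch, which mirrors the requirement that dates follow the M/D/Y pattern.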
68 -----
69
70 **Bin sizes**
71
72 - This applies only to columns selected as continuous, and sets how many discrete bins the data will be split into.
73
74 - You can set it to a single number, in which case all columns use that number of bins, or to a comma-delimited list of numbers to specify a different number of bins for each column.
75
76 - For the example above, where we selected c3 and c4 as continuous, we could set::
77
78 Bin sizes=5,6
79
80 - This would split the Age column (c3) into 5 bins, and the Sample date column (c4) into 6 bins.
81
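The pairing of bin counts with continuous columns described above can be sketched as follows. This is an illustration rather than the ARTS.pl implementation: the function name ``bins_per_column`` is hypothetical, and raising an error when the list length does not match the number of columns is an assumption about how a mismatch might be handled.

```python
def bins_per_column(bins_text, continuous_columns):
    """Pair each continuous column with its bin count.

    A single number applies to every column; a comma-delimited list is
    matched to the columns in the order they are given.
    """
    counts = [int(part) for part in bins_text.split(",")]
    if len(counts) == 1:
        counts = counts * len(continuous_columns)
    if len(counts) != len(continuous_columns):
        # Assumed behavior; ARTS.pl may treat a mismatch differently.
        raise ValueError("need one bin count, or one per continuous column")
    return dict(zip(continuous_columns, counts))


print(bins_per_column("5,6", ["c3", "c4"]))  # c3 gets 5 bins, c4 gets 6
print(bins_per_column("5", ["c3", "c4"]))    # both columns get 5 bins
```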
82
83 </help>
84 </tool>