6
|
1 <tool id="saint_wrapper" name="SAINTexpress">
|
|
2 <description></description>
|
|
3 <command interpreter="python">saint_wrapper.py $inter_input $prey_input $bait_input $rep_num $vc_bool $vc_num $go_bool $go_file $SAINT_output \$INSTALL_RUN_PATH/</command>
|
|
4 <requirements>
|
|
5 <requirement type="set_environment">INSTALL_RUN_PATH</requirement>
|
|
6 </requirements>
|
|
7 <inputs>
|
|
8 <param format="txt" name="inter_input" type="data" label="Interactions File"/>
|
|
9 <param format="txt" name="prey_input" type="data" label="Prey File"/>
|
|
10 <param format="txt" name="bait_input" type="data" label="Bait File"/>
|
|
11 <param name="rep_num" type="integer" value="100" label="Number of Replicates" help="the number of replicates (with largest spectral counts or intensities) to be used for probability calculation in each bait. This option is useful when some baits have more replicates than others." />
|
|
12 <param type="boolean" name="vc_bool" checked="false" label="Using Virtual Control Purifications by Compression"/>
|
|
13 <param name="vc_num" type="integer" value="1" label="Number of Virtual Controls" />
|
|
14 <param type="boolean" name="go_bool" checked="false" label="Incorporate External Data Sources" />
|
|
15 <param format="txt" name="go_file" type="data" optional="true" label="External Data File" help="SAINT can be run with the known interaction data in hand and it will incorporate the number of observed interactions into the statistics to boost identification probability."/>
|
|
16 </inputs>
|
|
17 <outputs>
|
|
18 <data format="txt" name="SAINT_output" />
|
|
19 </outputs>
|
|
20 <stdio>
|
|
21 <regex match="Error|error"
|
|
22 source="stdout"
|
|
23 level="fatal"
|
|
24 description="Unknown error"/>
|
|
25 <regex match="Error|error"
|
|
26 source="stderr"
|
|
27 level="fatal"
|
|
28 description="Unknown error"/>
|
|
29 <regex match="Error: bad bait"
|
|
30 source="stdout"
|
|
31 level="fatal"
|
|
32 description="Error: bad bait"/>
|
|
33 </stdio>
|
|
34
|
|
35 <tests>
|
|
36 <test>
|
|
37 <param name="input" value="fa_gc_content_input.fa"/>
|
|
38 <output name="out_file1" file="fa_gc_content_output.txt"/>
|
|
39 </test>
|
|
40 </tests>
|
|
41 <help>
|
8
|
42 SAINTexpress
|
|
43 ------------
|
|
44
|
|
45 This tool will read in a *Bait File, Prey File,* and *Inter File* and
|
|
46 analyze them using the latest version of SAINTexpress.
|
6
|
47
|
8
|
48 **1) Interactions File**
|
|
49
|
|
50 Please specify an *Inter* file for SAINTexpress analysis. An *Inter*
|
|
51 file should contain four columns: IP name, bait name, prey name, and
|
|
52 spectral counts or intensity values, depending on the mode of
|
|
53 quantification.
|
6
|
54
|
8
|
55 **2) Prey File**
|
|
56
|
|
57 Please specify a *Prey* file for SAINTexpress analysis. A *Prey* file
|
|
58 should contain three columns: prey (protein) name, prey amino acid
|
|
59 length, and prey gene name.
|
6
|
60
|
8
|
61 **3) Bait File**
|
|
62
|
|
63 Please specify a *Bait* file for SAINTexpress analysis. A *Bait* file
|
|
64 should have three columns: IP name, bait name, and the indicator for
|
|
65 test and negative control purifications.
|
|
66
|
|
67 **4) Number of Replicates**
|
6
|
68
|
8
|
69 The number of replicates (with largest spectral counts or intensities)
|
|
70 to be used for probability calculation in each bait. This option is
|
|
71 useful when some baits have more replicates than others.
|
|
72
|
|
73 **5) Using Virtual Control Purifications by Compression**
|
6
|
74
|
8
|
75 Sets the number of virtual control purifications by compression. For
|
|
76 instance, if you wish to take 4 largest spectral counts for controls,
|
|
77 set **Number of Vitual Controls** to 4 and this variable to Yes.
|
|
78
|
|
79 **6) Number of Virtual Controls**
|
|
80
|
|
81 Specify the number of virtual control purifications by compression.
|
6
|
82
|
8
|
83 **7) Incorporate External Data Sources**
|
6
|
84
|
8
|
85 To incorporate external data sources for computing the Topology-Aware
|
|
86 Probability Score (TopoAvgP), the user must also provide the interaction
|
|
87 database file that contains two columns: interaction identifier column
|
|
88 and interaction/grouping information column. The first column is just
|
|
89 for formality and thus can be filled in with anything (no white space)
|
|
90 and it will not be utilized in the scoring. The second column must be
|
|
91 formatted as a list of prey identifiers (consistent with the first
|
|
92 column of the prey file) separated by a white space.
|
6
|
93
|
8
|
94 **8) External Data File**
|
|
95
|
|
96 Specify external data file
|
|
97
|
|
98 --------
|
6
|
99
|
|
100 **Please cite**
|
|
101
|
|
102 * Choi, H., Larsen, B., Lin., Z.-Y., Breitkreutz, A., Mellacheruvu, D., Fermin, D., Qin, Z.S., Tyers, M., Gingras, A.-C. and Nesvizhskii, A.I. (2011) SAINT: probabilistic scoring of affinity purification - mass spectrometry data. Nature Methods, 8:70-3.
|
|
103
|
|
104 </help>
|
|
105 </tool>
|