5
|
1 <tool id="SAINT_preprocessing_v5" name="SAINT pre-processing">
|
|
2 <description></description>
|
|
3 <command interpreter="python">
|
|
4 #if (str($type) == 'Scaffold'):
|
|
5 SAINT_preprocessing.py $input $preybool $fasta_db $Inter_file $Prey_file
|
|
6 "
|
|
7 #for $ba in $bait
|
|
8 ${ba.bait1}
|
|
9 ${ba.assign}
|
|
10 ${ba.T_C}
|
|
11 #end for
|
|
12 "
|
|
13 $Bait_file \$INSTALL_RUN_PATH/ $bait_bool $bait_file_in
|
|
14 #elif (str($type) == 'MaxQuant'):
|
|
15 SAINT_preprocessing_mq_pep.py $input $preybool $fasta_db $Inter_file $Prey_file
|
|
16 "
|
|
17 #for $ba in $bait
|
|
18 ${ba.bait1}
|
|
19 ${ba.assign}
|
|
20 ${ba.T_C}
|
|
21 #end for
|
|
22 "
|
|
23 $Bait_file \$INSTALL_RUN_PATH/ $bait_bool $bait_file_in
|
|
24 #end if
|
|
25 </command>
|
|
26 <requirements>
|
|
27 <requirement type="set_environment">INSTALL_RUN_PATH</requirement>
|
|
28 <requirement type="package" version="3.2.1">package_r_3_2_1</requirement>
|
|
29 </requirements>
|
|
30 <inputs>
|
|
31 <param type="select" name="type" label="MaxQuant or Scaffold">
|
|
32 <option value="MaxQuant">MaxQuant</option>
|
|
33 <option value="Scaffold">Scaffold</option>
|
|
34 </param>
|
|
35 <param format="dat" name="input" type="data" label="Scaffold or MaxQuant proteinGroup Output"/>
|
|
36 <param type="boolean" name="preybool" checked="true" label="Create Prey File"/>
|
|
37 <param type="data" name="fasta_db" format="fasta" label="Provide Uniprot Fasta database" optional="true"/>
|
|
38 <param name="bait_bool" type="boolean" checked="true" label="Are You Providing Your Own bait file?"/>
|
|
39 <param type="data" format="dat" name="bait_file_in" label="Bait File" optional="true"/>
|
|
40 <repeat name="bait" title="Bait Create">
|
|
41 <param name="bait1" type="text" size="100" label="Bait"/>
|
|
42 <param name="assign" type="text" size="100" label="Group Assignment"/>
|
|
43 <param name="T_C" type="boolean" checked="true" label="Is this a Control?"/>
|
|
44 </repeat>
|
|
45
|
|
46 </inputs>
|
|
47 <outputs>
|
|
48 <data format="txt" name="Inter_file" label="Inter File"/>
|
|
49 <data format="txt" name="Prey_file" label="Prey File" />
|
|
50 <data format="txt" name="Bait_file" label="Bait File" />
|
|
51 </outputs>
|
|
52 <stdio>
|
|
53 <regex match="Error|error"
|
|
54 source="stdout"
|
|
55 level="fatal"
|
|
56 description="Unknown error"/>
|
|
57 <regex match="Error|error"
|
|
58 source="stderr"
|
|
59 level="fatal"
|
|
60 description="Unknown error"/>
|
|
61 <regex match="Error: bad bait"
|
|
62 source="stdout"
|
|
63 level="fatal"
|
|
64 description="Error: bad bait"/>
|
|
65 </stdio>
|
|
66
|
|
67 <tests>
|
|
68 <test>
|
|
69 <param name="input" value="fa_gc_content_input.fa"/>
|
|
70 <output name="out_file1" file="fa_gc_content_output.txt"/>
|
|
71 </test>
|
|
72 </tests>
|
|
73 <help>
|
21
|
74 Pre-Processing
|
|
75 ==============
|
5
|
76
|
21
|
77 --------------
|
20
|
78
|
21
|
79 This tool will read in a Scaffold *Samples Report* file (tab-delimited
|
|
80 txt file) or a MaxQuant *peptides.txt* file and process them to generate
|
|
81 a *Bait File, Prey File,* and *Inter File* for SAINTexpress analysis.
|
20
|
82
|
21
|
83 1) MaxQuant or Scaffold
|
23
|
84 -----------------------
|
20
|
85
|
21
|
86 APOSTL is able to recognize either a Scaffold *Samples Report* file
|
|
87 (tab-delimited txt file) or the *peptides.txt* file output in the
|
|
88 MaxQuant *txt* output folder. No modifications should be made to these
|
|
89 files. Please designate which one is being provided.
|
20
|
90
|
21
|
91 2) Scaffold or MaxQuant File Input
|
23
|
92 ----------------------------------
|
20
|
93
|
21
|
94 Select the corresponding *Samples Report* or *peptides.txt* here.
|
|
95
|
|
96 3) Create Prey File
|
23
|
97 -------------------
|
5
|
98
|
20
|
99 Select whether or not you would like APOSTL to generate a prey file.
|
|
100
|
21
|
101 When making a prey file, APOSTL queries a user provided FASTA database
|
|
102 (see below) in order to extract protein amino acid lengths and gene
|
|
103 names. This may take several minutes depending on your computer and if
|
|
104 your Galaxy distribution is cluster enabled. Some users may want to run
|
|
105 SAINTexpress using the same data set while changing which baits are
|
|
106 considered test or control. It is useful to toggle **Create Prey File**
|
|
107 off in order to save time by circumventing this step as the same prey
|
|
108 file can be used for both SAINTexpress runs.
|
20
|
109
|
21
|
110 4) Provide Uniprot FASTA database
|
23
|
111 ---------------------------------
|
20
|
112
|
21
|
113 Please specify a FASTA file containing the protein sequence. You can
|
|
114 download the latest release of Uniprot's reviewed sequences (Swiss-Prot)
|
22
|
115 at http://www.uniprot.org/downloads. For computing efficiency,
|
21
|
116 please limit your FASTA file to your species of interest. It is
|
|
117 generally advised to use the same FASTA database that was used for your
|
|
118 database searches during peptide and protein ID.
|
5
|
119
|
21
|
120 5) Are You Providing Your Own Bait File?
|
23
|
121 ----------------------------------------
|
20
|
122
|
21
|
123 Users have two options for providing a *Bait* file. If you would like to
|
|
124 create your own *Bait* file, select Yes and designate the file below. If
|
|
125 you would like to use the **Bait Create** tool to create a *Bait* file
|
|
126 within Galaxy, please select No and proceed to the **Bait Create** tool.
|
20
|
127
|
21
|
128 6) Bait File
|
23
|
129 ------------
|
21
|
130
|
|
131 If you selected Yes above, please specify the *Bait* file you would like
|
|
132 to provide. If you selected No above, ignore this argument and proceed
|
|
133 to the **Bait Create** tool.
|
5
|
134
|
21
|
135 **Note:** Individual bait names must match to the sample names
|
|
136 within your Scaffold or MaxQuant output exactly and must contain no
|
|
137 whitespace characters (e.g. spaces) or dashes.
|
5
|
138
|
21
|
139 7) Bait Create
|
23
|
140 --------------
|
21
|
141
|
|
142 Using the **Bait Create** tool, you can create your *bait.txt* file.
|
20
|
143
|
21
|
144 **Note:** Individual bait names must match to the sample names
|
|
145 within your Scaffold or MaxQuant output exactly and must contain no
|
|
146 whitespace characters (e.g. spaces) or dashes.
|
5
|
147
|
21
|
148 APOSTL uses this bait file to find the user's baits of interest within a
|
|
149 *Samples Report* or *peptides.txt* file and when preparing the
|
|
150 *inter.txt* file.
|
20
|
151
|
21
|
152 --------------
|
20
|
153
|
21
|
154 Once your parameters have been finalized, please press the Execute
|
|
155 button to start processing. This may take a few minutes. Once your
|
|
156 process has been completed, you will see your *Bait File, Prey File,*
|
|
157 and *Inter File* on the right hand side of your panel highlighted in
|
|
158 green.
|
5
|
159 </help>
|
|
160 </tool>
|