Mercurial > repos > peterjc > tmhmm_and_signalp
annotate tools/protein_analysis/signalp3.xml @ 17:e6cc27d182a8 draft
Uploaded v0.2.6, embedded citations and uses $GALAXY_SLOTS
author | peterjc |
---|---|
date | Fri, 21 Nov 2014 08:19:09 -0500 |
parents | 7de64c8b258d |
children | eb6ac44d4b8e |
rev | line source |
---|---|
17
e6cc27d182a8
Uploaded v0.2.6, embedded citations and uses $GALAXY_SLOTS
peterjc
parents:
16
diff
changeset
|
1 <tool id="signalp3" name="SignalP 3.0" version="0.0.14"> |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
2 <description>Find signal peptides in protein sequences</description> |
7
9b45a8743100
Uploaded v0.1.0, which adds a wrapper for Promoter 2.0 (DNA tool) and enables use of Galaxy's <parallelism> tag for SignalP, TMHMM X Promoter wrappers.
peterjc
parents:
6
diff
changeset
|
3 <!-- If job splitting is enabled, break up the query file into parts --> |
9b45a8743100
Uploaded v0.1.0, which adds a wrapper for Promoter 2.0 (DNA tool) and enables use of Galaxy's <parallelism> tag for SignalP, TMHMM X Promoter wrappers.
peterjc
parents:
6
diff
changeset
|
4 <!-- Using 2000 chunks meaning 4 threads doing 500 each is ideal --> |
9b45a8743100
Uploaded v0.1.0, which adds a wrapper for Promoter 2.0 (DNA tool) and enables use of Galaxy's <parallelism> tag for SignalP, TMHMM X Promoter wrappers.
peterjc
parents:
6
diff
changeset
|
5 <parallelism method="basic" split_inputs="fasta_file" split_mode="to_size" split_size="2000" merge_outputs="tabular_file"></parallelism> |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
6 <command interpreter="python"> |
17
e6cc27d182a8
Uploaded v0.2.6, embedded citations and uses $GALAXY_SLOTS
peterjc
parents:
16
diff
changeset
|
7 signalp3.py $organism $truncate "\$GALAXY_SLOTS" $fasta_file $tabular_file |
16
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
8 ##If the environment variable isn't set, get "", and the python wrapper |
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
9 ##defaults to four threads. |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
10 </command> |
9 | 11 <stdio> |
12 <!-- Anything other than zero is an error --> | |
13 <exit_code range="1:" /> | |
14 <exit_code range=":-1" /> | |
15 </stdio> | |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
16 <inputs> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
17 <param name="fasta_file" type="data" format="fasta" label="FASTA file of protein sequences"/> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
18 <param name="organism" type="select" display="radio" label="Organism"> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
19 <option value="euk">Eukaryote</option> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
20 <option value="gram+">Gram positive</option> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
21 <option value="gram-">Gram negative</option> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
22 </param> |
4
81caef04ce8b
Migrated tool version 0.0.7 from old tool shed archive to new tool shed repository
peterjc
parents:
3
diff
changeset
|
23 <param name="truncate" type="integer" label="Truncate sequences to this many amino acids" value="70" help="Use zero for no truncation, maximum value 6000"> |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
24 <validator type="in_range" min="0" max="6000" message="Truncation value should be at most 6000. Use zero for no truncation."/> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
25 </param> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
26 </inputs> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
27 <outputs> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
28 <data name="tabular_file" format="tabular" label="SignalP $organism results" /> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
29 </outputs> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
30 <requirements> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
31 <requirement type="binary">signalp</requirement> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
32 </requirements> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
33 <tests> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
34 <test> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
35 <param name="fasta_file" value="four_human_proteins.fasta" ftype="fasta"/> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
36 <param name="organism" value="euk"/> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
37 <param name="truncate" value="0"/> |
1
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
38 <output name="tabular_file" file="four_human_proteins.signalp3.tabular" ftype="tabular"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
39 </test> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
40 <test> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
41 <param name="fasta_file" value="empty.fasta" ftype="fasta"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
42 <param name="organism" value="euk"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
43 <param name="truncate" value="60"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
44 <output name="tabular_file" file="empty_signalp3.tabular" ftype="tabular"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
45 </test> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
46 <test> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
47 <param name="fasta_file" value="empty.fasta" ftype="fasta"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
48 <param name="organism" value="gram+"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
49 <param name="truncate" value="80"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
50 <output name="tabular_file" file="empty_signalp3.tabular" ftype="tabular"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
51 </test> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
52 <test> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
53 <param name="fasta_file" value="empty.fasta" ftype="fasta"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
54 <param name="organism" value="gram-"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
55 <param name="truncate" value="0"/> |
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
56 <output name="tabular_file" file="empty_signalp3.tabular" ftype="tabular"/> |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
57 </test> |
4
81caef04ce8b
Migrated tool version 0.0.7 from old tool shed archive to new tool shed repository
peterjc
parents:
3
diff
changeset
|
58 <test> |
81caef04ce8b
Migrated tool version 0.0.7 from old tool shed archive to new tool shed repository
peterjc
parents:
3
diff
changeset
|
59 <param name="fasta_file" value="rxlr_win_et_al_2007.fasta" ftype="fasta"/> |
81caef04ce8b
Migrated tool version 0.0.7 from old tool shed archive to new tool shed repository
peterjc
parents:
3
diff
changeset
|
60 <param name="organism" value="euk"/> |
81caef04ce8b
Migrated tool version 0.0.7 from old tool shed archive to new tool shed repository
peterjc
parents:
3
diff
changeset
|
61 <param name="truncate" value="70"/> |
81caef04ce8b
Migrated tool version 0.0.7 from old tool shed archive to new tool shed repository
peterjc
parents:
3
diff
changeset
|
62 <output name="tabular_file" file="rxlr_win_et_al_2007_sp3.tabular" ftype="tabular"/> |
81caef04ce8b
Migrated tool version 0.0.7 from old tool shed archive to new tool shed repository
peterjc
parents:
3
diff
changeset
|
63 </test> |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
64 </tests> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
65 <help> |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
66 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
67 **What it does** |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
68 |
1
3ff1dcbb9440
Migrated tool version 0.0.3 from old tool shed archive to new tool shed repository
peterjc
parents:
0
diff
changeset
|
69 This calls the SignalP v3.0 tool for prediction of signal peptides, which uses both a Neural Network (NN) and Hidden Markov Model (HMM) to produce two sets of scores. |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
70 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
71 The input is a FASTA file of protein sequences, and the output is tabular with twenty columns (one row per protein): |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
72 |
11
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
73 ====== ================================================= |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
74 Column Description |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
75 ------ ------------------------------------------------- |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
76 1 Sequence identifier |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
77 2-14 Neural Network (NN) predictions (13 columns) |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
78 15-20 Hidden Markov Model (HMM) predictions (6 columns) |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
79 ====== ================================================= |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
80 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
81 Internally the input FASTA file is divided into parts (to allow multiple processors to be used), and the proteins truncated as specified (see below). The raw output from SignalP is then reformatted into a tabular layout suitable for Galaxy (see below). |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
82 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
83 **Neural Network Scores** |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
84 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
85 For each organism class (Eukaryote, Gram-negative and Gram-positive), two different neural networks are used, one for predicting the actual signal peptide and one for predicting the position of the signal peptidase I (SPase I) cleavage site. |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
86 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
87 The NN output comprises three different scores (C-max, S-max and Y-max) and two scores derived from them (S-mean and D-score). |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
88 |
11
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
89 ====== ======= =============================================================== |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
90 Column Name Description |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
91 ------ ------- --------------------------------------------------------------- |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
92 2-4 C-score The C-score is the 'cleavage site' score. For each position in |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
93 the submitted sequence, a C-score is reported, which should |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
94 only be significantly high at the cleavage site. Confusion is |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
95 often seen with the position numbering of the cleavage site. |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
96 When a cleavage site position is referred to by a single number, |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
97 the number indicates the first residue in the mature protein, |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
98 meaning, that a predicted cleavage site between amino acid 26-27 |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
99 is reported as 27, corresponding to the mature protein starting |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
100 at (and including) position 27. |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
101 ------ ------- --------------------------------------------------------------- |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
102 5-7 S-score The S-score for the signal peptide prediction is calculated for |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
103 every single amino acid position in the submitted sequence (not |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
104 shown in the output via Galaxy), with high scores indicating |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
105 that the corresponding amino acid is part of a signal peptide, |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
106 and low scores indicating that the amino acid is part of a |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
107 mature protein. |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
108 ------ ------- --------------------------------------------------------------- |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
109 8-10 Y-max Y-max is a derivative of the C-score combined with the S-score |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
110 resulting in a better cleavage site prediction than the raw |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
111 C-score alone. This is due to the fact that multiple high-peaking |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
112 C-scores can be found in one sequence, where only one is the |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
113 true cleavage site. The cleavage site is assigned from the |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
114 Y-score where the slope of the S-score is steep and a |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
115 significant C-score is found. |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
116 ------ ------- --------------------------------------------------------------- |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
117 11-12 S-mean The S-mean is the average of the S-score, ranging from the |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
118 N-terminal amino acid to the amino acid assigned with the |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
119 highest Y-max score, thus the S-mean score is calculated for |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
120 the length of the predicted signal peptide. The S-mean score |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
121 was in SignalP version 2.0 used as the criteria for |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
122 discrimination of secretory and non-secretory proteins. |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
123 ------ ------- --------------------------------------------------------------- |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
124 13-14 D-score The D-score was introduced in SignalP version 3.0 and is a |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
125 simple average of the S-mean and Y-max score. The score shows |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
126 superior discrimination performance of secretory and |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
127 non-secretory proteins to that of the S-mean score which was |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
128 used in SignalP version 1 and 2. |
99b82a2b1272
Uploaded v0.2.0 which added PSORTb wrapper (written with Konrad Paszkiewicz)
peterjc
parents:
9
diff
changeset
|
129 ====== ======= =============================================================== |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
130 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
131 For non-secretory proteins all the scores represented in the SignalP3-NN output should ideally be very low. |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
132 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
133 **Hidden Markov Model Scores** |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
134 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
135 The hidden Markov model calculates the probability of whether the submitted sequence contains a signal peptide or not. The eukaryotic HMM model also reports the probability of a signal anchor, previously named uncleaved signal peptides. Furthermore, the cleavage site is assigned by a probability score together with scores for the n-region, h-region, and c-region of the signal peptide, if such one is found. |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
136 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
137 The 'type' column uses 'S' for a signal peptide (i.e. secretory protein) and 'Q' for non-secretory protein. |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
138 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
139 **Notes** |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
140 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
141 The raw output 'short' output from TMHMM v2.0 looks something like this (21 columns space separated - shown here formatted nicely). Notice that the identifiers are given twice, the first time truncated (as part of the NN predictions) and the second time in full (in the HMM predictions). |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
142 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
143 ==================== ===== === = ===== === = ===== === = ===== = ===== = =================================== = ===== === = ===== = |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
144 # SignalP-NN euk predictions # SignalP-HMM euk predictions |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
145 ----------------------------------------------------------------------------- ------------------------------------------------------------ |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
146 # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? D ? # name ! Cmax pos ? Sprob ? |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
147 gi|2781234|pdb|1JLY| 0.061 17 N 0.043 17 N 0.199 1 N 0.067 N 0.055 N gi|2781234|pdb|1JLY|B Q 0.000 17 N 0.000 N |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
148 gi|4959044|gb|AAD342 0.099 191 N 0.012 38 N 0.023 12 N 0.014 N 0.013 N gi|4959044|gb|AAD34209.1|AF069992_1 Q 0.000 0 N 0.000 N |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
149 gi|671626|emb|CAA856 0.139 381 N 0.020 8 N 0.121 4 N 0.067 N 0.044 N gi|671626|emb|CAA85685.1| Q 0.000 0 N 0.000 N |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
150 gi|3298468|dbj|BAA31 0.208 24 N 0.184 38 N 0.980 32 Y 0.613 Y 0.398 N gi|3298468|dbj|BAA31520.1| Q 0.066 24 N 0.139 N |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
151 ==================== ===== === = ===== === = ===== === = ===== = ===== = =================================== = ===== === = ===== = |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
152 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
153 In order to make this easier to use in Galaxy, the wrapper script simplifies this to remove the redundant column and use tabs for separation. It also includes a header line with unique column names. |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
154 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
155 =================================== ============= =========== ============ ============= =========== ============ ============= =========== ============ ============== ============= ========== ========= ======== ============== ============ ============= =============== ============== |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
156 #ID NN_Cmax_score NN_Cmax_pos NN_Cmax_pred NN_Ymax_score NN_Ymax_pos NN_Ymax_pred NN_Smax_score NN_Smax_pos NN_Smax_pred NN_Smean_score NN_Smean_pred NN_D_score NN_D_pred HMM_type HMM_Cmax_score HMM_Cmax_pos HMM_Cmax_pred HMM_Sprob_score HMM_Sprob_pred |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
157 gi|2781234|pdb|1JLY|B 0.061 17 N 0.043 17 N 0.199 1 N 0.067 N 0.055 N Q 0.000 17 N 0.000 N |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
158 gi|4959044|gb|AAD34209.1|AF069992_1 0.099 191 N 0.012 38 N 0.023 12 N 0.014 N 0.013 N Q 0.000 0 N 0.000 N |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
159 gi|671626|emb|CAA85685.1| 0.139 381 N 0.020 8 N 0.121 4 N 0.067 N 0.044 N Q 0.000 0 N 0.000 N |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
160 gi|3298468|dbj|BAA31520.1| 0.208 24 N 0.184 38 N 0.980 32 Y 0.613 Y 0.398 N Q 0.066 24 N 0.139 N |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
161 =================================== ============= =========== ============ ============= =========== ============ ============= =========== ============ ============== ============= ========== ========= ======== ============== ============ ============= =============== ============== |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
162 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
163 **Truncation** |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
164 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
165 Signal peptides are found at the start of a protein, so there is limited value in providing the full length sequence, and providing the full sequence slows down the analysis. Furthermore, SignalP has an upper bound on the sequence length it will accept (6000bp). Thus for practical reasons it is useful to truncate the proteins before passing them to SignalP. However, the precise point they are truncated does have a small influence on some score values, and thus to the results. |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
166 |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
167 **References** |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
168 |
16
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
169 If you use this Galaxy tool in work leading to a scientific publication please |
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
170 cite the following papers: |
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
171 |
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
172 Peter J.A. Cock, Björn A. Grüning, Konrad Paszkiewicz and Leighton Pritchard (2013). |
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
173 Galaxy tools and workflows for sequence analysis with applications |
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
174 in molecular plant pathology. PeerJ 1:e167 |
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
175 http://dx.doi.org/10.7717/peerj.167 |
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
176 |
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
177 Bendtsen, Nielsen, von Heijne, and Brunak (2004). |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
178 Improved prediction of signal peptides: SignalP 3.0. |
16
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
179 J. Mol. Biol., 340:783-795. |
6
a290c6d4e658
Migrated tool version 0.0.9 from old tool shed archive to new tool shed repository
peterjc
parents:
4
diff
changeset
|
180 http://dx.doi.org/10.1016/j.jmb.2004.05.028 |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
181 |
16
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
182 Nielsen, Engelbrecht, Brunak and von Heijne (1997). |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
183 Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. |
16
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
184 Protein Engineering, 10:1-6. |
6
a290c6d4e658
Migrated tool version 0.0.9 from old tool shed archive to new tool shed repository
peterjc
parents:
4
diff
changeset
|
185 http://dx.doi.org/10.1093/protein/10.1.1 |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
186 |
16
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
187 Nielsen and Krogh (1998). |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
188 Prediction of signal peptides and signal anchors by a hidden Markov model. |
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
189 Proceedings of the Sixth International Conference on Intelligent Systems for Molecular Biology (ISMB 6), |
16
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
190 AAAI Press, Menlo Park, California, pp. 122-130. |
6
a290c6d4e658
Migrated tool version 0.0.9 from old tool shed archive to new tool shed repository
peterjc
parents:
4
diff
changeset
|
191 http://www.ncbi.nlm.nih.gov/pubmed/9783217 |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
192 |
16
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
193 See also http://www.cbs.dtu.dk/services/SignalP-3.0/output.php |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
194 |
16
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
195 This wrapper is available to install into other Galaxy Instances via the Galaxy |
7de64c8b258d
Uploaded v0.2.5, MIT licence, RST for README, citation information, development moved to GitHub
peterjc
parents:
11
diff
changeset
|
196 Tool Shed at http://toolshed.g2.bx.psu.edu/view/peterjc/tmhmm_and_signalp |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
197 </help> |
17
e6cc27d182a8
Uploaded v0.2.6, embedded citations and uses $GALAXY_SLOTS
peterjc
parents:
16
diff
changeset
|
198 <citations> |
e6cc27d182a8
Uploaded v0.2.6, embedded citations and uses $GALAXY_SLOTS
peterjc
parents:
16
diff
changeset
|
199 <citation type="doi">10.7717/peerj.167</citation> |
e6cc27d182a8
Uploaded v0.2.6, embedded citations and uses $GALAXY_SLOTS
peterjc
parents:
16
diff
changeset
|
200 <citation type="doi">10.1016/j.jmb.2004.05.028</citation> |
e6cc27d182a8
Uploaded v0.2.6, embedded citations and uses $GALAXY_SLOTS
peterjc
parents:
16
diff
changeset
|
201 <citation type="doi">10.1093/protein/10.1.1</citation> |
e6cc27d182a8
Uploaded v0.2.6, embedded citations and uses $GALAXY_SLOTS
peterjc
parents:
16
diff
changeset
|
202 <!-- TODO - Add bibtex entry for PMID: 9783217 --> |
e6cc27d182a8
Uploaded v0.2.6, embedded citations and uses $GALAXY_SLOTS
peterjc
parents:
16
diff
changeset
|
203 </citations> |
0
bca9bc7fdaef
Migrated tool version 0.0.1 from old tool shed archive to new tool shed repository
peterjc
parents:
diff
changeset
|
204 </tool> |