comparison false_discovery_rate.xml @ 2:cf0d72c7b482 draft

Update.
author galaxyp
date Fri, 10 May 2013 17:31:05 -0400
parents
children
comparison
equal deleted inserted replaced
1:5c65f8116244 2:cf0d72c7b482
1 <tool id="openms_false_discvoerage_rate" version="0.1.0" name="False Discovery Rate">
2 <description>
3 Tool to estimate the false discovery rate on peptide and protein level.
4 </description>
5 <macros>
6 <import>macros.xml</import>
7 </macros>
8 <expand macro="stdio" />
9 <expand macro="requires" />
10 <command interpreter="python">
11 openms_wrapper.py --executable 'FalseDiscoveryRate' --config $config
12 </command>
13 <configfiles>
14 <configfile name="config">[simple_options]
15 #if $input.type == "combined"
16 in=$input1
17 algorithm!decoy_string=$input.decoy_string
18 #else
19 fwd_in=$input.forward_input
20 rev_in=$input.forward_input
21 #end if
22 out=$output
23 algorithm!q_value=$q_value
24 algorithm!use_all_hits=$use_all_hits
25 algorithm!split_charge_variants=$split_charge_variants
26 algorithm!add_decoy_peptides=$add_decoy_peptides
27 </configfile>
28 </configfiles>
29 <inputs>
30 <conditional name="input">
31 <param name="type" label="Input Type" type="select">
32 <option value="combined">One search against combined target/decoy database</option>
33 <option value="separate">Two searches from separate target and decoy databases.</option>
34 </param>
35 <when value="combined">
36 <param name="input1" label="Identification Input" type="data" format="idxml" />
37 <param name="decoy_string" type="text" value="_rev" label="Decoy string"/>
38 </when>
39 <when value="separate">
40 <param name="forward_input" label="Forward Identification Input" type="data" format="idxml" />
41 <param name="reverse_input" label="Reverse Identification Input" type="data" format="idxml" />
42 </when>
43 </conditional>
44 <param name="q_value" label="Compute Q-Value" type="boolean" truevalue="true" falsevalue="false" checked="true" help="If 'true', the q-values will be calculated instead of the FDRs." />
45 <param name="use_all_hits" label="Use All Hits" type="boolean" truevalue="true" falsevalue="false" help="If 'true' not only the first hit, but all are used (peptides only)." />
46 <param name="split_charge_variants" label="Split Charge Variants" type="boolean" truevalue="true" falsevalue="false" help="If set to 'true' charge variants are treated separately (for peptides of combined target/decoy searches only)." checked="false" />
47 <param name="add_decoy_peptides" type="boolean" label="Add Decoy Peptides" help="If set to true, decoy peptides will be written to output file, too. The q-value is set to the closest target score." truevalue="true" falsevalue="false" checked="false" />
48
49 </inputs>
50 <outputs>
51 <data format="idxml" name="output" />
52 </outputs>
53 <help>
54 **What it does**
55
56 This TOPP tool can calculate the false discovery rate (FDR) given a forward and backward search (or one run on a combined database). Most useful is this on protein level, however, it also can be applied to peptides.
57
58 The false discovery rate is defined as the number of false discoveries (the hits in the reversed search) over the number of false and correct discoveries (the hits in both databases) with a score better than a given threshold.
59
60 Prerequisites:
61
62 When using a combined database of forward and reverse hits (thus only using one search run per ID engine), then use PeptideIndexer to index an idXML file generated by a search engine adapter, e.g. MascotAdapter. This will allow us to discern which peptides are from the target vs. decoy database.
63 When no decoy hits were found you will get a warning saying something like:
64 "FalseDiscoveryRate: #decoy sequences is zero! Setting all target sequences to q-value/FDR 0!"
65 This should be a serious concern, since the target/decoy annotation in a previous step has probably a misconfigured database (see PeptideIndexer).
66
67 **Citation**
68
69 For the underlying tool, please cite ``Marc Sturm, Andreas Bertsch, Clemens Gröpl, Andreas Hildebrandt, Rene Hussong, Eva Lange, Nico Pfeifer, Ole Schulz-Trieglaff, Alexandra Zerck, Knut Reinert, and Oliver Kohlbacher, 2008. OpenMS – an Open-Source Software Framework for Mass Spectrometry. BMC Bioinformatics 9: 163. doi:10.1186/1471-2105-9-163.``
70
71 If you use this tool in Galaxy, please cite Chilton J, et al. https://bitbucket.org/galaxyp/galaxyp-toolshed-openms
72 </help>
73 </tool>