annotate rankfilterGCMS_tabular.xml @ 3:86d41d9c4a91

removed msclust images
author pieter.lukasse@wur.nl
date Thu, 19 Mar 2015 12:09:38 +0100
parents dffc38727496
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
1 <tool id="rankfilterGCMS_tabular" name="RIQC-RankFilter GC-MS from tabular file" version="1.0.2">
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
2 <description>Convert Retention Time to Retention Index</description>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
3 <command interpreter="python">rankfilter_GCMS/rankfilter.py $input_file</command>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
4 <inputs>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
5 <param format="tabular" name="sample" type="data" label="Sample File"
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
6 help="Select a tab delimited NIST metabolite identifications file (converted from PDF)" />
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
7 <!-- question: is this calibration file not column specific as it includes RT info?? -->
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
8 <!-- this one should be input file for now:<param name="calibration" type="select" label="Calibration File"
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
9 help="Calibration file with reference masses (e.g. alkanes) with their RT and RI values"
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
10 dynamic_options='get_directory_files("tool-data/shared/PRIMS-metabolomics/RankFilter_Calibration_Files")'/>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
11 -->
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
12 <param name="calibration" format="any" type="data" label="Calibration File"
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
13 help="Calibration file containing reference masses (e.g. alkanes) with their respective RT and RI values"/>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
14
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
15 <param name="analysis_type" type="select" format="text" label="Analysis Type"
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
16 help="Select the type of analysis that has been used to generate the sample file">
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
17 <option value="NIST">NIST</option>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
18 <option value="AMDIS">AMDIS</option>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
19 </param>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
20 <param name="model" type="select" format="text" label="Select a model to be used "
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
21 help="Both linear and (3rd degree) polynomial models are available ">
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
22 <option value="linear">Linear</option>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
23 <option value="poly">Polynomial</option>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
24 </param>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
25 <param name="lib_data" type="select" label="Library"
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
26 help="Reference global lookup library file with CAS numbers and respective (previously calculated) RIsvr values"
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
27 dynamic_options='get_directory_files("tool-data/shared/PRIMS-metabolomics/RankFilter_lookup_libraries")'/>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
28
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
29 <param name="window" type="float" label="Window" value="10.56" />
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
30 </inputs>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
31 <outputs>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
32 <data format="tabular" label="${tool.name}" name="onefile" />
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
33 </outputs>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
34 <!-- file with implementation of the function get_directory_files() used above -->
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
35 <code file="match_library.py" />
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
36 <configfiles>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
37 <configfile name="input_file">
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
38 sample = ${sample}
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
39 calibration = ${calibration}
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
40 lib_data = ${lib_data}
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
41 window = ${window}
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
42 analysis_type = ${analysis_type}
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
43 tabular = True
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
44 onefile = ${onefile}
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
45 model = ${model}
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
46 </configfile>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
47 </configfiles>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
48 <help>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
49 Basically estimates the experimental RI (RIexp) by building a RI(RT) function based on the
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
50 given calibration file.
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
51
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
52 It also determines the estimated RI (RIsvr) by looking up for each entry of the given input file (Sample File),
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
53 based on its CAS number, its respective RIsvr value in the given global lookup library
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
54 (this step is also called the "RankFilter analysis" -see reference below; Sample File may be either from NIST or AMDIS).
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
55 This generates an prediction of the RI for
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
56 a compound according to the "RankFilter procedure" (RIsvr).
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
57
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
58 Output is a tab separated file in which four columns are added:
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
59
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
60 - **Rank** Calculated rank
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
61 - **RIexp** Experimental Retention Index (RI)
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
62 - **RIsvr** Calculated RI based on support vector regression (SVR)
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
63 - **%rel.err** Relative RI error (%rel.error = 100 * (RISVR − RIexp) / RIexp)
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
64
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
65 .. class:: infomark
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
66
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
67 **Notes**
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
68
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
69 - The layout of the Calibration file should include the following columns: 'MW', 'R.T.' and 'RI'.
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
70 - Selecting 'Polynomial' in the model parameter will calculate a 3rd degree polynomial model that will
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
71 be used to convert from XXXX to YYYY.
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
72
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
73 -----
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
74
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
75 **References**
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
76
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
77 - **RankFilter**: Mihaleva et. al. (2009) *Automated procedure for candidate compound selection in GC-MS
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
78 metabolomics based on prediction of Kovats retention index*. Bioinformatics, 25 (2009), pp. 787–794
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
79 </help>
dffc38727496 initial commit
pieter.lukasse@wur.nl
parents:
diff changeset
80 </tool>