Mercurial > repos > eschen42 > w4mcorcov
comparison w4mcorcov.xml @ 6:7bd523ca1f9a draft
planemo upload for repository https://github.com/HegemanLab/w4mcorcov_galaxy_wrapper/tree/master commit cafda5095a79ce2376325b57337302f95137195d
author | eschen42 |
---|---|
date | Wed, 18 Jul 2018 12:35:55 -0400 |
parents | 50f60f94c034 |
children | 066b1f409e9f |
comparison
equal
deleted
inserted
replaced
5:50f60f94c034 | 6:7bd523ca1f9a |
---|---|
1 <tool id="w4mcorcov" name="OPLS-DA_Contrasts" version="0.98.8"> | 1 <tool id="w4mcorcov" name="OPLS-DA_Contrasts" version="0.98.10"> |
2 | 2 |
3 <description>OPLS-DA Contrasts of Univariate Results</description> | 3 <description>OPLS-DA Contrasts of Univariate Results</description> |
4 | 4 |
5 <macros> | 5 <macros> |
6 <xml name="paramPairSigFeatOnly"> | 6 <xml name="paramPairSigFeatOnly"> |
7 <param | 7 <param |
8 name="pairSigFeatOnly" | 8 name="pairSigFeatOnly" |
9 type="boolean" | 9 type="boolean" |
11 truevalue="TRUE" | 11 truevalue="TRUE" |
12 falsevalue="FALSE" | 12 falsevalue="FALSE" |
13 label="Retain only pairwise-significant features" | 13 label="Retain only pairwise-significant features" |
14 help="When this option is set to 'Yes', analysis will be performed including only features that differ significantly for the pair of levels being contrasted; when set to 'No', any feature that varies significantly across all levels will be included (i.e., exclude any feature that is not significantly different across all levels). See examples below." /> | 14 help="When this option is set to 'Yes', analysis will be performed including only features that differ significantly for the pair of levels being contrasted; when set to 'No', any feature that varies significantly across all levels will be included (i.e., exclude any feature that is not significantly different across all levels). See examples below." /> |
15 </xml> | 15 </xml> |
16 <xml name="cplots"> | |
17 <param name="cplot_y" label="C-plot Y-axis" type="select" help="Choose the Y-axis for C-plots."> | |
18 <option value="correlation">Plot VIP versus correlation</option> | |
19 <option value="covariance">Plot VIP versus covariance</option> | |
20 </param> | |
21 <param | |
22 name="cplot_p" | |
23 type="boolean" | |
24 checked="TRUE" | |
25 truevalue="TRUE" | |
26 falsevalue="FALSE" | |
27 label="Produce predictor C-plot" | |
28 help="When this option is set to 'Yes', correlation will be plotted against vip4 for predictor loadings." /> | |
29 <param | |
30 name="cplot_o" | |
31 type="boolean" | |
32 checked="TRUE" | |
33 truevalue="TRUE" | |
34 falsevalue="FALSE" | |
35 label="Produce orthogonal C-plot" | |
36 help="When this option is set to 'Yes', correlation will be plotted against vip4 for orthogonal loadings." /> | |
37 </xml> | |
16 </macros> | 38 </macros> |
17 | 39 |
18 <requirements> | 40 <requirements> |
19 <requirement type="package">r-batch</requirement> | 41 <requirement type="package">r-batch</requirement> |
20 <requirement type="package">bioconductor-ropls</requirement> | 42 <requirement type="package">bioconductor-ropls</requirement> |
21 </requirements> | 43 </requirements> |
22 | 44 |
23 <stdio> | 45 <stdio> |
24 <exit_code range="1:" level="fatal" /> | 46 <exit_code range="1:" level="fatal" /> |
25 </stdio> | 47 </stdio> |
26 | 48 |
27 <command><![CDATA[ | 49 <command><![CDATA[ |
38 pairSigFeatOnly "$signif_test.pairSigFeatOnly" | 60 pairSigFeatOnly "$signif_test.pairSigFeatOnly" |
39 #end if | 61 #end if |
40 levCSV '$levCSV' | 62 levCSV '$levCSV' |
41 matchingC '$matchingC' | 63 matchingC '$matchingC' |
42 labelFeatures '$labelFeatures' | 64 labelFeatures '$labelFeatures' |
65 #if str( $xplots.expPlot ) == "none": | |
66 cplot_p "FALSE" | |
67 cplot_o "FALSE" | |
68 cplot_y "correlation" | |
69 #else if str( $xplots.expPlot ) == "cplot": | |
70 cplot_p "$xplots.cplot_p" | |
71 cplot_o "$xplots.cplot_o" | |
72 cplot_y "$xplots.cplot_y" | |
73 #end if | |
43 contrast_detail '$contrast_detail' | 74 contrast_detail '$contrast_detail' |
44 contrast_corcov '$contrast_corcov' | 75 contrast_corcov '$contrast_corcov' |
45 contrast_salience '$contrast_salience' | 76 contrast_salience '$contrast_salience' |
46 ]]></command> | 77 ]]></command> |
47 | 78 |
53 <conditional name="signif_test"> | 84 <conditional name="signif_test"> |
54 <param name="tesC" label="Univariate significance-test" type="select" help="Either 'none' or the name of the statistical test that was run by the 'Univariate' tool to produce the variableMetadata file; that name must also be a portion of the column names in that file."> | 85 <param name="tesC" label="Univariate significance-test" type="select" help="Either 'none' or the name of the statistical test that was run by the 'Univariate' tool to produce the variableMetadata file; that name must also be a portion of the column names in that file."> |
55 <option value="none">none - Display all features from variableMetadata (rather than choosing a subset based on significance in univariate testing)</option> | 86 <option value="none">none - Display all features from variableMetadata (rather than choosing a subset based on significance in univariate testing)</option> |
56 <option value="ttest">ttest - Student's t-test (parametric test, qualitative factor with exactly 2 levels)</option> | 87 <option value="ttest">ttest - Student's t-test (parametric test, qualitative factor with exactly 2 levels)</option> |
57 <option value="anova">anova - Analysis of variance (parametric test, qualitative factor with more than 2 levels)</option> | 88 <option value="anova">anova - Analysis of variance (parametric test, qualitative factor with more than 2 levels)</option> |
58 <option value="wilcoxon">wilcoxon - Wilcoxon rank test (nonparametric test, qualitative factor with exactly 2 levels)</option> | 89 <option value="wilcoxon">wilcoxon - Wilcoxon rank test (nonparametric test, qualitative factor with exactly 2 levels)</option> |
59 <option value="kruskal">kruskal - Kruskal-Wallis rank test (nonparametric test, qualitative factor with more than 2 levels)</option> | 90 <option value="kruskal">kruskal - Kruskal-Wallis rank test (nonparametric test, qualitative factor with more than 2 levels)</option> |
60 </param> | 91 </param> |
61 <when value="none" /> | 92 <when value="none" /> |
62 <when value="ttest"> | 93 <when value="ttest"> |
63 <expand macro="paramPairSigFeatOnly" /> | 94 <expand macro="paramPairSigFeatOnly" /> |
102 <option value="none">do no generic matching (default)</option> | 133 <option value="none">do no generic matching (default)</option> |
103 <option value="wildcard" selected="true">use wild-cards for matching level-names</option> | 134 <option value="wildcard" selected="true">use wild-cards for matching level-names</option> |
104 <option value="regex">use regular expressions for matching level-names</option> | 135 <option value="regex">use regular expressions for matching level-names</option> |
105 </param> | 136 </param> |
106 <param name="labelFeatures" type="text" value="3" label="How many features having extreme loadings should be labelled on cov-vs.-cor plot" help="Specify the number of features at each of the loading-extremes that should be labelled (with the name of the feature) on the covariance-vs.-correlation plot; specify 'ALL' to label all features or '0' to label no features; this choice has no effect on the OPLS-DA loadings plot."/> | 137 <param name="labelFeatures" type="text" value="3" label="How many features having extreme loadings should be labelled on cov-vs.-cor plot" help="Specify the number of features at each of the loading-extremes that should be labelled (with the name of the feature) on the covariance-vs.-correlation plot; specify 'ALL' to label all features or '0' to label no features; this choice has no effect on the OPLS-DA loadings plot."/> |
138 <conditional name="xplots"> | |
139 <param name="expPlot" label="Extra plots to include" type="select" help="Choosing 'none' hides further choices."> | |
140 <option value="none">none - Do not include additonal extra plots.</option> | |
141 <option value="cplot">cplot - Include C-plots (predictor-loading vs. 'vip4p' and orthogonal-loading versus 'vip4o')</option> | |
142 </param> | |
143 <when value="none" /> | |
144 <when value="cplot"> | |
145 <expand macro="cplots" /> | |
146 </when> | |
147 </conditional> | |
107 </inputs> | 148 </inputs> |
108 | 149 |
109 <outputs> | 150 <outputs> |
110 <!-- | 151 <!-- |
111 pdf1: summaries of each contrasts, clearly labelled by level=pair name | 152 pdf1: summaries of each contrasts, clearly labelled by level=pair name |
135 --> | 176 --> |
136 <data name="contrast_salience" label="${tool.name}_${variableMetadata_in.name}_salience" format="tabular" /> | 177 <data name="contrast_salience" label="${tool.name}_${variableMetadata_in.name}_salience" format="tabular" /> |
137 </outputs> | 178 </outputs> |
138 | 179 |
139 <tests> | 180 <tests> |
181 <!-- test #1 --> | |
140 <test> | 182 <test> |
141 <param name="dataMatrix_in" value="input_dataMatrix.tsv"/> | 183 <param name="dataMatrix_in" value="input_dataMatrix.tsv"/> |
142 <param name="sampleMetadata_in" value="input_sampleMetadata.tsv"/> | 184 <param name="sampleMetadata_in" value="input_sampleMetadata.tsv"/> |
143 <param name="variableMetadata_in" value="input_variableMetadata.tsv"/> | 185 <param name="variableMetadata_in" value="input_variableMetadata.tsv"/> |
144 <param name="tesC" value="kruskal"/> | 186 <param name="tesC" value="kruskal"/> |
158 <has_text text="vip4p" /> | 200 <has_text text="vip4p" /> |
159 <has_text text="vip4o" /> | 201 <has_text text="vip4o" /> |
160 <has_text text="level1Level2Sig" /> | 202 <has_text text="level1Level2Sig" /> |
161 <!-- first matched line --> | 203 <!-- first matched line --> |
162 <has_text text="M349.2383T700" /> | 204 <has_text text="M349.2383T700" /> |
163 <has_text text="-0.05007" /> | 205 <has_text text="-0.04051509" /> |
164 <has_text text="-5.8455" /> | 206 <has_text text="-0.001964912" /> |
165 <has_text text="0.0961269" /> | 207 <has_text text="0.02106343" /> |
166 <has_text text="0.1848301" /> | 208 <has_text text="0.2446366813" /> |
167 <!-- second matched line --> | 209 <!-- second matched line --> |
168 <has_text text="M207.9308T206" /> | 210 <has_text text="M207.9308T206" /> |
169 <has_text text="-0.2967565" /> | 211 <has_text text="0.504885262" /> |
170 <has_text text="-19.56942" /> | 212 <has_text text="0.020749097" /> |
171 <has_text text="1.6023" /> | 213 <has_text text="0.207196379" /> |
172 <has_text text="1.35368" /> | 214 <has_text text="0.04438632" /> |
173 <!-- third matched line --> | 215 <!-- third matched line --> |
174 <has_text text="M211.0607T263" /> | 216 <has_text text="M211.0607T263" /> |
175 <has_text text="0.47052" /> | 217 <has_text text="0.0680900" /> |
176 <has_text text="15.910087" /> | 218 <has_text text="0.0020163" /> |
177 <has_text text="0.89838" /> | 219 <has_text text="0.0201345" /> |
178 <has_text text="0.125372" /> | 220 <has_text text="0.0690773" /> |
179 </assert_contents> | 221 </assert_contents> |
180 </output> | 222 </output> |
181 <output name="contrast_salience"> | 223 <output name="contrast_salience"> |
182 <assert_contents> | 224 <assert_contents> |
183 <!-- column-labels line --> | 225 <!-- column-labels line --> |
198 <has_text text="9999" /> | 240 <has_text text="9999" /> |
199 <has_text text="12.87766096" /> | 241 <has_text text="12.87766096" /> |
200 </assert_contents> | 242 </assert_contents> |
201 </output> | 243 </output> |
202 </test> | 244 </test> |
245 <!-- test #2 --> | |
203 <test> | 246 <test> |
204 <param name="dataMatrix_in" value="input_dataMatrix.tsv"/> | 247 <param name="dataMatrix_in" value="input_dataMatrix.tsv"/> |
205 <param name="sampleMetadata_in" value="input_sampleMetadata.tsv"/> | 248 <param name="sampleMetadata_in" value="input_sampleMetadata.tsv"/> |
206 <param name="variableMetadata_in" value="input_variableMetadata.tsv"/> | 249 <param name="variableMetadata_in" value="input_variableMetadata.tsv"/> |
207 <param name="tesC" value="kruskal"/> | 250 <param name="tesC" value="kruskal"/> |
220 <has_text text="covariance" /> | 263 <has_text text="covariance" /> |
221 <has_text text="vip4p" /> | 264 <has_text text="vip4p" /> |
222 <has_text text="vip4o" /> | 265 <has_text text="vip4o" /> |
223 <has_text text="level1Level2Sig" /> | 266 <has_text text="level1Level2Sig" /> |
224 <!-- first matched line --> | 267 <!-- first matched line --> |
225 <has_text text="M349.2383T700" /> | 268 <has_text text="M200.005T296" /> |
226 <has_text text="-0.99601577" /> | 269 <has_text text="-0.2803571" /> |
227 <has_text text="-947.55795176" /> | 270 <has_text text="-0.0115899" /> |
228 <!-- second matched line --> | 271 <has_text text="0.1157346" /> |
229 <has_text text="M207.9308T206" /> | 272 <has_text text="0.0647860" /> |
230 <has_text text="0.688549" /> | |
231 <has_text text="58.22352" /> | |
232 <has_text text="1.394687" /> | |
233 <has_text text="0.06049885" /> | |
234 <!-- third matched line --> | |
235 <has_text text="M211.0607T263" /> | |
236 <has_text text="-0.572018" /> | |
237 <has_text text="-14.57769" /> | |
238 <has_text text="0.7780899" /> | |
239 <has_text text="0.3678166776" /> | |
240 </assert_contents> | 273 </assert_contents> |
241 </output> | 274 </output> |
242 <output name="contrast_salience"> | 275 <output name="contrast_salience"> |
243 <assert_contents> | 276 <assert_contents> |
244 <!-- column-labels line --> | 277 <!-- column-labels line --> |
259 <has_text text="9999" /> | 292 <has_text text="9999" /> |
260 <has_text text="12.87766096" /> | 293 <has_text text="12.87766096" /> |
261 </assert_contents> | 294 </assert_contents> |
262 </output> | 295 </output> |
263 </test> | 296 </test> |
297 <!-- test #3 --> | |
264 <test> | 298 <test> |
265 <param name="dataMatrix_in" value="input_dataMatrix.tsv"/> | 299 <param name="dataMatrix_in" value="input_dataMatrix.tsv"/> |
266 <param name="sampleMetadata_in" value="input_sampleMetadata.tsv"/> | 300 <param name="sampleMetadata_in" value="input_sampleMetadata.tsv"/> |
267 <param name="variableMetadata_in" value="input_variableMetadata.tsv"/> | 301 <param name="variableMetadata_in" value="input_variableMetadata.tsv"/> |
268 <param name="tesC" value="none"/> | 302 <param name="tesC" value="none"/> |
280 <has_text text="covariance" /> | 314 <has_text text="covariance" /> |
281 <has_text text="vip4p" /> | 315 <has_text text="vip4p" /> |
282 <has_text text="vip4o" /> | 316 <has_text text="vip4o" /> |
283 <!-- first matched line --> | 317 <!-- first matched line --> |
284 <has_text text="M349.2383T700" /> | 318 <has_text text="M349.2383T700" /> |
285 <has_text text="-0.64331257" /> | 319 <has_text text="-0.4732226" /> |
286 <has_text text="-161.82220" /> | 320 <has_text text="-0.0506172" /> |
287 <has_text text="1.862455" /> | 321 <has_text text="0.5246766" /> |
288 <has_text text="0.2105143" /> | 322 <has_text text="0.0103341" /> |
289 <!-- second matched line --> | 323 <!-- second matched line --> |
290 <has_text text="M207.9308T206" /> | 324 <has_text text="M207.9308T206" /> |
291 <has_text text="-0.313507" /> | 325 <has_text text="0.4927151" /> |
292 <has_text text="-20.0476" /> | 326 <has_text text="0.0203715" /> |
293 <has_text text="1.6956987" /> | 327 <has_text text="0.2111623" /> |
294 <has_text text="1.19247" /> | 328 <has_text text="0.0488654" /> |
295 <!-- third matched line --> | |
296 <has_text text="M211.0607T263" /> | |
297 <has_text text="-0.38986114" /> | |
298 <has_text text="-23.747718" /> | |
299 <has_text text="1.064296856" /> | |
300 <has_text text="1.16507455" /> | |
301 </assert_contents> | 329 </assert_contents> |
302 </output> | 330 </output> |
303 <output name="contrast_salience"> | 331 <output name="contrast_salience"> |
304 <assert_contents> | 332 <assert_contents> |
305 <!-- column-labels line --> | 333 <!-- column-labels line --> |
320 <has_text text="9999" /> | 348 <has_text text="9999" /> |
321 <has_text text="12.87766096" /> | 349 <has_text text="12.87766096" /> |
322 </assert_contents> | 350 </assert_contents> |
323 </output> | 351 </output> |
324 </test> | 352 </test> |
353 <!-- test #4 --> | |
354 <test> | |
355 <param name="dataMatrix_in" value="issue1_input_dataMatrix.tsv"/> | |
356 <param name="sampleMetadata_in" value="issue1_input_sampleMetadata.tsv"/> | |
357 <param name="variableMetadata_in" value="issue1_input_variableMetadata.tsv"/> | |
358 <param name="tesC" value="none"/> | |
359 <param name="facC" value="tissue_flowering"/> | |
360 <param name="labelFeatures" value="3"/> | |
361 <param name="levCSV" value="*"/> | |
362 <param name="matchingC" value="wildcard"/> | |
363 <output name="contrast_corcov"> | |
364 <assert_contents> | |
365 <!-- column-labels line --> | |
366 <has_text text="featureID" /> | |
367 <has_text text="factorLevel1" /> | |
368 <has_text text="factorLevel2" /> | |
369 <has_text text="correlation" /> | |
370 <has_text text="covariance" /> | |
371 <has_text text="vip4p" /> | |
372 <has_text text="vip4o" /> | |
373 <!-- first matched line --> | |
374 <has_text text="NM516T251" /> | |
375 <has_text text="flower_yes" /> | |
376 <has_text text="other" /> | |
377 <has_text text="0.3490559" /> | |
378 <has_text text="0.0260147" /> | |
379 <has_text text="0.4377872" /> | |
380 <has_text text="0.5916089" /> | |
381 <has_text text="0.0260147" /> | |
382 <has_text text="0.0438942" /> | |
383 <has_text text="516.0845" /> | |
384 <has_text text="250.8762" /> | |
385 </assert_contents> | |
386 </output> | |
387 <output name="contrast_salience"> | |
388 <assert_contents> | |
389 <!-- column-labels line --> | |
390 <has_text text="featureID" /> | |
391 <has_text text="salientLevel" /> | |
392 <has_text text="salientRCV" /> | |
393 <has_text text="salience" /> | |
394 <has_text text="mz" /> | |
395 <has_text text="rt" /> | |
396 <!-- first matched line --> | |
397 <has_text text="PM518T369" /> | |
398 <has_text text="flower_yes" /> | |
399 <has_text text="0.58655260" /> | |
400 <has_text text="4.414469" /> | |
401 <has_text text="518.1656" /> | |
402 <has_text text="368.59817" /> | |
403 </assert_contents> | |
404 </output> | |
405 </test> | |
325 </tests> | 406 </tests> |
326 <help><![CDATA[ | 407 <help><![CDATA[ |
327 | 408 |
328 **Run PLS-DA Contrasts of Univariate Results** | 409 **Run PLS-DA Contrasts of Univariate Results** |
329 ---------------------------------------------- | 410 ---------------------------------------------- |
354 For instance, suppose that you ran Kruskal-Wallis testing for a column named 'cluster' in sampleMetadata that has values 'k1' and 'k2' and at least one other value. | 435 For instance, suppose that you ran Kruskal-Wallis testing for a column named 'cluster' in sampleMetadata that has values 'k1' and 'k2' and at least one other value. |
355 | 436 |
356 - A column of variableMetadata would be labelled 'cluster_kruskal_sig' and would have values '1' and '0'; when the samples are grouped by 'cluster', '1' means that there is strong evidence against the hypothesis that there is no difference among the intensities for the feature across all sample-groups. | 437 - A column of variableMetadata would be labelled 'cluster_kruskal_sig' and would have values '1' and '0'; when the samples are grouped by 'cluster', '1' means that there is strong evidence against the hypothesis that there is no difference among the intensities for the feature across all sample-groups. |
357 - A column of variableMetadata would be labelled 'cluster_kruskal_k1.k2_sig' and would have values '1' and '0', where '1' means that there is significant evidence against the hypothesis that samples from sampleMetadata whose 'cluster' column contains 'k1' or 'k2' have the same intensity for that feature. | 438 - A column of variableMetadata would be labelled 'cluster_kruskal_k1.k2_sig' and would have values '1' and '0', where '1' means that there is significant evidence against the hypothesis that samples from sampleMetadata whose 'cluster' column contains 'k1' or 'k2' have the same intensity for that feature. |
358 | 439 |
359 The 'PLS-DA Contrasts' tool produces graphics and data for OPLS-DA contrasts of feature-intensities between significantly different pairs of factor-levels. For each factor-level, the tool performs a contrast with all other factor-levels combined and then separately with each other factor-level. | 440 The 'PLS-DA Contrasts' tool produces graphics and data for OPLS-DA contrasts of feature-intensities between significantly different pairs of factor-levels. For each factor-level, the tool performs a contrast with all other factor-levels combined and then separately with each other factor-level. |
360 | 441 |
361 **Along the left-to-right axis, the plots show the supervised projection of the variation explained by the predictor** (i.e., the factor specified when invoking the tool); **the top-to-bottom axis displays the variation that is orthogonal to the predictor level** (i.e., independent of it). | 442 **Along the left-to-right axis, the plots show the supervised projection of the variation explained by the predictor** (i.e., the factor specified when invoking the tool); **the top-to-bottom axis displays the variation that is orthogonal to the predictor level** (i.e., independent of it). |
362 | 443 |
363 Although this tool can be used in a purely exploratory manner by supplying the variableMetadata file without the columns added by the W4M 'Univariate' tool, **the preferred workflow is to use univariate testing to exclude features that are not significantly different and use OPLS-DA to visualize the differences identified in univariate testing** (Th]]>é<![CDATA[venot *et al.*, 2015); an appropriate exception would be to visualize contrasts of a specific list of metabolites. | 444 Although this tool can be used in a purely exploratory manner by supplying the variableMetadata file without the columns added by the W4M 'Univariate' tool, **the preferred workflow may be to use univariate testing to exclude features that are not significantly different and then to use OPLS-DA to visualize the differences identified in univariate testing** (Th]]>é<![CDATA[venot *et al.*, 2015); an appropriate exception would be to visualize contrasts of a specific list of metabolites. |
364 | 445 |
365 It must be stressed that there may be no *single* definitive computational approach to select features that are reliable biomarkers, especially from a small number of samples or experiments. A few possible choices are: | 446 It must be stressed that there may be no *single* definitive computational approach to select features that are reliable biomarkers, especially from a small number of samples or experiments. A few possible choices are: |
366 | 447 |
367 - picking features with maximum loadings along the projection parallel to the predictor (loadp), | 448 - picking features with maximum loadings along the projection parallel to the predictor (loadp), |
368 - examining extreme values on S-PLOTs (for which covariance is linearly related to loadp), | 449 - examining extreme values on S-PLOTs (for which covariance is linearly related to loadp), |
369 - examining "variable importance in projection VIP for OPLS-DA" (Galindo-Prieto *et al.* 2014), and | 450 - examining "variable importance in projection VIP for OPLS-DA" (Galindo-Prieto *et al.* 2014), and |
370 - examining a feature's "selectivity ratio" (Rajalahti *et al.*, 2009). | 451 - examining a feature's "selectivity ratio" (Rajalahti *et al.*, 2009). |
371 | 452 |
372 In this spirit, this tool reports the S-PLOT covariance and correlation (Wiklund *op. cit.*) and VIP metrics, and it introduces an informal "salience" metric to flag features that may merit attention without dimensional reduction; future versions may add selectivity ratio. | 453 In this spirit, this tool reports the S-PLOT covariance and correlation (Wiklund *op. cit.*) and VIP metrics, and it introduces an informal "salience" metric to flag features that may merit attention without dimensional reduction; future versions may add selectivity ratio. |
373 | 454 |
374 For a more systematic approach to biomarker identification, please consider the W4M 'biosigner' tool (Rinuardo *et al.* 2016), which applies three different identification metrics to the selection process. | 455 For a more systematic approach to biomarker identification, please consider the W4M 'biosigner' tool (Rinuardo *et al.* 2016), which applies three different identification metrics to the selection process. |
375 | 456 |
376 Regardless of how any potential biomarker is identified, further validation analysis (e.g., independent confirmatory experiments) is needed before it is recommended for general application. | 457 Regardless of how any potential biomarker is identified, further validation analysis (e.g., independent confirmatory experiments) is needed before it is recommended for general application. |
377 | 458 |
451 | | 532 | |
452 | 533 |
453 [OUT] Contrast-detail output PDF | 534 [OUT] Contrast-detail output PDF |
454 | Several plots for each two-projection OPLS-DA analysis: | 535 | Several plots for each two-projection OPLS-DA analysis: |
455 | 536 |
456 - (top-left) **correlation-versus-covariance plot** of OPLS-DA results (a work-alike for the S-PLOT, computed using formula in Supplement to Wiklund, *op. cit.*); point-color becomes saturated as the "variable importance in projection to the predictive components" (VIP\ :subscript:`4,p` from Galindo-Prieto *et al.* 2014) ranges from 0.83 and 1.21 (Mehmood *et al.* 2012) | 537 - (first row, left) **correlation-versus-covariance plot** of OPLS-DA results (a work-alike for the S-PLOT, computed using formula in Supplement to Wiklund, *op. cit.*); point-color becomes saturated as the "variable importance in projection to the predictive components" (VIP\ :subscript:`4,p` from Galindo-Prieto *et al.* 2014) ranges from 0.83 and 1.21 (Mehmood *et al.* 2012), for use to identify features for consideration as biomarkers. |
457 - (bottom-left) **model-overview plot** for the two projections; grey bars are the correlation coefficient for the fitted data; black bars indicate performance in cross-validation tests (Th]]>é<![CDATA[venot, 2017) | 538 - (second row, left) **model-overview plot** for the two projections; grey bars are the correlation coefficient for the fitted data; black bars indicate performance in cross-validation tests (Th]]>é<![CDATA[venot, 2017) |
458 - (top-right) OPLS-DA **scores-plot** for the two projections (Th]]>é<![CDATA[venot *et al.*, 2015) | 539 - (first row, right) OPLS-DA **scores-plot** for the two projections (Th]]>é<![CDATA[venot *et al.*, 2015) |
459 - (bottom-right) OPLS-DA **loadings-plot** for the two projections (*ibid.*) | 540 - (second row, right) **correlation-versus-covariance plot** of OPLS-DA results **orthogonal to the predictor** (see section "S-Plot of Orthogonal Component" in Wiklund, *op. cit.*, pp. 120-121; this characterizes features with the greatest variation independent of the predictor). |
541 - (third row, left, when "**predictor C-plot**" is chosen under "Extra plots to include") plot of the covariance or correlation vs. the VIP for the projection *parallel to the predictor*, for use to identify features for consideration as biomarkers. | |
542 - (third row, right, when "**orthogonal C-plot**" is chosen under "Extra plots to include") plotof the covariance or correlation vs. the VIP for the projection *orthogonal to the predictor*, for use to identify features varying considerably without regard to the predictor. | |
460 | 543 |
461 [OUT] Contrast Correlation-Covarinace data TABULAR | 544 [OUT] Contrast Correlation-Covarinace data TABULAR |
462 | A tab-separated values file of metadata for each feature for each contrast in which it was included. | 545 | A tab-separated values file of metadata for each feature for each contrast in which it was included. |
463 | Thus, a given feature may appear many times, but *the combination of featureID, factorLevel1, and factorLevel2 will be unique.* | 546 | Thus, a given feature may appear many times, but *the combination of featureID, factorLevel1, and factorLevel2 will be unique.* |
464 | This file has the following columns: | 547 | This file has the following columns: |
473 - **loadp** - variable loading for the predictive projection (Wiklund *op. cit.*) | 556 - **loadp** - variable loading for the predictive projection (Wiklund *op. cit.*) |
474 - **loado** - variable loading for the orthogonal projection (*ibid.*) | 557 - **loado** - variable loading for the orthogonal projection (*ibid.*) |
475 - **level1Level2Sig** - (Only present when a test other than "none" is chosen) '1' when feature varies significantly across all classes (i.e., not pair-wise); '0' otherwise | 558 - **level1Level2Sig** - (Only present when a test other than "none" is chosen) '1' when feature varies significantly across all classes (i.e., not pair-wise); '0' otherwise |
476 | 559 |
477 [OUT] Feature "Salience" data TABULAR | 560 [OUT] Feature "Salience" data TABULAR |
478 | Metrics for the "salient level" for each feature, i.e., the level at which the feature is more prominent than any other level. This is *not* at all related to the SIMCA OPLS-DA S-PLOT; rather, it is intended as a potential (and unproven) way to identify features that may suggest potential biomarkers without dimensional reduction of data. This is a tab-separated values file having the following columns: | 561 | Metrics for the "salient level" for each feature, i.e., the level at which the feature is more prominent than any other level. This is *not* at all related to the SIMCA OPLS-DA S-PLOT; rather, it is intended as a potential way to discover features for consideration as potential biomarkers without dimensionally reducting the data. This is a tab-separated values file having the following columns: |
479 | 562 |
480 - **featureID** - feature identifier | 563 - **featureID** - feature identifier |
481 - **salientLevel** - salient level, i.e., for the feature, the class-level having the greatest median intensity | 564 - **salientLevel** - salient level, i.e., for the feature, the class-level having the greatest median intensity |
482 - **salientRCV** - salient robust coefficient of variation, i.e., for the feature, the mean absolute deviation of the intensity for the salient level divided by the median intensity for the salient level | 565 - **salientRCV** - salient robust coefficient of variation, i.e., for the feature, the mean absolute deviation of the intensity for the salient level divided by the median intensity for the salient level |
483 - **salience** - salience, i.e., for the feature, the median of the class-level having the greatest intensity divided by the mean of the medians for all class-levels | 566 - **salience** - salience, i.e., for the feature, the median of the class-level having the greatest intensity divided by the mean of the medians for all class-levels |
648 OPLS-DA, SIMCA, and S-PLOT are registered trademarks of the Umetrics company. http://umetrics.com/about-us/trademarks | 731 OPLS-DA, SIMCA, and S-PLOT are registered trademarks of the Umetrics company. http://umetrics.com/about-us/trademarks |
649 | 732 |
650 | 733 |
651 Release notes | 734 Release notes |
652 ------------- | 735 ------------- |
736 | |
737 0.98.10 | |
738 | |
739 - new feature: C-plots of VIP versus correlation or relative covariance. | |
740 - bug fix: Handle issue 2 - features now are only pareto-scaled per Wikland *op cit.*. | |
741 | |
742 0.98.9 | |
743 | |
744 - bug fix: Handle issue 1 - handle features removed by ropls because variance is less than 2.2e-16. | |
653 | 745 |
654 0.98.8 | 746 0.98.8 |
655 | 747 |
656 - new feature: Replace loadings plot with correlation-versus-covariance plot for orthogonal features, i.e., the consistency of features influencing within-treatment variation (which is linearly related to the loading of the orthogonal projection) versus consistency. This eliminates the need for the parameter to suppress labels for features with extreme orthogonal loadings | 748 - new feature: Replace loadings plot with correlation-versus-covariance plot for orthogonal features, i.e., the consistency of features influencing within-treatment variation (which is linearly related to the loading of the orthogonal projection) versus consistency. This eliminates the need for the parameter to suppress labels for features with extreme orthogonal loadings |
657 | 749 |