annotate minfi_ppquantile.xml @ 16:491cf4ada3a3 draft

Uploaded
author kpbioteam
date Fri, 22 Feb 2019 08:15:31 -0500
parents f47e5cca1696
children 9c6fbb7d5a2a
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
7
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
1 <tool id="minfi_ppquantile" name="Minfi Preprocess Quantile" version="@MINFI_VERSION@">
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
2 <description>implements stratified quantile normalization preprocessing</description>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
3 <macros>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
4 <import>macros.xml</import>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
5 </macros>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
6 <expand macro="requirements">
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
7 <requirement type="package" version="0.6.0">bioconductor-illuminahumanmethylation450kanno.ilmn12.hg19</requirement>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
8 </expand>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
9 <command detect_errors="exit_code">
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
10 <![CDATA[
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
11 Rscript '$minfi_pp_script'
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
12 ]]>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
13 </command>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
14 <configfiles>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
15 <configfile name="minfi_pp_script"><![CDATA[
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
16 require("minfi", quietly = TRUE)
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
17 RGSet <- get(load('$rgset'))
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
18
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
19 GRSet <- preprocessQuantile(RGSet, fixOutliers = TRUE,
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
20 removeBadSamples = TRUE, badSampleCutoff = 10.5,
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
21 quantileNormalize = TRUE, stratified = TRUE,
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
22 mergeManifest = FALSE, sex = NULL)
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
23
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
24 save(GRSet,file = '$grset')
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
25
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
26 ]]>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
27 </configfile>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
28 </configfiles>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
29 <inputs>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
30 <param type="data" name="rgset" format="rdata" label="RGChannelSet" help="These classes represents raw (unprocessed) data from a two color micro array; specifically an Illumina methylation array." />
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
31 </inputs>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
32 <outputs>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
33 <data name="grset" format="rdata" label="GenomicRatioSet"/>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
34 </outputs>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
35 <tests>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
36 <test>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
37 <param name="rgset" value="RGChannelSet.rdata"/>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
38 <output name="grset" file="QuantileGenomicRatioSet.rdata"/>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
39 </test>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
40 </tests>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
41 <help><![CDATA[
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
42 The normalization procedure is applied to the Meth and Unmeth intensities separately. The distribution of type I and type II signals is forced to be the same by first quantile normalizing the type II probes across samples and then interpolating a reference distribution to which we normalize the type I probes. Since probe types and probe regions are confounded and we know that DNAm distributions vary across regions we stratify the probes by region before applying this interpolation.
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
43 ]]></help>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
44 <expand macro="citations" />
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
45 </tool>
f47e5cca1696 Uploaded
kpbioteam
parents:
diff changeset
46