annotate minfi_analysis.xml @ 82:369fef559cfc draft default tip

"planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 9363395728213b6d82e606c5513709c54af4df09"
author kpbioteam
date Sun, 23 Feb 2020 17:00:42 -0500
parents 8ab24a5229bd
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
1 <?xml version='1.0'?>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
2 <tool id='minfi_analysis' name='Infinium Human Methylation BeadChip' version='2.1.0'>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
3 <description>Determines differentially methylated regions and positions from Infinium Methylation Assays</description>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
4 <macros>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
5 <import>macros.xml</import>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
6 </macros>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
7 <expand macro='requirements'>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
8 <requirement type='package' version='0.6.0'>bioconductor-illuminahumanmethylation450kanno.ilmn12.hg19</requirement>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
9 </expand>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
10 <command detect_errors='exit_code'><![CDATA[
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
11 #for $counter, $input in enumerate($files_red):
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
12 #set $redname = str( getattr( $input, 'element_identifier', 'sample' ) ).replace( '/', '-' ).replace( '\t', '-' )
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
13 ln -s $input ./${redname} &&
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
14 #end for
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
15 #for $counter, $input in enumerate($files_grn):
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
16 #set $grnname = str( getattr( $input, 'element_identifier', 'sample' ) ).replace( '/', '-' ).replace( '\t', '-' )
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
17 ln -s $input ./${grnname} &&
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
18 #end for
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
19 Rscript '$minfi_analysis_script'
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
20 ]]></command>
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
21 <configfiles>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
22 <configfile name='minfi_analysis_script'><![CDATA[
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
23 require('minfi', quietly = TRUE)
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
24 require('IlluminaHumanMethylation27kanno.ilmn12.hg19', quietly = TRUE)
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
25 require('IlluminaHumanMethylation450kanno.ilmn12.hg19', quietly = TRUE)
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
26 require('IlluminaHumanMethylationEPICanno.ilm10b4.hg19', quietly = TRUE)
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
27 options(warn = -1)
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
28 RGSet <- read.metharray(list.files(pattern='_Red.idat')) #load .IDAT files
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
29
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
30 MSet <- preprocessRaw(RGSet) #create objects contains CpGs signals
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
31
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
32 qc <- getQC(MSet)
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
33 write.table(qc, '$qctab') #optional - provides a simple quality control matrix and plot
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
34 png('$qcpng')
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
35 plotQC(qc)
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
36 dev.off()
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
37
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
38 RSet <- ratioConvert(MSet, what = 'both', keepCN = TRUE) #store Beta values and/or M values
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
39 GRSet <- mapToGenome(RSet)
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
40
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
41
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
42 if ('$optpp' == 'na' ) {
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
43 GRSet <- mapToGenome(RSet) #mapping Ilumina methylation array data to the genome
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
44 } else if ('$optpp' == 'ppfun' ) {
82
369fef559cfc "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 9363395728213b6d82e606c5513709c54af4df09"
kpbioteam
parents: 81
diff changeset
45 GRSet <- preprocessFunnorm(RGSet) #optional - implements the functional normalisation algorithm
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
46 } else if ('$optpp' == 'ppq' ) {
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
47 GRSet <- preprocessQuantile(RGSet, fixOutliers = TRUE,
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
48 removeBadSamples = TRUE, badSampleCutoff = 10.5,
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
49 quantileNormalize = TRUE, stratified = TRUE,
82
369fef559cfc "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 9363395728213b6d82e606c5513709c54af4df09"
kpbioteam
parents: 81
diff changeset
50 mergeManifest = FALSE, sex = NULL) #optional - implements stratified quantile normalisation preprocessing
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
51 } else if ('$optpp' == 'ppsnp' ) {
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
52 snps <- getSnpInfo(GRSet) #optional - retrieve the chromosome and the position of each SNP
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
53 write.table(snps, '$table')
82
369fef559cfc "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 9363395728213b6d82e606c5513709c54af4df09"
kpbioteam
parents: 81
diff changeset
54 GRSet <- dropLociWithSnps(GRSet, snps=c('SBE','CpG'), maf=0) #optional - drop the probes that contain either an SNP at the CpG interrogation or at the single nucleotide extensions
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
55 }
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
56 pheno <- read.table('$phenotype_table',skip = 1)
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
57 group <- pheno\$V2
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
58 pair <- factor(pheno\$V3)
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
59
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
60 design.matrix <- model.matrix(~ group + pair)
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
61
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
62 maxGap <- as.numeric('$maxgap_size')
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
63 if(is.null(GRSet\$cluster)){
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
64 cluster = NULL
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
65 maxGap = maxGap
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
66 } else {
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
67 cluster = GRSet\$cluster
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
68 maxGap = NULL
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
69 }
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
70
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
71 dmrs <- bumphunter(GRSet,
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
72 design = design.matrix,
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
73 cluster = cluster,
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
74 maxGap = maxGap,
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
75 cutoff = as.numeric('$cutoff_size'),
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
76 nullMethod = '$null_method',
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
77 B = as.numeric('$number_of_resamples'))
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
78 dmrGR <- dmrs\$table[,c(1,2,3)]
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
79 colnames(dmrGR) <- c('chr','start','end')
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
80 write.table(dmrGR, file= '$dmr', quote = FALSE,col.names = TRUE, row.names = FALSE, sep = '\t')
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
81
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
82 tab <- read.table('$ucsc_genome')
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
83 tab <- tab[,-(11:14),drop=FALSE]
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
84 tab <- tab[,c(1,4,5,10)]
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
85 colnames(tab) <- c('chr','start','end','names')
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
86
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
87 dmp <- dmpFinder(dat = getBeta(GRSet),pheno = read.table('$phenotype_table',skip=1)[,'V2'], type = '$phenotype', qCutoff = as.numeric('$q_cutoff'), shrinkVar = '$variance_shrinkage')
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
88 dmp[,'names'] <- rownames(dmp)
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
89 data <- merge(dmp, tab, by='names',sort = TRUE)
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
90 data <- data[,c(6,7,8,1,4,5)]
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
91 write.table(data, file= '$dmp', quote = FALSE,col.names = TRUE, row.names = FALSE, sep = '\t')
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
92
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
93 ]]></configfile>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
94 </configfiles>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
95 <inputs>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
96 <param type='data' name='files_red' multiple='true' format='idat' label='Red .IDAT files' help='Red .IDAT files extension is followed by the unmethylated signal intensity read in the red channel.'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
97 <param type='data' name='files_grn' multiple='true' format='idat' label='Green .IDAT files' help='Green .IDAT files extension is followed by the methylated signal intensity read in the green channel.'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
98 <param name='optpp' type='select' label='(Optional) Preprocessing Method' help='Mapping Ilumina methylation array data to the genome with or without additional preprocessing methods.'>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
99 <option value='na'>No Selection (use default)</option>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
100 <option value='ppfun'>Preprocess Funnorm</option>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
101 <option value='ppq'>Preprocess Quantile</option>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
102 <option value='ppsnp'>Remove SNPs</option>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
103 </param>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
104 <param type='data' name='phenotype_table' format='tabular' label='Phenotype Table' help='Phenotype Table must include the following information: sampleID, phenotype and paird or unpaired samples column.'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
105 <param name='maxgap_size' type='integer' value='250' label='maxGap Size' help='If cluster is not provided this maximum location gap will be used to define cluster.'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
106 <param name='cutoff_size' type='float' value='0.1' label='Cutoff Size' help='A numeric value. Values of the estimate of the genomic profile above the cutoff or below the negative of the cutoff will be used as candidate regions. It is possible to give two separate values (upper and lower bounds). If one value is given, the lower bound is minus the value.'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
107 <param name='number_of_resamples' type='integer' value='0' label='Number of Resamples' help='An integer denoting the number of resamples to use when computing null distributions. This defaults to 0. If permutations is supplied that defines the number of permutations/bootstraps and B is ignored.'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
108 <param name='null_method' type='select' label='null Method' help='Method used to generate null candidate regions (defaults to &#x2018;permutation&#x2019;). Note that for cases with more than one covariate the permutation approach is not generally recommended. '>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
109 <option value='permutation' selected='True'>permutation</option>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
110 <option value='bootstrap'>bootstrap</option>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
111 </param>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
112 <param type='data' name='phenotype_table' format='tabular' label='Phenotype Table' help='Table of compared samples and their characteristics, may be categorical (e.g. cancer vs. normal) or continuous (e.g. blood pressure).'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
113 <param name='phenotype' type='select' label='Phenotype Type'>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
114 <option value='categorical'>categorical</option>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
115 <option value='continuous'>continuous</option>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
116 </param>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
117 <param name='q_cutoff' type='float' value='1' label='qCutoff Size' help='DMPs with an FDR q-value greater than this will not be returned.'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
118 <param name='variance_shrinkage' type='boolean' truevalue='TRUE' falsevalue='FALSE' label='Variance Shrinkage' help='Enable variance shrinkage is recommended when sample sizes are small.'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
119 <param type='data' name='ucsc_genome' format='gtf' label='Genome Table' help='Reference Sequence e.g. wgEncodeHaibMethyl450Gm12878SitesRep1.'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
120 </inputs>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
121 <outputs>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
122 <data name='qctab' format='txt' label='Quality Control Report'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
123 <data name='qcpng' format='png' label='Quality Control Plot'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
124 <data name='table' format='txt' label='SNPInfo Table'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
125 <data name='dmr' format='bed' label='Differentially Methylated Regions'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
126 <data name='dmp' format='bed' label='Differentially Methylated Positions'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
127 </outputs>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
128 <tests>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
129 <test>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
130 <param name='files_red' value='GSM1588707_8795207119_R06C02_Red.idat,GSM1588706_8795207135_R02C02_Red.idat,GSM1588705_8795207119_R05C02_Red.idat,GSM1588704_8795207135_R01C02_Red.idat' ftype='idat'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
131 <param name='files_grn' value='GSM1588707_8795207119_R06C02_Grn.idat,GSM1588706_8795207135_R02C02_Grn.idat,GSM1588705_8795207119_R05C02_Grn.idat,GSM1588704_8795207135_R01C02_Grn.idat' ftype='idat'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
132 <param name='optpp' value='ppsnp'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
133 <param name='grset' value='GRSet_without_SNPs.rdata'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
134 <param name='phenotype_table' value='phenotypeTable.txt'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
135 <param name='maxgap_size' value='250'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
136 <param name='cutoff_size' value='0.1'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
137 <param name='number_of_resamples' value='0'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
138 <param name='null_method' value='permutation'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
139 <param name='grset' value='GRSet_without_SNPs.rdata'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
140 <param name='phenotype' value='categorical'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
141 <param name='q_cutoff' value='1'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
142 <param name='variance_shrinkage' value='FALSE'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
143 <param name='ucsc_genome' value='ucsc.gtf'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
144 <output name='qctab' file='Quality_Control_Report.txt'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
145 <output name='qcpng' file='Quality_Control_Plot.png' compare='sim_size'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
146 <output name='table' file='SNPInfo_Table.txt'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
147 <output name='dmr' file='Differentially_Methylated_Regions.bed'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
148 <output name='dmp' file='Differentially_Methylated_Positions.bed'/>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
149 </test>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
150 </tests>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
151 <help><![CDATA[
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
152
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
153 .. class:: infomark
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
154
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
155 **What it does**
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
156
82
369fef559cfc "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 9363395728213b6d82e606c5513709c54af4df09"
kpbioteam
parents: 81
diff changeset
157 The workflow combines 5 main steps, starting with raw intensity data loading (.idat) and then optional preprocessing and normalisation of the data. The next quality control step performs an additional sample check to remove low-quality data, which normalisation cannot detect. The workflow gives the user the opportunity to perform any of these preparation and data cleaning steps, including the highly recommended genetic variation annotation step resulting in single nucleotide polymorphism identification and removal. Finally, the dataset generated through all of these steps can be used to hunt (find) differentially-methylated positions (DMP)and regions (DMR) with respect to a phenotype covariate.
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
158
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
159 ***Inputs***
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
160
82
369fef559cfc "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 9363395728213b6d82e606c5513709c54af4df09"
kpbioteam
parents: 81
diff changeset
161 *Series of .IDAT files*: matching red and green .idat file for each sample on the chip intensity data.
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
162
82
369fef559cfc "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 9363395728213b6d82e606c5513709c54af4df09"
kpbioteam
parents: 81
diff changeset
163 *(optional) Preprocessing Methods*: by this step probes can be stratified by region via quantile normalisation or by extended implementation of functional normalisation recommended for cases where global changes are expected such as in cancer-normal comparisons. In addition unwanted probes containing either an SNP at the CpG interrogation or at the single nucleotide extension can be removed (recommended).
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
164
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
165 *Phenotype Table*: table of compared samples and their characteristics, may be categorical (e.g. cancer vs. normal) or continuous (e.g. blood pressure).
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
166
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
167 ========== ============== ===============
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
168 Accession Sensitivity Treatment
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
169 ---------- -------------- ---------------
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
170 GSM1588704 sensitive MAPKi
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
171 ---------- -------------- ---------------
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
172 GSM1588705 sensitive MAPKi
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
173 ---------- -------------- ---------------
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
174 GSM1588706 resistant BRAFi
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
175 ---------- -------------- ---------------
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
176 GSM1588707 resistant BRAFi
78
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
177 ========== ============== ===============
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
178
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
179 *Note*: phenotype covariate table must include the following information:
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
180 sampleID/Accession, phenotype and paird or unpaired samples column
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
181
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
182 *Genome Table*: a reference genome that contains the nucleotide sequence of the chromosomes, It is representative of a specific genome build and release.
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
183
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
184 ***Outputs***
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
185
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
186 *Quality Control Report and Plot*: quality control (QC) outputs plot of the log median intensity in both the methylated (M) and unmethylated (U) channels. When plotting these two medians against each other the good samples cluster together, while failed samples tend to separate and have lower median intensities.
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
187
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
188 *(optional) SNPInfo Table*: matrix of the chromosome and the position of each SNP on a given Affymetrix SNP Array.
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
189
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
190 *Differentially Methylated Regions*: consecutive genomic locations differentially methylated in the same direction save as multiple track lines in a single BED file.
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
191
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
192 *Differentially Methylated Positions*: single genomic position that has a different methylated level in two different groups of samples (or conditions) save as multiple track lines in a single BED file.
e67f424d3f42 Uploaded Analysis Pipeline
kpbioteam
parents:
diff changeset
193 ]]></help>
81
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
194 <citations>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
195 <citation type='doi'>10.18129/B9.bioc.illuminaio</citation>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
196 </citations>
8ab24a5229bd "planemo upload for repository https://github.com/kpbioteam/ewas_galaxy commit 323de18b21453fa652ae3abb7e847704e13a65ed"
kpbioteam
parents: 78
diff changeset
197 </tool>