Mercurial > repos > nilesh > rseqc
annotate geneBody_coverage.xml @ 48:2e6190c29c54 draft
Upload tar.gz again - toolshed removed all files (grrr...)
author | lparsons |
---|---|
date | Tue, 21 Apr 2015 10:27:06 -0400 |
parents | |
children | 6b33e31bda10 |
rev | line source |
---|---|
48
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
1 <tool id="rseqc_geneBody_coverage" name="Gene Body Converage (BAM)" version="2.4"> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
2 <description> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
3 Read coverage over gene body. |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
4 </description> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
5 <requirements> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
6 <requirement type="package" version="3.0.3">R</requirement> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
7 <requirement type="package" version="1.7.1">numpy</requirement> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
8 <requirement type="package" version="2.4">rseqc</requirement> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
9 </requirements> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
10 <command> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
11 #set $safename = ''.join(c in '_0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ' and c or '_' for c in $input.display_name) |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
12 #set $fname = "d1_" + str($safename) + ".bam" |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
13 ln -s '${input}' '${fname}' && |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
14 ln -s '${input.metadata.bam_index}' '${fname}.bai' && |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
15 echo '${fname}' > input_list.txt && |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
16 #for $i, $additional_input in enumerate($additionalinputs): |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
17 #set $index = $i+2 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
18 #set $safename = ''.join(c in '_0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ' and c or '_' for c in $additional_input.file.display_name) |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
19 #set $fname = 'd' + str($index) + '_' + str($safename) + ".bam" |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
20 ln -s '$additional_input.file' '${fname}' && |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
21 ln -s '$additional_input.file.metadata.bam_index' '${fname}.bai' && |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
22 echo '${fname}' >> input_list.txt && |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
23 #end for |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
24 geneBody_coverage.py -i input_list.txt -r $refgene --minimum_length $minimum_length -o output |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
25 </command> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
26 <stdio> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
27 <exit_code range="1:" level="fatal" description="An error occured during execution, see stderr and stdout for more information" /> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
28 <regex match="[Ee]rror" source="both" description="An error occured during execution, see stderr and stdout for more information" /> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
29 </stdio> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
30 <inputs> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
31 <param name="input" type="data" label="Additional input .bam files" format="bam" /> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
32 <repeat name="additionalinputs" title="Input .bam file"> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
33 <param name="file" type="data" label="Input .bam file" format="bam" /> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
34 </repeat> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
35 <param name="refgene" type="data" label="Reference Genome" format="bed" /> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
36 <param name="minimum_length" type="integer" value="100" label="Minimum mRNA length" help="Minimum mRNA length (bp). mRNA that are shorter than this value will be skipped (default is 100)." /> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
37 </inputs> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
38 <outputs> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
39 <data name="outputcurvespdf" format="pdf" from_work_dir="output.geneBodyCoverage.curves.pdf" label="${tool.name} on ${on_string} (Curves PDF)" /> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
40 <data name="outputheatmappdf" format="pdf" from_work_dir="output.geneBodyCoverage.heatMap.pdf" label="${tool.name} on ${on_string} (HeatMap PDF)"> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
41 <filter>len(additionalinputs) >= 2</filter> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
42 </data> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
43 <data name="outputr" format="txt" from_work_dir="output.geneBodyCoverage.r" label="${tool.name} on ${on_string} (R Script)" /> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
44 <data name="outputtxt" format="txt" from_work_dir="output.geneBodyCoverage.txt" label="${tool.name} on ${on_string} (Text)" /> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
45 </outputs> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
46 <help> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
47 geneBody_coverage.py |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
48 ++++++++++++++++++++ |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
49 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
50 Read coverage over gene body. This module is used to check if read coverage is uniform and if there is any 5\'/3\' bias. This module scales all transcripts to 100 nt and calculates the number of reads covering each nucleotide position. Finally, it generates plots illustrating the coverage profile along the gene body. |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
51 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
52 If 3 or more BAM files were provided. This program generates a lineGraph and a heatmap. If fewer than 3 BAM files were provided, only lineGraph is generated. See below for examples. |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
53 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
54 When heatmap is generated, samples are ranked by the "skewness" of the coverage: Sample with best (worst) coverage will be displayed at the top (bottom) of the heatmap. |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
55 Coverage skewness was measured by `Pearson’s skewness coefficients <http://en.wikipedia.org/wiki/Skewness#Pearson.27s_skewness_coefficients>`_ |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
56 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
57 Inputs |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
58 ++++++++++++++ |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
59 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
60 Input BAM/SAM file |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
61 Alignment file in BAM/SAM format. |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
62 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
63 Reference gene model |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
64 Gene Model in BED format. |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
65 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
66 Minimum mRNA length |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
67 Minimum mRNA length (bp). mRNA that are shorter than this value will be skipped (default is 100). |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
68 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
69 Outputs |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
70 ++++++++++++++ |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
71 Text |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
72 Table that includes the data used to generate the plots |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
73 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
74 R Script |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
75 R script file that reads the data and generates the plot |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
76 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
77 PDF |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
78 The final plot, in PDF format |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
79 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
80 Example plots: |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
81 .. image:: http://rseqc.sourceforge.net/_images/Aug_26.geneBodyCoverage.curves.png |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
82 :height: 600 px |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
83 :width: 600 px |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
84 :scale: 80 % |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
85 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
86 .. image:: http://rseqc.sourceforge.net/_images/Aug_26.geneBodyCoverage.heatMap.png |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
87 :height: 600 px |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
88 :width: 600 px |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
89 :scale: 80 % |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
90 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
91 ----- |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
92 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
93 About RSeQC |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
94 +++++++++++ |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
95 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
96 The RSeQC_ package provides a number of useful modules that can comprehensively evaluate high throughput sequence data especially RNA-seq data. "Basic modules" quickly inspect sequence quality, nucleotide composition bias, PCR bias and GC bias, while "RNA-seq specific modules" investigate sequencing saturation status of both splicing junction detection and expression estimation, mapped reads clipping profile, mapped reads distribution, coverage uniformity over gene body, reproducibility, strand specificity and splice junction annotation. |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
97 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
98 The RSeQC package is licensed under the GNU GPL v3 license. |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
99 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
100 .. image:: http://rseqc.sourceforge.net/_static/logo.png |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
101 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
102 .. _RSeQC: http://rseqc.sourceforge.net/ |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
103 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
104 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
105 |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
106 </help> |
2e6190c29c54
Upload tar.gz again - toolshed removed all files (grrr...)
lparsons
parents:
diff
changeset
|
107 </tool> |