comparison read_quality.xml @ 45:eb339c5849bb draft

Reupload, toolshed removed all files of previous version.
author lparsons
date Fri, 26 Sep 2014 15:04:18 -0400
parents
children 6b33e31bda10
comparison
equal deleted inserted replaced
44:20e0b2dd7882 45:eb339c5849bb
1 <tool id="rseqc_read_quality" name="Read Quality" version="2.4">
2 <description>determines Phred quality score</description>
3 <requirements>
4 <requirement type="package" version="3.0.3">R</requirement>
5 <requirement type="package" version="1.7.1">numpy</requirement>
6 <requirement type="package" version="2.4">rseqc</requirement>
7 </requirements>
8 <command>
9 read_quality.py -i $input -o output -r $reduce
10 </command>
11 <stdio>
12 <exit_code range="1:" level="fatal" description="An error occured during execution, see stderr and stdout for more information" />
13 <regex match="[Ee]rror" source="both" description="An error occured during execution, see stderr and stdout for more information" />
14 </stdio>
15 <inputs>
16 <param name="input" type="data" format="bam,sam" label="input bam/sam file" />
17 <param name="reduce" type="integer" label="Ignore Phred scores less than this amount (only applies to 'boxplot', default=1000)" value="1000" />
18 </inputs>
19 <outputs>
20 <data format="txt" name="outputr" from_work_dir="output.qual.r" label="${tool.name} on ${on_string} (R Script)" />
21 <data format="pdf" name="outputpdf" from_work_dir="output.qual.heatmap.pdf" label="${tool.name} on ${on_string} (Heatmap PDF)" />
22 <data format="pdf" name="outputpdf" from_work_dir="output.qual.boxplot.pdf" label="${tool.name} on ${on_string} (Boxplot PDF)" />
23 </outputs>
24 <help>
25 read_quality.py
26 +++++++++++++++
27
28 According to SAM specification, if Q is the character to represent "base calling quality"
29 in SAM file, then Phred Quality Score = ord(Q) - 33. Here ord() is python function that
30 returns an integer representing the Unicode code point of the character when the argument
31 is a unicode object, for example, ord('a') returns 97. Phred quality score is widely used
32 to measure "reliability" of base-calling, for example, phred quality score of 20 means
33 there is 1/100 chance that the base-calling is wrong, phred quality score of 30 means there
34 is 1/1000 chance that the base-calling is wrong. In general: Phred quality score = -10xlog(10)P,
35 here P is probability that base-calling is wrong.
36
37 Inputs
38 ++++++++++++++
39
40 Input BAM/SAM file
41 Alignment file in BAM/SAM format.
42
43 Ignore phred scores less than this number (default=1000)
44 To avoid making huge vector in R, nucleotide with certain phred score represented less than this number will be ignored. Increase this number save more memory while reduce precision. This option only applies to the 'boxplot'.
45
46 Output
47 ++++++++++++++
48
49 1. output.qual.r
50 2. output.qual.boxplot.pdf
51 .. image:: http://rseqc.sourceforge.net/_images/36mer.qual.plot.png
52 :height: 600 px
53 :width: 600 px
54 :scale: 80 %
55 3. output.qual.heatmap.pdf
56 .. image:: http://rseqc.sourceforge.net/_images/36mer.qual.heatmap.png
57 :height: 600 px
58 :width: 600 px
59 :scale: 80 %
60
61 Heatmap: use different color to represent nucleotide density ("blue"=low density,"orange"=median density,"red"=high density")
62
63 -----
64
65 About RSeQC
66 +++++++++++
67
68 The RSeQC_ package provides a number of useful modules that can comprehensively evaluate high throughput sequence data especially RNA-seq data. "Basic modules" quickly inspect sequence quality, nucleotide composition bias, PCR bias and GC bias, while "RNA-seq specific modules" investigate sequencing saturation status of both splicing junction detection and expression estimation, mapped reads clipping profile, mapped reads distribution, coverage uniformity over gene body, reproducibility, strand specificity and splice junction annotation.
69
70 The RSeQC package is licensed under the GNU GPL v3 license.
71
72 .. image:: http://rseqc.sourceforge.net/_static/logo.png
73
74 .. _RSeQC: http://rseqc.sourceforge.net/
75
76
77 </help>
78 </tool>