annotate fastq_quality_filter.xml @ 1:57adb07cccfc

Removed excess version attributes.
author Dave Bouvier <dave@bx.psu.edu>
date Tue, 03 Dec 2013 12:36:13 -0500
parents b145b9b26648
children e41385662e5e
rev   line source
0
b145b9b26648 Uploaded tool tarball.
devteam
parents:
diff changeset
1 <tool id="cshl_fastq_quality_filter" version="1.0.0" name="Filter by quality">
2 <description></description>
3 <requirements>
4 <requirement type="package" version="0.0.13">fastx_toolkit</requirement>
5 </requirements>
6 <command>zcat -f '$input' | fastq_quality_filter -q $quality -p $percent -v -o $output
7 #if $input.ext == "fastqsanger":
8 -Q 33
9 #end if
10 </command>
11
12 <inputs>
1
57adb07cccfc Removed excess version attributes.
Dave Bouvier <dave@bx.psu.edu>
parents: 0
diff changeset
13 <param format="fastqsolexa,fastqsanger" name="input" type="data" label="Library to filter" />
0
14
1
15 <param name="quality" size="4" type="integer" value="20">
0
16 <label>Quality cut-off value</label>
17 </param>
18
1
19 <param name="percent" size="4" type="integer" value="90">
0
20 <label>Percent of bases in sequence that must have quality equal to / higher than cut-off value</label>
21 </param>
22 </inputs>
23
24 <tests>
25 <test>
26 <!-- Test1: 100% of bases with quality 33 or higher (pretty steep requirement...) -->
1
27 <param name="input" value="fastq_qual_filter1.fastq" ftype="fastqsolexa" />
28 <param name="quality" value="33"/>
29 <param name="percent" value="100"/>
30 <param name="output" file="fastq_qual_filter1a.out" />
0
31 </test>
32 <test>
33 <!-- Test2: 80% of bases with quality 20 or higher -->
1
34 <param name="input" value="fastq_qual_filter1.fastq" ftype="fastqsolexa"/>
35 <param name="quality" value="20"/>
36 <param name="percent" value="80"/>
37 <param name="output" file="fastq_qual_filter1b.out" />
0
38 </test>
39 </tests>
40
41 <outputs>
1
42 <data format="input" name="output" metadata_source="input" />
0
43 </outputs>
44
45 <help>
46 **What it does**
47
48 This tool filters reads based on quality scores.
49
50 .. class:: infomark
51
52 Using **percent = 100** requires every cycle of a read to be at least the quality cut-off value; any read with a lower-quality cycle is discarded.
53
54 .. class:: infomark
55
56 Using **percent = 50** requires the median quality of the cycles (in each read) to be at least the quality cut-off value.
57
58 --------
59
60 For each read, the tool calculates the percentage of cycles whose quality is equal to / higher than the quality cut-off value. If that percentage is lower than the **percent** setting, the read is discarded.
61
62
63 **Example**::
64
65 @CSHL_4_FC042AGOOII:1:2:214:584
66 GACAATAAAC
67 +CSHL_4_FC042AGOOII:1:2:214:584
68 30 30 30 30 30 30 30 30 20 10
69
70 Using **percent = 50** and **cut-off = 30** - This read will not be discarded (the median quality is 30, which meets the cut-off).
71
72 Using **percent = 90** and **cut-off = 30** - This read will be discarded (only 80% of the cycles have quality equal to / higher than 30, short of the required 90%).
73
74 Using **percent = 100** and **cut-off = 20** - This read will be discarded (not all cycles have quality equal to / higher than 20).
75
76 ------
77
78 This tool is based on `FASTX-toolkit`__ by Assaf Gordon.
79
80 .. __: http://hannonlab.cshl.edu/fastx_toolkit/
81 </help>
82 <!-- FASTQ-Quality-Filter is part of the FASTX-toolkit, by A.Gordon (gordon@cshl.edu) -->
83 </tool>
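
The filtering rule described in the tool's help text can be sketched in a few lines of Python. This is an illustration only, not the FASTX-toolkit implementation: the function name `passes_quality_filter` is hypothetical, the qualities are assumed to be already decoded to integers, and the handling of an empty read is an assumption. It replays the three cases from the help example.

```python
def passes_quality_filter(qualities, cutoff, percent):
    """Return True if at least `percent` % of the scores are >= `cutoff`.

    `qualities` is a list of integer quality scores, one per cycle.
    Hypothetical helper; the real tool may round differently.
    """
    if not qualities:
        # Assumption: an empty read never passes the filter.
        return False
    good = sum(1 for q in qualities if q >= cutoff)
    # Compare with integers to avoid floating-point rounding.
    return good * 100 >= percent * len(qualities)


# Quality line of the example read from the help text.
read = [30, 30, 30, 30, 30, 30, 30, 30, 20, 10]

print(passes_quality_filter(read, cutoff=30, percent=50))   # True: 80% >= 50%
print(passes_quality_filter(read, cutoff=30, percent=90))   # False: 80% < 90%
print(passes_quality_filter(read, cutoff=20, percent=100))  # False: 90% < 100%
```

In the sketch, `cutoff` and `percent` play the roles of the `-q` and `-p` options passed on the tool's command line.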