annotate fastq_quality_filter.xml @ 1:e7c65e398bdd draft default tip

Deleted selected files
author idot
date Wed, 10 Jul 2013 06:16:21 -0400
parents 78a7d28f2a15
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
1 <tool id="cshl_fastq_quality_filter" name="Filter by quality">
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
2 <description></description>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
3
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
4 <command>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
5 cat '$input' |
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
6 fastq_quality_filter
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
7 #if $input.ext == "fastqsanger":
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
8 -Q 33
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
9 #elif $input.ext == "fastq":
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
10 -Q 64
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
11 #end if
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
12 -q $quality -p $percent -v -o '$output'
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
13 </command>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
14 <inputs>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
15 <param format="fastq,fastqsanger" name="input" type="data" label="Library to filter" />
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
16
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
17 <param name="quality" size="4" type="integer" value="20">
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
18 <label>Quality cut-off value</label>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
19 </param>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
20
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
21 <param name="percent" size="4" type="integer" value="90">
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
22 <label>Percent of bases in sequence that must have quality equal to / higher than cut-off value</label>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
23 </param>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
24 </inputs>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
25
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
26 <tests>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
27 <test>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
28 <!-- Test1: 100% of bases with quality 33 or higher (pretty steep requirement...) -->
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
29 <param name="input" value="fastq_qual_filter1.fastq" />
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
30 <param name="quality" value="33"/>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
31 <param name="percent" value="100"/>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
32 <output name="output" file="fastq_qual_filter1a.out" />
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
33 </test>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
34 <test>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
35 <!-- Test2: 80% of bases with quality 20 or higher -->
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
36 <param name="input" value="fastq_qual_filter1.fastq" />
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
37 <param name="quality" value="20"/>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
38 <param name="percent" value="80"/>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
39 <output name="output" file="fastq_qual_filter1b.out" />
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
40 </test>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
41 </tests>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
42
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
43 <outputs>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
44 <data format="input" name="output" metadata_source="input"
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
45 />
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
46 </outputs>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
47
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
48 <help>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
49 **What it does**
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
50
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
51 This tool filters reads based on quality scores.
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
52
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
53 .. class:: infomark
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
54
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
55 Using **percent = 100** requires all cycles of all reads to be at least the quality cut-off value.
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
56
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
57 .. class:: infomark
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
58
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
59 Using **percent = 50** requires the median quality of the cycles (in each read) to be at least the quality cut-off value.
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
60
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
61 --------
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
62
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
63 Quality score distribution (of all cycles) is calculated for each read. If it is lower than the quality cut-off value - the read is discarded.
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
64
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
65
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
66 **Example**::
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
67
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
68 @CSHL_4_FC042AGOOII:1:2:214:584
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
69 GACAATAAAC
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
70 +CSHL_4_FC042AGOOII:1:2:214:584
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
71 30 30 30 30 30 30 30 30 20 10
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
72
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
73 Using **percent = 50** and **cut-off = 30** - This read will not be discarded (the median quality is higher than 30).
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
74
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
75 Using **percent = 90** and **cut-off = 30** - This read will be discarded (90% of the cycles do no have quality equal to / higher than 30).
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
76
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
77 Using **percent = 100** and **cut-off = 20** - This read will be discarded (not all cycles have quality equal to / higher than 20).
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
78
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
79 ------
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
80
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
81 This tool is based on `FASTX-toolkit`__ by Assaf Gordon.
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
82
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
83 .. __: http://hannonlab.cshl.edu/fastx_toolkit/
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
84 </help>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
85 </tool>
78a7d28f2a15 Uploaded
idot
parents:
diff changeset
86 <!-- FASTQ-Quality-Filter is part of the FASTX-toolkit, by A.Gordon (gordon@cshl.edu) -->