annotate seqid_uncollapser.xml @ 1:e7c65e398bdd draft default tip

Deleted selected files
author idot
date Wed, 10 Jul 2013 06:16:21 -0400
parents 78a7d28f2a15
rev   line source
0
78a7d28f2a15 Uploaded
idot
<tool id="cshl_seqid_uncollapser" name="Uncollapse rows">
    <description>containing collapsed sequence IDs</description>
    <command>
        cat '$input' |
        fastx_uncollapser -c $idcol -v -o '$output'
    </command>
    <inputs>
        <param format="tabular,pslx" name="input" type="data" label="Library to uncollapse" />
        <param name="idcol" label="Column with collapsed sequence identifier" type="data_column" data_ref="input" accept_default="false">
            <help>This column contains the sequence ID from a collapsed FASTA file, in the form "(seq number)-(read count)" (e.g. 15-4). Use 10 if you're analyzing BLAT output.</help>
        </param>
    </inputs>
    <tests>
        <test>
            <param name="input" value="fastx_seqid_uncollapse1.psl" />
            <param name="idcol" value="10" />
            <output name="output" file="fastx_seqid_uncollapse1.out" />
        </test>
    </tests>

    <outputs>
        <data format="input" name="output" metadata_source="input" />
    </outputs>
    <help>

**What it does**

This tool reads each row of a table containing a collapsed sequence ID and repeats the row as many times as the ID's read count.

.. class:: warningmark

You must specify the column containing the collapsed sequence ID (e.g. 15-4).

--------

**Example Input File**

The following input file contains two collapsed sequence identifiers in column 10, *84-2* and *87-5* (meaning the first has a multiplicity count of 2 and the second has a multiplicity count of 5)::

    23 0 0 0 0 0 0 0 + 84-2 ...
    22 0 0 0 0 0 0 0 + 87-5 ...

**Output Example**

After **uncollapsing** (on column 10), the line of the first sequence identifier is repeated *twice*, and the line of the second sequence identifier is repeated *five* times::

    23 0 0 0 0 0 0 0 + 84-2 ...
    23 0 0 0 0 0 0 0 + 84-2 ...
    22 0 0 0 0 0 0 0 + 87-5 ...
    22 0 0 0 0 0 0 0 + 87-5 ...
    22 0 0 0 0 0 0 0 + 87-5 ...
    22 0 0 0 0 0 0 0 + 87-5 ...
    22 0 0 0 0 0 0 0 + 87-5 ...

Uncollapsing a text file allows analyses of collapsed FASTA files to be used with any tool that doesn't 'understand' collapsed multiplicity counts.

.. class:: infomark

See the *Collapse* tool in the *FASTA Manipulation* category for more details about collapsing FASTA files.

-----

This tool is based on `FASTX-toolkit`__ by Assaf Gordon.

.. __: http://hannonlab.cshl.edu/fastx_toolkit/

    </help>
</tool>
<!-- FASTX-Uncollapser is part of the FASTX-toolkit, by A.Gordon (gordon@cshl.edu) -->
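The uncollapsing step described in the tool's help can be illustrated with a short Python sketch. This is not the fastx_uncollapser implementation, only a minimal model of the behaviour the help text describes: it assumes tab-separated rows, a 1-based column number as passed to `-c`, and IDs of the form "(seq number)-(read count)". The function name `uncollapse_rows` and the choice to raise an error on malformed IDs are hypothetical; the real tool may treat such rows differently.

```python
import re
from typing import Iterable, Iterator

# Assumed ID shape, taken from the help text: "(seq number)-(read count)", e.g. "15-4".
COLLAPSED_ID = re.compile(r"^\d+-(\d+)$")


def uncollapse_rows(lines: Iterable[str], idcol: int) -> Iterator[str]:
    """Yield each tab-separated row once per read counted in column `idcol` (1-based).

    Hypothetical helper for illustration; not part of FASTX-toolkit.
    """
    for line in lines:
        row = line.rstrip("\n")
        fields = row.split("\t")
        if not 1 <= idcol <= len(fields):
            raise ValueError(f"row has no column {idcol}: {row!r}")
        match = COLLAPSED_ID.match(fields[idcol - 1])
        if match is None:
            raise ValueError(f"column {idcol} is not a collapsed ID: {row!r}")
        # The number after the dash is the multiplicity (read) count.
        for _ in range(int(match.group(1))):
            yield row


if __name__ == "__main__":
    # The two example rows from the help text, rebuilt as tab-separated fields.
    rows = [
        "\t".join(["23"] + ["0"] * 7 + ["+", "84-2", "..."]),
        "\t".join(["22"] + ["0"] * 7 + ["+", "87-5", "..."]),
    ]
    for out_row in uncollapse_rows(rows, idcol=10):
        print(out_row)
```

Run on the two example rows with column 10, the sketch emits the first row twice and the second row five times, matching the output example in the help text.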