annotate tools/regVariation/delete_overlapping_indels.xml @ 1:cdcb0ce84a1b

Uploaded
author xuebing
date Fri, 09 Mar 2012 19:45:15 -0500
parents 9071e359b9a3
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
1 <tool id="delete_overlapping_indels" name="Delete Overlapping Indels" version="1.0.0">
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
2 <description>from a chromosome indels file</description>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
3
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
4 <command interpreter="perl">
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
5 delete_overlapping_indels.pl $inputFile1 $inputIndelStartColumnNumber2 $inputIndelEndColumnNumber3 $outputFile1
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
6 </command>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
7
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
8 <inputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
9 <param format="tabular" name="inputFile1" type="data" label="Select indels file"/>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
10 <param type="data_column" name="inputIndelStartColumnNumber2" data_ref="inputFile1" accept_default="true" label="Choose the indel start coordinate column number" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
11 <param type="data_column" name="inputIndelEndColumnNumber3" data_ref="inputFile1" accept_default="true" label="Choose the the indel end coordinate column number" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
12 </inputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
13
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
14 <outputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
15 <data format="tabular" name="outputFile1"/>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
16 </outputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
17
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
18 <tests>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
19 <test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
20 <param name="inputFile1" value="indels1.tabular" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
21 <param name="inputIndelStartColumnNumber2" value="5" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
22 <param name="inputIndelEndColumnNumber3" value="6" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
23 <output name="outputFile1" file="non_overlapping_indels1.tabular" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
24 </test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
25 </tests>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
26
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
27 <help>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
28
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
29 .. class:: infomark
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
30
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
31 **What it does**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
32
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
33 This program detects overlapping indels in a chromosome and keeps all non-overlapping indels. As for overlapping indels, the first encountered one is kept and all others are removed.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
34 It requires three inputs:
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
35
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
36 - The first input is a TABULAR format file containing coordinates of indels in blocks extracted from multi-alignment.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
37 - The second input is an integer number representing the number of the column where indel start coordinates are stored in the input file.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
38 - The third input is an integer number representing the number of the column where indel end coordinates are stored in the input file.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
39 - The output is a TABULAR format file containing all non-overlapping indels in the input file, and the first encountered indel of overlapping ones.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
40
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
41 Note: The number of the first column is 1.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
42
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
43
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
44 **Example**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
45
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
46 Let us have the following insertions in the human genome. The start and end coordinates of insertions are on columns 5 and 6 respectively::
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
47
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
48 3 hg18.chr22_insert 3 hg18.chr22 14508610 14508612 3924 - panTro2.chr2b 132518950 132518951 3910 + rheMac2.chr17 14311798 14311799 3896 +
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
49 7 hg18.chr22_insert 13 hg18.chr22 14513678 14513690 348 - panTro2.chr2b 132517876 132517877 321 + rheMac2.chr17 14274462 14274463 337 +
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
50 7 hg18.chr22_insert 6 hg18.chr22 14513688 14513699 348 - panTro2.chr2b 132517879 132517880 321 + rheMac2.chr17 14274465 14274466 337 +
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
51 25 hg18.chr22_insert 9 hg18.chr22 14529501 14529509 385 - panTro2.chr22 14528775 14528776 376 - rheMac2.chr9 42869449 42869450 375 -
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
52 36 hg18.chr22_insert 4 hg18.chr22 14566316 14566319 540 - panTro2.chr2b 132492077 132492078 533 + rheMac2.chr10 59230438 59230439 533 -
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
53 40 hg18.chr22_insert 7 hg18.chr22 14508610 14508616 2337 - panTro2.chr2b 132487750 132487751 2313 + rheMac2.chr10 59128305 59128306 2332 +
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
54 41 hg18.chr22_insert 4 hg18.chr22 14571556 14571559 2483 - panTro2.chr2b 132485878 132485879 2481 + rheMac2.chr10 59126094 59126095 2508 +
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
55
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
56 By removing the overlapping indels which, we get::
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
57
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
58 3 hg18.chr22_insert 3 hg18.chr22 14508610 14508612 3924 - panTro2.chr2b 132518950 132518951 3910 + rheMac2.chr17 14311798 14311799 3896 +
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
59 7 hg18.chr22_insert 13 hg18.chr22 14513678 14513690 348 - panTro2.chr2b 132517876 132517877 321 + rheMac2.chr17 14274462 14274463 337 +
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
60 25 hg18.chr22_insert 9 hg18.chr22 14529501 14529509 385 - panTro2.chr22 14528775 14528776 376 - rheMac2.chr9 42869449 42869450 375 -
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
61 36 hg18.chr22_insert 4 hg18.chr22 14566316 14566319 540 - panTro2.chr2b 132492077 132492078 533 + rheMac2.chr10 59230438 59230439 533 -
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
62 41 hg18.chr22_insert 4 hg18.chr22 14571556 14571559 2483 - panTro2.chr2b 132485878 132485879 2481 + rheMac2.chr10 59126094 59126095 2508 +
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
63
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
64 </help>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
65
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
66 </tool>