annotate mgescan.xml @ 0:803c7c39993e draft

Uploaded
author hyungrolee
date Sat, 14 Jun 2014 19:00:14 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
1 <?xml version="1.0"?>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
2
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
3 <tool name="MGEScan" id="mgescan" version="0.0.1" workflow_compatible="false">
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
4 <description>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
5 MGEScan
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
6 </description>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
7 <command interpreter="bash">
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
8 mgescan.sh $input '$input.name' 3 $output $program $clade $qvalue_en $qvalue_rt $ltr_gff3 $nonltr_gff3
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
9 <!-- mgescan.sh $input $input.name $hmmver $output $program $clade $qvalue_en $qvalue_rt $ltr_gff3 $nonltr_gff3 -->
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
10 </command>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
11 <inputs>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
12 <param format="txt" name="input" type="data" label="From"/>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
13 <!--param name="hmmver" type="select" label="Hmmsearch version">
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
14 <option selected="selected" value="3">3</option>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
15 <option value="2">2</option>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
16 </param-->
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
17 <param name="program" type="select" label="MGEScan">
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
18 <option selected="selected" value="B">Both</option>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
19 <option value="L">LTR</option>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
20 <option value="N">nonLTR</option>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
21 </param>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
22 </inputs>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
23 <outputs>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
24 <data format="ltr.out" name="output">
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
25 <filter>program != "N"</filter>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
26 </data>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
27 <data format="fasta" name="clade">
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
28 <filter>program != "L"</filter>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
29 </data>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
30 <data format="qfile" name="qvalue_en">
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
31 <filter>program != "L"</filter>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
32 </data>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
33 <data format="qfile" name="qvalue_rt">
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
34 <filter>program != "L"</filter>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
35 </data>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
36 <data format="gff3" name="ltr_gff3">
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
37 <filter>program != "N"</filter>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
38 </data>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
39 <data format="gff3" name="nonltr_gff3">
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
40 <filter>program != "L"</filter>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
41 </data>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
42
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
43 </outputs>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
44 <help>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
45 Running the program
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
46 ===================
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
47
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
48 To run MGEScan, select input genome data in From select box, and select program either LTR, nonLTR or both.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
49
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
50 Click 'Execute' button.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
51
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
52 If you like to have more options to run LTR or nonLTR progrma, use separated tools on the left panel.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
53 In LTR > MGEScan-LTR, preprocessing by repeatmasker and setting other variables are available e.g. distance(bp) between LTRs.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
54
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
55 Output
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
56 ============
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
57 A. MGEScan_LTR:
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
58 Upon completion, MGEScan-LTR generates a file "ltr.out". This output file has information
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
59 about clusters and coordinates of LTR retrotransposons identified. Each cluster of LTR
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
60 retrotransposons starts with the head line of "[cluster_number]---------", followed by
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
61 the information of LTR retrotransposons in the cluster. The columns for LTR
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
62 retrotransposons are as follows.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
63
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
64 1. LTR_id: unique id of LTRs identified. It consist of two components, sequence file name and id in the file. For example, chr1_2 is the second LTR retrotransposon in the chr1 file.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
65 2. start position of 5’ LTR.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
66 3. end position of 5’ LTR.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
67 4. start position of 3’ LTR.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
68 5. end position of 3’ LTR.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
69 6. strand: + or -.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
70 7. length of 5’ LTR.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
71 8. length of 3’ LTR.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
72 9. length of the LTR retrotransposon.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
73 10. TSD on the left side of the LTR retotransposons.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
74 11. TSD on the right side of the LTR retrotransposons.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
75 12. di(tri)nucleotide on the left side of 5’LTR
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
76 13. di(tri)nucleotide on the right side of 5’LTR
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
77 14. di(tri)nucleotide on the left side of 3’LTR
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
78 15. di(tri)nucleotide on the right side of 3’LTR
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
79
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
80 B. MGEScan_nonLTR:
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
81 Upon completion, MGEScan-nonLTR generates the directory, "info" in the data directory you
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
82 specified. In this "info" directory, two sub-directories ("full" and "validation") are
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
83 generated.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
84
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
85 - The "full" directory is for storing sequences of elements. Each subdirectory in "full"
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
86 is the name of clade. In each directory of clade, the DNA sequences of nonLTRs identified
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
87 are listed. Each sequence is in fasta format. The header contains the position
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
88 information of TEs identified:
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
89 [genome_file_name]_[start position in the sequence]
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
90
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
91 For example, >chr1_333 means that this element start at 333bp in the "chr1" file.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
92
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
93 - The "validation" directory is for storing Q values. In the files "en" and "rt", the first column corresponds to the element name and the last column Q value.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
94
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
95 License
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
96 ============
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
97 Copyright 2014 Mina Rho, Haixu Tang.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
98 You may redistribute this software under the terms of the GNU General Public License.
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
99
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
100 </help>
803c7c39993e Uploaded
hyungrolee
parents:
diff changeset
101 </tool>