comparison mgescan.xml @ 0:803c7c39993e draft

Uploaded
author hyungrolee
date Sat, 14 Jun 2014 19:00:14 -0400
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:803c7c39993e
1 <?xml version="1.0"?>
2
3 <tool name="MGEScan" id="mgescan" version="0.0.1" workflow_compatible="false">
4 <description>
5 MGEScan
6 </description>
7 <command interpreter="bash">
8 mgescan.sh $input '$input.name' 3 $output $program $clade $qvalue_en $qvalue_rt $ltr_gff3 $nonltr_gff3
9 <!-- mgescan.sh $input $input.name $hmmver $output $program $clade $qvalue_en $qvalue_rt $ltr_gff3 $nonltr_gff3 -->
10 </command>
11 <inputs>
12 <param format="txt" name="input" type="data" label="From"/>
13 <!--param name="hmmver" type="select" label="Hmmsearch version">
14 <option selected="selected" value="3">3</option>
15 <option value="2">2</option>
16 </param-->
17 <param name="program" type="select" label="MGEScan">
18 <option selected="selected" value="B">Both</option>
19 <option value="L">LTR</option>
20 <option value="N">nonLTR</option>
21 </param>
22 </inputs>
23 <outputs>
24 <data format="ltr.out" name="output">
25 <filter>program != "N"</filter>
26 </data>
27 <data format="fasta" name="clade">
28 <filter>program != "L"</filter>
29 </data>
30 <data format="qfile" name="qvalue_en">
31 <filter>program != "L"</filter>
32 </data>
33 <data format="qfile" name="qvalue_rt">
34 <filter>program != "L"</filter>
35 </data>
36 <data format="gff3" name="ltr_gff3">
37 <filter>program != "N"</filter>
38 </data>
39 <data format="gff3" name="nonltr_gff3">
40 <filter>program != "L"</filter>
41 </data>
42
43 </outputs>
44 <help>
45 Running the program
46 ===================
47
48 To run MGEScan, select input genome data in From select box, and select program either LTR, nonLTR or both.
49
50 Click 'Execute' button.
51
52 If you like to have more options to run LTR or nonLTR progrma, use separated tools on the left panel.
53 In LTR > MGEScan-LTR, preprocessing by repeatmasker and setting other variables are available e.g. distance(bp) between LTRs.
54
55 Output
56 ============
57 A. MGEScan_LTR:
58 Upon completion, MGEScan-LTR generates a file "ltr.out". This output file has information
59 about clusters and coordinates of LTR retrotransposons identified. Each cluster of LTR
60 retrotransposons starts with the head line of "[cluster_number]---------", followed by
61 the information of LTR retrotransposons in the cluster. The columns for LTR
62 retrotransposons are as follows.
63
64 1. LTR_id: unique id of LTRs identified. It consist of two components, sequence file name and id in the file. For example, chr1_2 is the second LTR retrotransposon in the chr1 file.
65 2. start position of 5’ LTR.
66 3. end position of 5’ LTR.
67 4. start position of 3’ LTR.
68 5. end position of 3’ LTR.
69 6. strand: + or -.
70 7. length of 5’ LTR.
71 8. length of 3’ LTR.
72 9. length of the LTR retrotransposon.
73 10. TSD on the left side of the LTR retotransposons.
74 11. TSD on the right side of the LTR retrotransposons.
75 12. di(tri)nucleotide on the left side of 5’LTR
76 13. di(tri)nucleotide on the right side of 5’LTR
77 14. di(tri)nucleotide on the left side of 3’LTR
78 15. di(tri)nucleotide on the right side of 3’LTR
79
80 B. MGEScan_nonLTR:
81 Upon completion, MGEScan-nonLTR generates the directory, "info" in the data directory you
82 specified. In this "info" directory, two sub-directories ("full" and "validation") are
83 generated.
84
85 - The "full" directory is for storing sequences of elements. Each subdirectory in "full"
86 is the name of clade. In each directory of clade, the DNA sequences of nonLTRs identified
87 are listed. Each sequence is in fasta format. The header contains the position
88 information of TEs identified:
89 [genome_file_name]_[start position in the sequence]
90
91 For example, >chr1_333 means that this element start at 333bp in the "chr1" file.
92
93 - The "validation" directory is for storing Q values. In the files "en" and "rt", the first column corresponds to the element name and the last column Q value.
94
95 License
96 ============
97 Copyright 2014 Mina Rho, Haixu Tang.
98 You may redistribute this software under the terms of the GNU General Public License.
99
100 </help>
101 </tool>