Mercurial > repos > hyungrolee > mgescan
comparison mgescan.xml @ 0:c9fe27da991c draft
Uploaded
author | hyungrolee |
---|---|
date | Sat, 14 Jun 2014 19:04:23 -0400 |
parents | |
children |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:c9fe27da991c |
---|---|
1 <?xml version="1.0"?> | |
2 | |
3 <tool name="MGEScan" id="mgescan" version="0.0.1" workflow_compatible="false"> | |
4 <description> | |
5 MGEScan | |
6 </description> | |
7 <command interpreter="bash"> | |
8 mgescan.sh $input '$input.name' 3 $output $program $clade $qvalue_en $qvalue_rt $ltr_gff3 $nonltr_gff3 | |
9 <!-- mgescan.sh $input $input.name $hmmver $output $program $clade $qvalue_en $qvalue_rt $ltr_gff3 $nonltr_gff3 --> | |
10 </command> | |
11 <inputs> | |
12 <param format="txt" name="input" type="data" label="From"/> | |
13 <!--param name="hmmver" type="select" label="Hmmsearch version"> | |
14 <option selected="selected" value="3">3</option> | |
15 <option value="2">2</option> | |
16 </param--> | |
17 <param name="program" type="select" label="MGEScan"> | |
18 <option selected="selected" value="B">Both</option> | |
19 <option value="L">LTR</option> | |
20 <option value="N">nonLTR</option> | |
21 </param> | |
22 </inputs> | |
23 <outputs> | |
24 <data format="ltr.out" name="output"> | |
25 <filter>program != "N"</filter> | |
26 </data> | |
27 <data format="fasta" name="clade"> | |
28 <filter>program != "L"</filter> | |
29 </data> | |
30 <data format="qfile" name="qvalue_en"> | |
31 <filter>program != "L"</filter> | |
32 </data> | |
33 <data format="qfile" name="qvalue_rt"> | |
34 <filter>program != "L"</filter> | |
35 </data> | |
36 <data format="gff3" name="ltr_gff3"> | |
37 <filter>program != "N"</filter> | |
38 </data> | |
39 <data format="gff3" name="nonltr_gff3"> | |
40 <filter>program != "L"</filter> | |
41 </data> | |
42 | |
43 </outputs> | |
44 <help> | |
45 Running the program | |
46 =================== | |
47 | |
48 To run MGEScan, select input genome data in From select box, and select program either LTR, nonLTR or both. | |
49 | |
50 Click 'Execute' button. | |
51 | |
52 If you like to have more options to run LTR or nonLTR progrma, use separated tools on the left panel. | |
53 In LTR > MGEScan-LTR, preprocessing by repeatmasker and setting other variables are available e.g. distance(bp) between LTRs. | |
54 | |
55 Output | |
56 ============ | |
57 A. MGEScan_LTR: | |
58 Upon completion, MGEScan-LTR generates a file "ltr.out". This output file has information | |
59 about clusters and coordinates of LTR retrotransposons identified. Each cluster of LTR | |
60 retrotransposons starts with the head line of "[cluster_number]---------", followed by | |
61 the information of LTR retrotransposons in the cluster. The columns for LTR | |
62 retrotransposons are as follows. | |
63 | |
64 1. LTR_id: unique id of LTRs identified. It consist of two components, sequence file name and id in the file. For example, chr1_2 is the second LTR retrotransposon in the chr1 file. | |
65 2. start position of 5’ LTR. | |
66 3. end position of 5’ LTR. | |
67 4. start position of 3’ LTR. | |
68 5. end position of 3’ LTR. | |
69 6. strand: + or -. | |
70 7. length of 5’ LTR. | |
71 8. length of 3’ LTR. | |
72 9. length of the LTR retrotransposon. | |
73 10. TSD on the left side of the LTR retotransposons. | |
74 11. TSD on the right side of the LTR retrotransposons. | |
75 12. di(tri)nucleotide on the left side of 5’LTR | |
76 13. di(tri)nucleotide on the right side of 5’LTR | |
77 14. di(tri)nucleotide on the left side of 3’LTR | |
78 15. di(tri)nucleotide on the right side of 3’LTR | |
79 | |
80 B. MGEScan_nonLTR: | |
81 Upon completion, MGEScan-nonLTR generates the directory, "info" in the data directory you | |
82 specified. In this "info" directory, two sub-directories ("full" and "validation") are | |
83 generated. | |
84 | |
85 - The "full" directory is for storing sequences of elements. Each subdirectory in "full" | |
86 is the name of clade. In each directory of clade, the DNA sequences of nonLTRs identified | |
87 are listed. Each sequence is in fasta format. The header contains the position | |
88 information of TEs identified: | |
89 [genome_file_name]_[start position in the sequence] | |
90 | |
91 For example, >chr1_333 means that this element start at 333bp in the "chr1" file. | |
92 | |
93 - The "validation" directory is for storing Q values. In the files "en" and "rt", the first column corresponds to the element name and the last column Q value. | |
94 | |
95 License | |
96 ============ | |
97 Copyright 2014 Mina Rho, Haixu Tang. | |
98 You may redistribute this software under the terms of the GNU General Public License. | |
99 | |
100 </help> | |
101 </tool> |