comparison mgescan.xml @ 5:7658ef159fd8 draft

Uploaded
author hyungrolee
date Mon, 15 Feb 2016 03:33:19 -0500
parents
children 106ee0841650
comparison
equal deleted inserted replaced
4:f9be9a13e01a 5:7658ef159fd8
1 <?xml version="1.0"?>
2
3 <tool name="MGEScan" id="mgescan" version="0.0.2">
4 <description>
5 MGEScan
6 </description>
7 <requirements>
8 <requirement type="package" version="3.0">mgescan</requirement>
9 </requirements>
10 <version_command>mgescan --version</version_command>
11 <command interpreter="bash">
12 mgescan.sh $input '$input.name' 3 $output $program $clade $qvalue_en $qvalue_rt $ltr_gff3 $nonltr_gff3 $both_gff3 $mpi_yn.nmpi
13 <!-- mgescan.sh $input $input.name $hmmver $output $program $clade $qvalue_en $qvalue_rt $ltr_gff3 $nonltr_gff3 -->
14 </command>
15 <inputs>
16 <param format="fasta,tabular,data" name="input" type="data" label="Input FASTA file(s)"/>
17 <!--param name="hmmver" type="select" label="Hmmsearch version">
18 <option selected="selected" value="3">3</option>
19 <option value="2">2</option>
20 </param-->
21 <param name="program" type="select" label="MGEScan">
22 <option selected="selected" value="B">Both</option>
23 <option value="L">LTR</option>
24 <option value="N">nonLTR</option>
25 </param>
26 <conditional name="mpi_yn">
27 <param name="mpi_select" type="select" label="Enable MPI">
28 <option value="no_mpi">No</option>
29 <option value="yes_mpi">Yes</option>
30 </param>
31 <when value="yes_mpi">
32 <param name="nmpi" format="txt" type="text" value="1" label="Number of MPI Processes"/>
33 </when>
34 <when value="no_mpi">
35 <param name="nmpi" type="hidden" value="0"/>
36 </when>
37 </conditional>
38 </inputs>
39 <outputs>
40 <data format="ltr.out" name="output" label="LTR Results (ltr.out)">
41 <filter>program != "N"</filter>
42 </data>
43 <data format="fasta" name="clade" label="clade file (FASTA)">
44 <filter>program != "L"</filter>
45 </data>
46 <data format="qfile" name="qvalue_en" label="qvalue_en">
47 <filter>program != "L"</filter>
48 </data>
49 <data format="qfile" name="qvalue_rt" label="qvalue_rt">
50 <filter>program != "L"</filter>
51 </data>
52 <data format="gff3" name="ltr_gff3" label="GFF3 for LTR">
53 <filter>program != "N"</filter>
54 </data>
55 <data format="gff3" name="nonltr_gff3" label="GFF3 for nonLTR">
56 <filter>program != "L"</filter>
57 </data>
58 <data format="gff3" name="both_gff3" label="GFF3 for LTR and nonLTR">
59 <filter>program == "B"</filter>
60 </data>
61
62 </outputs>
63 <help>
64 How to Run MGEScan
65 ===================
66
67 * Select an input genome data from the select box, and choose a program. Both LTR and nonLTR of MGEScan is default.
68 * Click 'Execute' button.
69 * MPI will be enabled depending on your system support.
70
71 If you like to have more options to run LTR or nonLTR program, use separated tools on the left panel.
72
73 For example, in LTR > MGEScan-LTR, preprocessing by repeatmasker and setting other variables are available e.g. distance(bp) between LTRs.
74
75 Output
76 ============
77
78 A. MGEScan_LTR:
79
80 Upon completion, MGEScan-LTR generates a file "ltr.out". This output file has information
81 about clusters and coordinates of LTR retrotransposons identified. Each cluster of LTR
82 retrotransposons starts with the head line of "[cluster_number]---------", followed by
83 the information of LTR retrotransposons in the cluster. The columns for LTR
84 retrotransposons are as follows.
85
86 1. LTR_id: unique id of LTRs identified. It consist of two components, sequence file name and id in the file. For example, chr1_2 is the second LTR retrotransposon in the chr1 file.
87 2. start position of 5’ LTR.
88 3. end position of 5’ LTR.
89 4. start position of 3’ LTR.
90 5. end position of 3’ LTR.
91 6. strand: + or -.
92 7. length of 5’ LTR.
93 8. length of 3’ LTR.
94 9. length of the LTR retrotransposon.
95 10. TSD on the left side of the LTR retotransposons.
96 11. TSD on the right side of the LTR retrotransposons.
97 12. di(tri)nucleotide on the left side of 5’LTR
98 13. di(tri)nucleotide on the right side of 5’LTR
99 14. di(tri)nucleotide on the left side of 3’LTR
100 15. di(tri)nucleotide on the right side of 3’LTR
101
102 B. MGEScan_nonLTR:
103 Upon completion, MGEScan-nonLTR generates the directory, "info" in the data directory you
104 specified. In this "info" directory, two sub-directories ("full" and "validation") are
105 generated.
106
107 * The "full" directory is for storing sequences of elements. Each subdirectory in "full"
108 is the name of clade. In each directory of clade, the DNA sequences of nonLTRs identified
109 are listed. Each sequence is in fasta format. The header contains the position
110 information of TEs identified: [genome_file_name]_[start position in the sequence]
111
112 For example, >chr1_333 means that this element start at 333bp in the "chr1" file.
113
114 * The "validation" directory is for storing Q values.
115 In the files "en" and "rt", the first column corresponds to the element name and the last column Q value.
116
117 License
118 ============
119 Copyright 2015.
120 You may redistribute this software under the terms of the GNU General Public License.
121
122 </help>
123 </tool>