comparison phylogenies/genetree_read_placement.xml @ 0:5b9a38ec4a39 draft default tip

First commit of old repositories
author osiris_phylogenetics <ucsb_phylogenetics@lifesci.ucsb.edu>
date Tue, 11 Mar 2014 12:19:13 -0700
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:5b9a38ec4a39
1 <tool id="genetree_read_placement" name="genetree_read_placement" version="1.0.0">
2 <description>Places reads on a gene tree chosen from a menu.</description>
3 <requirements>
4 <requirement type="package">raxml</requirement>
5 <requirement type="package">muscle</requirement>
6 <requirement type="package">mafft</requirement>
7 <requirement type="package">prank</requirement>
8 </requirements>
9 <command interpreter="perl">
10 genetree_read_placement.pl $alignment $alignprog ${tree.fields.path} "${tree.fields.name}" > $stdout 2>&amp;1
11 </command>
12 <inputs>
13 <param format="fasta" name="alignment" type="data" label="Genes to place in tree (fasta)"/>
14 <param name="tree" type="select" label="Gene Tree">
15 <options from_file="genetrees.loc">
16 <column name="value" index="0"/>
17 <column name="name" index="1"/>
18 <column name="path" index="2"/>
19 </options>
20 </param>
21 <param name="alignprog" type="select" optional="false" label="Alignment Program" help="Must align reads to genes in tree. Specify which program to use. ">
22 <option value="MUSCLE">MUSCLE</option>
23 <option value="MAFFT">MAFFT with Auto option</option>
24 <option value="PRANK">PRANK with -F option</option>
25 </param>
26 </inputs>
27 <outputs>
28 <data format="txt" name="stdout" label="${tool.name} on ${on_string}: stdout" />
29 <data format="txt" name="RAxML_classification.EPA_TEST" label="${tool.name} on ${on_string}: RAxML_classification" from_work_dir="RAxML_classification.EPA_TEST" />
30 <data format="txt" name="RAxML_classificationLikelihoodWeights.EPA_TEST" label="${tool.name} on ${on_string}: RAxML_classificationLikelihoodWeights" from_work_dir="RAxML_classificationLikelihoodWeights.EPA_TEST" />
31 <data format="txt" name="RAxML_info.EPA_TEST" label="${tool.name} on ${on_string}: RAxML_info" from_work_dir="RAxML_info.EPA_TEST" />
32 <data format="txt" name="RAxML_labelledTree.EPA_TEST" label="${tool.name} on ${on_string}: RAxML_labelledTree" from_work_dir="RAxML_labelledTree.EPA_TEST" />
33 <data format="txt" name="RAxML_originalLabelledTree.EPA_TEST" label="${tool.name} on ${on_string}: RAxML_originalLabelledTree" from_work_dir="RAxML_originalLabelledTree.EPA_TEST" />
34 <data format="tabular" name="treeout.tab" label="PIA Result: ${tool.name} on ${on_string} name tab tree " from_work_dir="treeout.tab" />
35 </outputs>
36 <tests>
37 </tests>
38 <help>
39 **What it does**
40
41 This tool places unknown genes into a pre-calculated gene phylogeny. This can be used for annotating unknown genes.
42
43 ------
44
45 **Inputs**
46
47 1. Input file is a file of sequences.
48 2. The user selects a program to perform multiple sequence alignment of the input genes plus a database.
49 3. Second input is selected from a list of gene trees, that are specified in a .loc file (see additional information below).
50
51 ------
52
53 **Outputs**
54
55 RAxML writes the resulting tree file in newick text format, which can be viewed in Osiris with TreeVector (of the mothur package). In addition, if bootstrapping was selected, the individual bootstrap trees and the ML tree with support are written as separate newick files.
56
57 -------
58
59 **Installation Information**
60
61 1. The command this tool runs is:
62 raxmlHPC-PTHREADS-SSE3 -f v -s $alignment -m PROTGAMMAWAG -t $tree -n EPA_TEST -T 8
63
64 Which specifies 8 concurrent threads with -T 8. Change the xml if you want to call different numbers of threads. If using pbs or other job runner, make sure universe.ini file is set to match the number of threads requested.
65
66 2. Adding the trees that pop up on the menu and associated data used to build those trees requires
67 adding a genetrees.loc file in the tool-data directory of Galaxy. Each line of the loc file
68 specifies a data set, using three columns tab separated:
69
70 unique_id TAB caption for menu TAB /base_name_path/
71
72 So, for example, if your gene family is named opsin and the path to data files is /home/galaxy/data/genetrees/. The base name is used to specify two files basename.fas and basename.tre. In this case the path directory would contain opsin.fas and opsin.tre
73
74 opsin.tre is a newick phylogeny file and opsin.fas is a fasta file with the sequences (with the same names) used to make opsin.tre
75
76
77 Example of .loc file line
78
79 opsin Porter Opsin Tree /home/galaxy/data/genetrees/opsin
80
81 raxml Home Page:
82 http://www.exelixis-lab.org/software.html
83
84 -------
85
86 **Citations**
87
88 This tool is part of the Osiris Phylogenetics Tool Package for Galaxy. If you make extensive use of this tool in a publication, please consider citing the following.
89
90 Current Osiris Citation is here
91
92 http://osiris-phylogenetics.blogspot.com/2012/10/citation.html
93
94 Additional Citations for this tool
95
96 S.A. Berger, D. Krompass. Stamatakis: "Performance, Accuracy and Web-Server for Evolutionary Placement of Short Sequence Reads under maximum-likelihood". In Systematic Biology 60(3):291-302, 2011.
97
98 Stamatakis, A. (2006). RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics.
99 http://bioinformatics.oxfordjournals.org/content/22/21/2688.short
100
101 See also references for MAFFT, PRANK, and MUSCLE.
102
103 </help>
104 </tool>
105
106
107
108
109