annotate PhageDPO.xml @ 19:d01d8135e1d7 draft

Uploaded
author jose_duarte
date Wed, 08 Dec 2021 10:30:23 +0000
parents 3e201ce0de44
children
rev   line source
16
3e201ce0de44 Uploaded
jose_duarte
parents:
diff changeset
<tool id="PhageDPO" name="PhageDPO" version="0.1.0" python_template_version="3.5">
    <description>
        Phage Depolymerase Finder
    </description>
    <requirements>
        <requirement type="package" version="1.78">biopython</requirement>
        <requirement type="package" version="0.24.1">scikit-learn</requirement>
        <requirement type="package">numpy</requirement>
        <requirement type="package" version="1.2.3">pandas</requirement>
        <requirement type="package" version="1.0.2">propy3</requirement>
    </requirements>
    <command detect_errors="exit_code"><![CDATA[
        python '$__tool_directory__/DPOGALAXY.py' '$adv.model' '${input1}'
    ]]></command>
    <inputs>
        <param type="data" name="input1" format="fasta" label="Fasta file"/>
        <section name="adv" title="Advanced Options" expanded="false">
            <param type="select" name="model" label="Model">
                <option value="SVM4311" selected="yes">SVM4311</option>
                <option value="ANN7185">ANN7185</option>
            </param>
        </section>
    </inputs>
    <outputs>
        <data name="output1" format="html" from_work_dir="output.html"
              label="DPO Prediction"/>
    </outputs>
    <tests>
        <test>
            <param name="model" value="SVM4311"/>
            <param name="input1" value="fasta_file.fasta"/>
            <output name="output1" file="output.html"/>
        </test>
    </tests>
    <help><![CDATA[

========
PhageDPO
========

Predicts the existence of phage polysaccharide depolymerases.

PhageDPO is a Python script that predicts the existence of depolymerases (DPOs) using supervised machine learning models.
Two different datasets were used to develop two models. The SVM model was built using a dataset with 45 features and 4311 examples (1437 positives and 2874 negatives), and the ANN model was built using a dataset with 166 features and 7185 examples (1437 positives and 5748 negatives).

**Inputs:**

* FASTA file: a file in FASTA format containing the nucleotide sequences.

**Advanced options:**

* Model: the model to run, either the SVM model (default) or the ANN model. The SVM model focuses on true positive detection while avoiding false positives. The ANN model, on the other hand, uses more negative data, with the goal of ensuring that all DPOs are identified.

**Outputs:**

The tool outputs an HTML file containing the name of each sequence and its percentage of positive prediction for DPO.

**Requirements:**

* Biopython
* scikit-learn
* NumPy
* pandas
* propy3

    ]]></help>
</tool>
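<!--
Illustrative note, not part of the tool definition. The command template in this wrapper passes two positional arguments to DPOGALAXY.py: the selected model name and the path of the FASTA dataset. The Python sketch below shows how that template might expand into an argument list. The function name build_command and the example paths are hypothetical, and the real expansion is performed by Galaxy's template engine, not by this code.

```python
# Hypothetical helper: mirrors the command template of the PhageDPO wrapper.
# The model names are the option values declared in the tool's select parameter.
ALLOWED_MODELS = ("SVM4311", "ANN7185")


def build_command(tool_directory, fasta_path, model="SVM4311"):
    """Return the argument list the wrapper's command template would produce."""
    if model not in ALLOWED_MODELS:
        raise ValueError(f"unknown model: {model!r}")
    script = f"{tool_directory}/DPOGALAXY.py"
    # Order matches the template: script, model, then the input FASTA path.
    return ["python", script, model, fasta_path]


if __name__ == "__main__":
    # Example paths are placeholders, not real locations.
    print(build_command("/path/to/tool", "input.fasta"))
```

According to the outputs section of the wrapper, Galaxy then collects the result from a file named output.html in the job working directory.
-->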