annotate PhageDPO.xml @ 10:1d0870d578bc draft

Uploaded
author jose_duarte
date Wed, 24 Nov 2021 18:06:51 +0000
parents d8aed4c31383
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
5
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
1 <tool id="PhageDPO" name="PhageDPO" version="0.1.0" python_template_version="3.5">
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
2 <description>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
3 Phage Depolymerase Finder
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
4 </description>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
5 <requirements>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
6 <requirement type="package" version="1.78">biopython</requirement>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
7 <requirement type="package" version="0.24.1">scikit-learn</requirement>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
8 <requirement type="package">numpy</requirement>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
9 <requirement type="package" version="1.2.3">pandas</requirement>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
10 <requirement type="package" version="1.0.0a2">propy3</requirement>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
11 </requirements>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
12 <command detect_errors="exit_code"><![CDATA[
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
13 python '$__tool_directory__/DPOGALAXY.py' '$adv.model' '${input1}'
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
14 ]]></command>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
15 <inputs>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
16 <param type="data" name="input1" format="fasta" label="Fasta file"/>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
17 <section name = 'adv' title= 'Advanced Options' expanded = 'False'>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
18 <param type = "select" name="model" label="Model">
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
19 <option value="SVM4311" selected="yes">SVM4311</option>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
20 <option value="ANN4311">ANN4311</option>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
21 <option value="RF5748">RF5748</option>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
22 </param>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
23 </section>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
24
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
25 </inputs>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
26 <outputs>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
27 <data name="output1" format="html" from_work_dir="output.html"
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
28 label="DPO Prediction"/>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
29 </outputs>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
30 <tests>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
31 <test>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
32 <param name="model" value="SVM4311"/>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
33 <param name="input1" value="fasta_file.fasta"/>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
34 <output name="output1" file="output.html"/>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
35 </test>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
36 </tests>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
37 <help><![CDATA[
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
38
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
39 ========
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
40 PhageDPO
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
41 ========
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
42
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
43 Predicts the existance of Phage Polysaccharide Depolymerase.
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
44
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
45 PhageDPO is a python script that predicts the existance of depolymerases (DPOs) using supervised machine learning models. Two different datasets were used to develop three models: The SVM and ANN models was built using a dataset with 45 features and 4311 examples (1437 positives and 2874 negatives) and the RF model was created using a dataset with 54 features and 5748 examples (1437 positives and 4311 negatives).
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
46
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
47 **Inputs:**
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
48
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
49 * fasta file: fasta file format contain the ORF or the CDS.
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
50
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
51 **Advanced options:**
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
52
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
53 * Model: selection of the model to run: the SVM model (default), ANN model or the RF model. The SVM and ANN models focus on true positive detection. On the other hand, the RF model uses more negative data ensuring that all DPOs are identified, although with some misclassifications.
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
54
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
55 **Outputs:**
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
56
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
57 The tool outputs an html file containing the name of the sequence and the percentage of positive prediction for DPO.
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
58
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
59 **Requirements:**
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
60
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
61 * Biopython
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
62 * Sklearn
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
63 * Numpy
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
64 * Pandas
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
65 * Propy
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
66
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
67 ]]></help>
d8aed4c31383 Uploaded
jose_duarte
parents:
diff changeset
68 </tool>