annotate PhageDPO.xml @ 13:f51906b109cd draft

Uploaded
author jose_duarte
date Fri, 26 Nov 2021 14:42:21 +0000
parents 808f6cdf2e9f
children
<tool id="PhageDPO" name="PhageDPO" version="0.1.0" python_template_version="3.5">
    <description>
        Phage Depolymerase Finder
    </description>
    <requirements>
        <requirement type="package" version="1.78">biopython</requirement>
        <requirement type="package" version="0.24.1">scikit-learn</requirement>
        <requirement type="package" version="1.21.2">numpy</requirement>
        <requirement type="package" version="1.2.3">pandas</requirement>
        <requirement type="package" version="1.0.2">propy3</requirement>
    </requirements>
    <command detect_errors="exit_code"><![CDATA[
        python '$__tool_directory__/DPOGALAXY.py' '$adv.model' '${input1}'
    ]]></command>
    <inputs>
        <param type="data" name="input1" format="fasta" label="Fasta file"/>
        <section name="adv" title="Advanced Options" expanded="false">
            <param type="select" name="model" label="Model">
                <option value="SVM4311" selected="yes">SVM4311</option>
                <option value="ANN4311">ANN4311</option>
                <option value="RF5748">RF5748</option>
            </param>
        </section>
    </inputs>
    <outputs>
        <data name="output1" format="html" from_work_dir="output.html"
              label="DPO Prediction"/>
    </outputs>
    <tests>
        <test>
            <param name="model" value="SVM4311"/>
            <param name="input1" value="fasta_file.fasta"/>
            <output name="output1" file="output.html"/>
        </test>
    </tests>
    <help><![CDATA[

========
PhageDPO
========

Predicts the presence of phage polysaccharide depolymerases.

PhageDPO is a Python script that predicts the presence of depolymerases (DPOs) using supervised machine learning models. Two datasets were used to develop three models. The SVM and ANN models were built on a dataset with 45 features and 4311 examples (1437 positives and 2874 negatives); the RF model was built on a dataset with 54 features and 5748 examples (1437 positives and 4311 negatives).
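
As a quick check of the figures quoted above, the following Python sketch recomputes the size and class balance of each training set. The dictionary layout and variable names are illustrative assumptions and are not part of PhageDPO itself.

```python
# Training-set counts as stated in the description above.
# The dictionary layout and names are illustrative, not taken from PhageDPO.
datasets = {
    "SVM4311/ANN4311": {"features": 45, "positives": 1437, "negatives": 2874},
    "RF5748": {"features": 54, "positives": 1437, "negatives": 4311},
}

summary = {}
for name, counts in datasets.items():
    # Total examples and number of negatives per positive example.
    total = counts["positives"] + counts["negatives"]
    ratio = counts["negatives"] / counts["positives"]
    summary[name] = (total, ratio)
    print(f"{name}: {total} examples, {ratio:.0f} negatives per positive")
```

Under these counts, the SVM and ANN dataset has two negatives per positive and the RF dataset has three, which is where the numbers in the model names (4311 and 5748) come from.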

**Inputs:**

* FASTA file: a file in FASTA format containing the ORF or CDS sequences.

**Advanced options:**

* Model: the model to run: the SVM model (default), the ANN model, or the RF model. The SVM and ANN models focus on true positive detection. The RF model, on the other hand, uses more negative data to ensure that all DPOs are identified, although with some misclassifications.

**Outputs:**

The tool outputs an HTML file containing the name of each sequence and its percentage of positive prediction for DPO.

**Requirements:**

* Biopython
* scikit-learn
* NumPy
* pandas
* propy3

    ]]></help>
</tool>