comparison tools/protein_analysis/promoter2.xml @ 20:a19b3ded8f33 draft

v0.2.11 Job splitting fast-fail; RXLR tools supports HMMER2 from BioConda; Capture more version information; misc internal changes
author peterjc
date Thu, 21 Sep 2017 11:35:20 -0400
parents eb6ac44d4b8e
children 238eae32483c
comparison
equal deleted inserted replaced
19:f3ecd80850e2 20:a19b3ded8f33
1 <tool id="promoter2" name="Promoter 2.0" version="0.0.10"> 1 <tool id="promoter2" name="Promoter 2.0" version="0.0.12">
2 <description>Find eukaryotic PolII promoters in DNA sequences</description> 2 <description>Find eukaryotic PolII promoters in DNA sequences</description>
3 <!-- If job splitting is enabled, break up the query file into parts --> 3 <!-- If job splitting is enabled, break up the query file into parts -->
4 <!-- Using 2000 per chunk so 4 threads each doing 500 is ideal --> 4 <!-- Using 2000 per chunk so 4 threads each doing 500 is ideal -->
5 <parallelism method="basic" split_inputs="fasta_file" split_mode="to_size" split_size="2000" merge_outputs="tabular_file"></parallelism> 5 <parallelism method="basic" split_inputs="fasta_file" split_mode="to_size" split_size="2000" merge_outputs="tabular_file"></parallelism>
6 <requirements> 6 <requirements>
7 <requirement type="binary">promoter</requirement>
8 <requirement type="package">promoter</requirement> 7 <requirement type="package">promoter</requirement>
9 </requirements> 8 </requirements>
10 <stdio> 9 <version_command>
11 <!-- Anything other than zero is an error --> 10 python $__tool_directory__/promoter2.py --version
12 <exit_code range="1:" /> 11 </version_command>
13 <exit_code range=":-1" /> 12 <command detect_errors="aggressive">
14 </stdio> 13 python $__tool_directory__/promoter2.py "\$GALAXY_SLOTS" '$fasta_file' '$tabular_file'
15 <version_command interpreter="python">promoter2.py --version</version_command>
16 <command interpreter="python">
17 promoter2.py "\$GALAXY_SLOTS" "$fasta_file" "$tabular_file"
18 ##If the environment variable isn't set, get "", and the python wrapper
19 ##defaults to four threads.
20 </command> 14 </command>
21 <inputs> 15 <inputs>
22 <param name="fasta_file" type="data" format="fasta" label="FASTA file of DNA sequences"/> 16 <param name="fasta_file" type="data" format="fasta" label="FASTA file of DNA sequences"/>
23 </inputs> 17 </inputs>
24 <outputs> 18 <outputs>
25 <data name="tabular_file" format="tabular" label="Promoter2 on ${fasta_file.name}" /> 19 <data name="tabular_file" format="tabular" label="Promoter2 on ${fasta_file.name}" />
26 </outputs> 20 </outputs>
27 <tests> 21 <tests>
33 <param name="fasta_file" value="empty.fasta" ftype="fasta"/> 27 <param name="fasta_file" value="empty.fasta" ftype="fasta"/>
34 <output name="tabular_file" file="empty_promoter2.tabular" ftype="tabular"/> 28 <output name="tabular_file" file="empty_promoter2.tabular" ftype="tabular"/>
35 </test> 29 </test>
36 </tests> 30 </tests>
37 <help> 31 <help>
38 32
39 **What it does** 33 **What it does**
40 34
41 This calls the Promoter 2.0 tool for prediction of eukaryotic PolII promoter sequences using a Neural Network (NN) model. 35 This calls the Promoter 2.0 tool for prediction of eukaryotic PolII promoter sequences using a Neural Network (NN) model.
42 36
43 The input is a FASTA file of nucleotide sequences (e.g. upstream regions of your genes), and the output is tabular with five columns (one row per promoter): 37 The input is a FASTA file of nucleotide sequences (e.g. upstream regions of your genes), and the output is tabular with five columns (one row per promoter):
57 Score Description 51 Score Description
58 --------- ------------------------ 52 --------- ------------------------
59 below 0.5 ignored 53 below 0.5 ignored
60 0.5 - 0.8 Marginal prediction 54 0.5 - 0.8 Marginal prediction
61 0.8 - 1.0 Medium likely prediction 55 0.8 - 1.0 Medium likely prediction
62 above 1.0 Highly likely prediction 56 above 1.0 Highly likely prediction
63 ========= ======================== 57 ========= ========================
64 58
65 Internally the input FASTA file is divided into parts (to allow multiple processors to be used), and the raw output is reformatted into this tabular layout suitable for downstream analysis within Galaxy. 59 Internally the input FASTA file is divided into parts (to allow multiple processors to be used), and the raw output is reformatted into this tabular layout suitable for downstream analysis within Galaxy.
66 60
67 **References** 61 **References**