view tools/clinod/clinod.xml @ 2:d1aebb0acee7 draft

Uploaded v0.0.5, automated installation
author peterjc
date Thu, 30 May 2013 13:26:33 -0400
parents
children 6a9debe4b860
line wrap: on
line source

<tool id="clinod" name="Nucleolar localization sequence Detector (NoD)" version="0.0.5">
    <description>Find nucleolar localization signals (NoLSs) in protein sequences</description>
    <requirements>
        <requirement type="binary">java</requirement>
        <requirement type="package" version="1.3">clinod</requirement>
    </requirements>
    <command>
##The Galaxy Tool Shed installation should define $CLINOD to point at folder
##containing both clinod-1.3.jar and the batchman binary:
java -jar \$CLINOD/clinod-1.3.jar -in="$fasta_file" -out="$tabular_file" -t=2 -f=MEDIUM_TAB -nonols -clean_sequence
##I want the number of threads to be a Galaxy config option...
##TODO - Make the -clean_sequence argument a parameter?
    </command>
    <stdio>
        <!-- Assume anything other than zero is an error -->
        <exit_code range="1:" />
        <exit_code range=":-1" />
    </stdio>
    <inputs>
        <param name="fasta_file" type="data" format="fasta" label="FASTA file of protein sequences"/> 
    </inputs>
    <outputs>
        <data name="tabular_file" format="tabular" label="NoD results" />
    </outputs>
    <tests>
        <test>
            <param name="fasta_file" value="four_human_proteins.fasta" ftype="fasta" />
	    <output name="tabular_file" file="four_human_proteins.clinod-1.3.tabular" ftype="tabular" />
        </test>
    </tests>
    <help>

**What it does**

This calls the command line version of the NoD tool from the Barton Group for
prediction of nucleolar localization sequences (NoLSs). The NoD tool uses an
artificial neural network trained on a set of human NoLSs.

The nucleolus is a sub-compartmentof the nucleus, thus an NoLS can be regarded
as a special nuclear localization sequence (NLS).

The input is a FASTA file of protein sequences, and the output is tabular with
four columns (multiple rows per protein):

====== ===================
Column Description
------ -------------------
     1 Sequence identifier
     2 Start of NoLS
     3 End of NoLS
     4 NoLS sequence
====== ===================

If a sequence has no predicted NoLS, then there is no line in the output file
for it.

-----

**References**

M. S. Scott, F. M. Boisvert, M. D. McDowall, A. I. Lamond and G. J. Barton.
Characterization and prediction of protein nucleolar localization sequences.
Nucleic Acids Research 38(21), 7388-7399, 2010.
http://dx.doi.org/10.1093/nar/gkq653

M. S. Scott, P. V. Troshin and G. J. Barton.
NoD: a Nucleolar localization sequence detector for eukaryotic and viral proteins.
BMC Bioinformatics, 12:317, 2011.
http://dx.doi.org/10.1186/1471-2105-12-317

http://www.compbio.dundee.ac.uk/www-nod/

This wrapper is available to install into other Galaxy Instances via the Galaxy
Tool Shed at http://toolshed.g2.bx.psu.edu/view/peterjc/clinod

    </help>
</tool>