view tools/protein_analysis/clinod.xml @ 1:221d7dca03a5 draft

Uploaded v0.0.4, which adds a unit test
author peterjc
date Tue, 16 Apr 2013 13:01:54 -0400
parents b8be455c27d1
children
line wrap: on
line source

<tool id="clinod" name="Nucleolar localization sequence Detector (NoD)" version="0.0.4">
    <description>Find nucleolar localization signals (NoLSs) in protein sequences</description>
    <command>
      java -jar /opt/clinod/clinod-1.3.jar -in="$fasta_file" -out="$tabular_file" -t=8 -f=MEDIUM_TAB -nonols -clean_sequence
      ##I want the number of threads to be a Galaxy config option...
      ##TODO - Make the -clean_sequence argument a parameter?
    </command>
    <stdio>
        <!-- Assume anything other than zero is an error -->
        <exit_code range="1:" />
        <exit_code range=":-1" />
    </stdio>
    <inputs>
        <param name="fasta_file" type="data" format="fasta" label="FASTA file of protein sequences"/> 
    </inputs>
    <outputs>
        <data name="tabular_file" format="tabular" label="NoD results" />
    </outputs>
    <requirements>
        <requirement type="binary">java</requirement>
    </requirements>
    <tests>
        <test>
            <param name="fasta_file" value="four_human_proteins.fasta" ftype="fasta" />
	    <output name="tabular_file" file="four_human_proteins.clinod-1.3.tabular" ftype="tabular" />
        </test>
    </tests>
    <help>

**What it does**

This calls the command line version of the NoD tool from the Barton Group for
prediction of nucleolar localization sequences (NoLSs). The NoD tool uses an
artificial neural network trained on a set of human NoLSs.

The nucleolus is a sub-compartmentof the nucleus, thus an NoLS can be regarded
as a special nuclear localization sequence (NLS).

The input is a FASTA file of protein sequences, and the output is tabular with
four columns (multiple rows per protein):

====== ===================
Column Description
------ -------------------
     1 Sequence identifier
     2 Start of NoLS
     3 End of NoLS
     4 NoLS sequence
====== ===================

If a sequence has no predicted NoLS, then there is no line in the output file
for it.


**References**

M. S. Scott, F. M. Boisvert, M. D. McDowall, A. I. Lamond and G. J. Barton.
Characterization and prediction of protein nucleolar localization sequences.
Nucleic Acids Research 38(21), 7388-7399, 2010.
http://dx.doi.org/10.1093/nar/gkq653

M. S. Scott, P. V. Troshin and G. J. Barton.
NoD: a Nucleolar localization sequence detector for eukaryotic and viral proteins.
BMC Bioinformatics, 12:317, 2011.
http://dx.doi.org/10.1186/1471-2105-12-317

http://www.compbio.dundee.ac.uk/www-nod/

    </help>
</tool>