What it does
This calls the command line tool PSORTb v3.0 for prediction of prokaryotic localization sites. The input dataset needs to be protein FASTA sequences. The default output is a simple tabular file with three columns, one row per query sequence:
Column | Description |
1 | Sequence identifier |
2 | Localisation, e.g. Cytoplasmic |
3 | Score |
The long output is also tabular with one row per query sequence, but has lots more columns (a different set for each supported organism type). In both cases, a simple header line is included (starting with a hash, #, so that Galaxy treats it as a comment) giving the column names.
References
If you use this Galaxy tool in work leading to a scientific publication please cite the following papers:
Peter J.A. Cock, Björn A. Grüning, Konrad Paszkiewicz and Leighton Pritchard (2013). Galaxy tools and workflows for sequence analysis with applications in molecular plant pathology. PeerJ 1:e167 https://doi.org/10.7717/peerj.167
N.Y. Yu, J.R. Wagner, M.R. Laird, G. Melli, S. Rey, R. Lo, P. Dao, S.C. Sahinalp, M. Ester, L.J. Foster, F.S.L. Brinkman (2010) PSORTb 3.0: Improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes, Bioinformatics 26(13):1608-1615 https://doi.org/10.1093/bioinformatics/btq249
See also http://www.psort.org/documentation/index.html
This wrapper is available to install into other Galaxy Instances via the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/view/peterjc/tmhmm_and_signalp