Mercurial > repos > konradpaszkiewicz > interproscan
view README_INTERPROSCAN @ 0:49e20fa2c66d
Migrated tool version 1.0.0 from old tool shed archive to new tool shed repository
author | konradpaszkiewicz |
---|---|
date | Tue, 07 Jun 2011 17:27:16 -0400 |
parents | |
children |
line wrap: on
line source
#Created 07/01/2011 - Konrad Paszkiewicz, Exeter Sequencing Service, University of Exeter The attached is a crude wrapper script for Interproscan. Typically this is useful when one wants to produce an annotation which is not based on sequence similarity. E.g after a denovo transcriptome assembly, each transcript could be translated and run through this tool. Prerequisites: 1. A working installation of Interproscan on your Galaxy server/cluster. Limitations: Currently it is setup to work with PFAM only due to the heavy computational demands Interproscan makes. Input formats: The standard interproscan input is either genomic or protein sequences. In the case of genomic sequences Interproscan will of run an ORF prediction tool. However this tends to lose the ORF information (e.g. start/end co-ordinates) from the header. As such the requirement here is to input ORF sequences (e.g. from EMBOSS getorf) and to then replace any spaces in the FASTA header with underscores. This workaround generally preserves the relevant positional information.