annotate interproscan5-e32f2ea6a139/interproscan.xml @ 2:5a720d9e7071 draft default tip

Uploaded
author si-datascience
date Thu, 24 May 2018 14:39:59 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
2
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
1 <tool id="interproscan" name="Interproscan functional predictions of ORFs" version="5.0.0">
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
2 <description>Interproscan functional predictions of ORFs</description>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
3 <requirements>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
4 <requirement type="package">signalp</requirement>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
5 <requirement type="package">phobius</requirement>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
6 <requirement type="package">tmhmm</requirement>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
7 <requirement type="set_environment">INTERPROSCAN_SCRIPT_PATH</requirement>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
8 </requirements>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
9 <command>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
10
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
11 #import os
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
12 ./interproscan.sh
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
13 ## disables the precalculated lookup service, all calculation will be run locally
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
14 -dp
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
15 --input $infile
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
16 --seqtype $seqtype
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
17 -f $oformat
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
18 --applications $appl
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
19 --tempdir \$TEMP
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
20
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
21 $pathways
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
22 $goterms
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
23 $iprlookup
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
24
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
25 #if str($oformat) in ['SVG', 'HTML']:
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
26 --output-file-base $outfile
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
27 2>&#38;1;
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
28 mkdir -p $outfile.files_path;
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
29 #set temp_archive_file = str($outfile) + '.' + str($oformat).lower() + '.tar.gz'
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
30 tar -C $outfile.files_path -xvmzf $temp_archive_file;
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
31 python \$INTERPROSCAN_SCRIPT_PATH/create_index.py $outfile $outfile.files_path;
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
32 rm $temp_archive_file
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
33 #else:
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
34 -o $outfile
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
35 2>&#38;1
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
36 #end if
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
37
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
38 </command>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
39 <inputs>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
40 <param name="infile" type="data" format="fasta" label="Protein Fasta File"/>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
41
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
42 <param name="seqtype" type="select" label="Type of the input sequences" help="">
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
43 <option value="p" selected="true">Protein</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
44 <option value="n">DNA / RNA</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
45 </param>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
46
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
47 <param name="appl" type="select" multiple="True" display="checkboxes" label="Applications to run" help="Select your programm.">
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
48 <option value="TIGRFAM" selected="true">TIGRFAM: protein families based on Hidden Markov Models or HMMs</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
49 <option value="PIRSF" selected="true">PIRSF: non-overlapping clustering of UniProtKB sequences into a hierarchical order (evolutionary relationships)</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
50 <option value="ProDom" selected="true">ProDom: set of protein domain families generated from the UniProtKB</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
51 <option value="Panther" selected="true">Panther: Protein ANalysis THrough Evolutionary Relationships</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
52 <option value="SMART" selected="true">SMART: identification and analysis of domain architectures based on Hidden Markov Models or HMMs</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
53 <option value="PrositeProfiles" selected="true">PROSITE Profiles: protein domains, families and functional sites as well as associated profiles to identify them</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
54 <option value="PrositePatterns" selected="true">PROSITE Pattern: protein domains, families and functional sites as well as associated patterns to identify them</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
55 <option value="HAMAP" selected="true">HAMAP: High-quality Automated Annotation of Microbial Proteomes</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
56 <option value="PfamA" selected="true">PfamA: protein families, each represented by multiple sequence alignments and hidden Markov models</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
57 <option value="PRINTS" selected="true">PRINTS: group of conserved motifs (fingerprints) used to characterise a protein family</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
58 <option value="SuperFamily" selected="true">SUPERFAMILY: database of structural and functional annotation</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
59 <option value="Coils" selected="true">Coils: Prediction of Coiled Coil Regions in Proteins</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
60 <option value="Gene3d" selected="true">Gene3d: Structural assignment for whole genes and genomes using the CATH domain structure database</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
61 <option value="SignalP-GRAM_POSITIVE" selected="false">SignalP Gram Positive Bacteria</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
62 <option value="SignalP-GRAM_NEGATIVE" selected="false">SignalP Gram Negative Bacteria</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
63 <option value="SignalP-EUK" selected="true">SignalP Eukaryotic Bacteria</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
64 <option value="Phobius" selected="true">Phobius: combined transmembrane topology and signal peptide predictor</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
65 <option value="TMHMM" selected="true">TMHMM: Prediction of transmembrane helices in proteins</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
66 </param>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
67
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
68 <param name="pathways" truevalue="--pathways" falsevalue="" checked="True" type="boolean" label="Include pathway information"
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
69 help="Option that provides mappings from matches to pathway information, which is based on the matched manually curated InterPro entries. (--pathways)"/>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
70 <param name="goterms" truevalue="--goterms" falsevalue="" checked="True" type="boolean" label="Include Gene Ontology (GO) mappings"
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
71 help="Look up of corresponding Gene Ontology annotation. Implies -iprlookup option. (--goterms)"/>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
72 <param name="iprlookup" truevalue="--iprlookup" falsevalue="" checked="False" type="boolean"
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
73 label="Provide additional mappings" help="Provide mappings from matched member database signatures to the InterPro entries that they are integrated into (--iprlookup)"/>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
74
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
75 <param name="oformat" type="select" label="Output format" help="Please select a output format.">
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
76 <option value="TSV" selected="true">Tab-separated values format (TSV)</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
77 <option value="GFF3">GFF3</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
78 <option value="SVG">SVG</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
79 <option value="HTML">HTML</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
80 <option value="XML">XML</option>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
81 </param>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
82
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
83 </inputs>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
84 <outputs>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
85
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
86 <data format="tabular" name="outfile" label="Interproscan calculation on ${on_string}">
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
87 <change_format>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
88 <when input="oformat" value="HTML" format="html"/>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
89 <when input="oformat" value="XML" format="xml"/>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
90 <when input="oformat" value="SVG" format="html"/>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
91 <when input="oformat" value="GFF3" format="gff"/>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
92 </change_format>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
93 </data>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
94
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
95 </outputs>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
96 <requirements>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
97 </requirements>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
98 <help>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
99
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
100 **What it does**
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
101
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
102 Interproscan is a batch tool to query the Interpro database. It provides annotations based on multiple searches of profile and other functional databases.
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
103
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
104
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
105 #####
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
106 Input
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
107 #####
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
108
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
109 Required is a FASTA file containing protein or nucleotide sequences.
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
110
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
111
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
112 ######
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
113 Output
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
114 ######
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
115
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
116 In this version of InterProScan_, you can retrieve output in any of the following five formats:
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
117
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
118 * TSV: a simple tab-delimited file format
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
119 * XML: the new "IMPACT" XML format (XSD available here_).
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
120 * GFF: The `GFF 3.0`_ format
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
121 * HTML: An HTML representation of the protein matches
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
122 * SVG: An Scalable Vector Graphics representation of the protein matches
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
123
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
124
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
125 .. _`GFF 3.0`: http://gmod.org/wiki/GFF#GFF3_Format
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
126 .. _here: http://www.ebi.ac.uk/interpro/resources/schemas/interproscan5
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
127
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
128
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
129
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
130 Tab-separated values format (TSV)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
131 =================================
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
132
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
133 Basic tab delimited format.
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
134
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
135
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
136 Example Output
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
137 --------------
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
138
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
139 ::
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
140
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
141 P51587 14086411a2cdf1c4cba63020e1622579 3418 Pfam PF09103 BRCA2, oligonucleotide/oligosaccharide-binding, domain 1 2670 2799 7.9E-43 T 15-03-2013
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
142 P51587 14086411a2cdf1c4cba63020e1622579 3418 ProSiteProfiles PS50138 BRCA2 repeat profile. 1002 1036 0.0 T 18-03-2013 IPR002093 BRCA2 repeat GO:0005515|GO:0006302
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
143 P51587 14086411a2cdf1c4cba63020e1622579 3418 Gene3D G3DSA:2.40.50.140 2966 3051 3.1E-52 T 15-03-2013
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
144 ...
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
145
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
146
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
147 The TSV format presents the match data in columns as follows:
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
148
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
149 - Protein Accession (e.g. P51587)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
150 - Sequence MD5 digest (e.g. 14086411a2cdf1c4cba63020e1622579)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
151 - Sequence Length (e.g. 3418)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
152 - Analysis (e.g. Pfam / PRINTS / Gene3D)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
153 - Signature Accession (e.g. PF09103 / G3DSA:2.40.50.140)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
154 - Signature Description (e.g. BRCA2 repeat profile)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
155 - Start location
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
156 - Stop location
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
157 - Score - is the e-value of the match reported by member database method (e.g. 3.1E-52)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
158 - Status - is the status of the match (T: true)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
159 - Date - is the date of the run
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
160 - (InterProScan_ annotations - accession (e.g. IPR002093) - optional column; only displayed if -iprscan option is switched on)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
161 - (InterProScan_ annotations - description (e.g. BRCA2 repeat) - optional column; only displayed if -iprscan option is switched on)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
162 - (GO annotations (e.g. GO:0005515) - optional column; only displayed if --goterms option is switched on)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
163 - (Pathways annotations (e.g. REACT_71) - optional column; only displayed if --pathways option is switched on)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
164
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
165
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
166 Extensible Markup Language (XML)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
167 ================================
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
168
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
169 XML representation of the matches - this is the richest form of the data. The XML Schema Definition (XSD) is available [http://www.ebi.ac.uk/interpro/resources/schemas/interproscan5 here].
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
170
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
171 Example Output
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
172 --------------
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
173
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
174 .. image:: $PATH_TO_IMAGES/example_xml_output.png
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
175
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
176
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
177
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
178 Generic Feature Format Version 3 (GFF3)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
179 =======================================
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
180
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
181 The GFF3 format is a flat tab-delimited file, which is much richer then the TSV output format. It allows you to trace back from matches to predicted proteins and to nucleic acid sequences. It also contains a FASTA format representation of the predicted protein sequences and their matches. You will find a documentation of all the columns and attributes used on [http://www.sequenceontology.org/gff3.shtml].
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
182
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
183 Example Output
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
184 --------------
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
185
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
186 ::
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
187
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
188 ##gff-version 3
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
189 ##feature-ontology http://song.cvs.sourceforge.net/viewvc/song/ontology/sofa.obo?revision=1.269
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
190 ##sequence-region AACH01000027 1 1347
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
191 ##seqid|source|type|start|end|score|strand|phase|attributes
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
192 AACH01000027 provided_by_user nucleic_acid 1 1347 . + . Name=AACH01000027;md5=b2a7416cb92565c004becb7510f46840;ID=AACH01000027
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
193 AACH01000027 getorf ORF 1 1347 . + . Name=AACH01000027.2_21;Target=pep_AACH01000027_1_1347 1 449;md5=b2a7416cb92565c004becb7510f46840;ID=orf_AACH01000027_1_1347
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
194 AACH01000027 getorf polypeptide 1 449 . + . md5=fd0743a673ac69fb6e5c67a48f264dd5;ID=pep_AACH01000027_1_1347
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
195 AACH01000027 Pfam protein_match 84 314 1.2E-45 + . Name=PF00696;signature_desc=Amino acid kinase family;Target=null 84 314;status=T;ID=match$8_84_314;Ontology_term="GO:0008652";date=15-04-2013;Dbxref="InterPro:IPR001048","Reactome:REACT_13"
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
196 ##sequence-region 2
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
197 ...
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
198 >pep_AACH01000027_1_1347
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
199 LVLLAAFDCIDDTKLVKQIIISEIINSLPNIVNDKYGRKVLLYLLSPRDPAHTVREIIEV
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
200 LQKGDGNAHSKKDTEIRRREMKYKRIVFKVGTSSLTNEDGSLSRSKVKDITQQLAMLHEA
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
201 GHELILVSSGAIAAGFGALGFKKRPTKIADKQASAAVGQGLLLEEYTTNLLLRQIVSAQI
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
202 LLTQDDFVDKRRYKNAHQALSVLLNRGAIPIINENDSVVIDELKVGDNDTLSAQVAAMVQ
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
203 ADLLVFLTDVDGLYTGNPNSDPRAKRLERIETINREIIDMAGGAGSSNGTGGMLTKIKAA
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
204 TIATESGVPVYICSSLKSDSMIEAAEETEDGSYFVAQEKGLRTQKQWLAFYAQSQGSIWV
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
205 DKGAAEALSQYGKSLLLSGIVEAEGVFSYGDIVTVFDKESGKSLGKGRVQFGASALEDML
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
206 RSQKAKGVLIYRDDWISITPEIQLLFTEF
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
207 ...
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
208 >match$8_84_314
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
209 KRIVFKVGTSSLTNEDGSLSRSKVKDITQQLAMLHEAGHELILVSSGAIAAGFGALGFKK
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
210 RPTKIADKQASAAVGQGLLLEEYTTNLLLRQIVSAQILLTQDDFVDKRRYKNAHQALSVL
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
211 LNRGAIPIINENDSVVIDELKVGDNDTLSAQVAAMVQADLLVFLTDVDGLYTGNPNSDPR
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
212 AKRLERIETINREIIDMAGGAGSSNGTGGMLTKIKAATIATESGVPVYICS
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
213
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
214
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
215 Scalable Vector Graphics (SVG) and HyperText Markup Language (HTML)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
216 ====================================================================
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
217
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
218 InterProScan_ 5 outputs a single HTML/SVG file for each protein sequence analysed.
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
219
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
220
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
221 Example Output
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
222 --------------
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
223
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
224 .. image:: $PATH_TO_IMAGES/P51587.svg.png
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
225
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
226 .. _InterProScan: http://www.ebi.ac.uk/interpro
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
227
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
228
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
229 ----------
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
230 References
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
231 ----------
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
232
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
233
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
234 If you use this Galaxy tool in work leading to a scientific publication please
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
235 cite the following papers:
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
236
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
237 Peter J.A. Cock, Björn A. Grüning, Konrad Paszkiewicz and Leighton Pritchard (2013).
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
238 Galaxy tools and workflows for sequence analysis with applications
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
239 in molecular plant pathology. PeerJ 1:e167
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
240 http://dx.doi.org/10.7717/peerj.167
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
241
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
242 Zdobnov EM, Apweiler R (2001)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
243 InterProScan an integration platform for the signature-recognition methods in InterPro.
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
244 Bioinformatics 17, 847-848.
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
245 http://dx.doi.org/10.1093/bioinformatics/17.9.847
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
246
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
247 Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R (2005)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
248 InterProScan: protein domains identifier.
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
249 Nucleic Acids Research 33 (Web Server issue), W116-W120.
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
250 http://dx.doi.org/10.1093/nar/gki442
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
251
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
252 Hunter S, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Das U, Daugherty L, Duquenne L, Finn RD, Gough J, Haft D, Hulo N, Kahn D, Kelly E, Laugraud A, Letunic I, Lonsdale D, Lopez R, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Mulder N, Natale D, Orengo C, Quinn AF, Selengut JD, Sigrist CJ, Thimma M, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C. (2009)
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
253 InterPro: the integrative protein signature database.
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
254 Nucleic Acids Research 37 (Database Issue), D224-228.
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
255 http://dx.doi.org/10.1093/nar/gkn785
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
256
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
257
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
258 This wrapper is available to install into other Galaxy Instances via the Galaxy Tool Shed at
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
259 http://toolshed.g2.bx.psu.edu/view/bgruening/interproscan5
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
260
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
261
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
262 **Galaxy Wrapper Author**::
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
263
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
264 * Bjoern Gruening, University of Freiburg
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
265 * Konrad Paszkiewicz, University of Exeter
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
266
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
267 </help>
5a720d9e7071 Uploaded
si-datascience
parents:
diff changeset
268 </tool>