annotate interpro/paso3.xml @ 0:c342ebb50f0b draft default tip

Uploaded
author fernando
date Thu, 22 May 2014 05:09:07 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
1 <tool id="CLaGiFer_3" name="Sequences attributes" version="1.0.0">
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
2 <description>Download gff file from InterPro</description>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
3 <command interpreter="bash">
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
4 ./paso3.sh "$infile" "$outfile"
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
5 </command>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
6
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
7 <inputs>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
8 <param name="infile" type="data" format="fasta" label="Fasta file"/>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
9 </inputs>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
10 <outputs>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
11 <data format="gff" name="outfile"/>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
12 </outputs>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
13
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
14 <stdio><exit_code range="1:" level="fatal" description="Error" /></stdio>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
15 <help>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
16
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
17
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
18 **What it does**
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
19
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
20 Interproscan is a batch tool to query the Interpro database. It provides annotations based on multiple searches of profile and other functional databases.
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
21
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
22
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
23 **Dependencies**
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
24
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
25 InterProscan package is required to be installed (http://code.google.com/p/interproscan/wiki/HowToDownload).
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
26
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
27
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
28
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
29 #####
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
30 Input
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
31 #####
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
32
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
33 A FASTA file containing protein sequences is required.
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
34
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
35
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
36 ######
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
37 Output
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
38 ######
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
39
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
40 Generic Feature Format Version 3 (GFF3)
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
41
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
42 The GFF3 format is a flat tab-delimited file, which is much richer then the TSV output format. It allows you to trace back from matches to predicted proteins and to nucleic acid sequences. It also contains a FASTA format representation of the predicted protein sequences and their matches. You will find a documentation of all the columns and attributes used on [http://www.sequenceontology.org/gff3.shtml].
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
43
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
44 Example Output
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
45 --------------
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
46
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
47 ::
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
48
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
49 ##gff-version 3
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
50 ##feature-ontology http://song.cvs.sourceforge.net/viewvc/song/ontology/sofa.obo?revision=1.269
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
51 ##sequence-region AACH01000027 1 1347
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
52 ##seqid|source|type|start|end|score|strand|phase|attributes
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
53 AACH01000027 provided_by_user nucleic_acid 1 1347 . + . Name=AACH01000027;md5=b2a7416cb92565c004becb7510f46840;ID=AACH01000027
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
54 AACH01000027 getorf ORF 1 1347 . + . Name=AACH01000027.2_21;Target=pep_AACH01000027_1_1347 1 449;md5=b2a7416cb92565c004becb7510f46840;ID=orf_AACH01000027_1_1347
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
55 AACH01000027 getorf polypeptide 1 449 . + . md5=fd0743a673ac69fb6e5c67a48f264dd5;ID=pep_AACH01000027_1_1347
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
56 AACH01000027 Pfam protein_match 84 314 1.2E-45 + . Name=PF00696;signature_desc=Amino acid kinase family;Target=null 84 314;status=T;ID=match$8_84_314;Ontology_term="GO:0008652";date=15-04-2013;Dbxref="InterPro:IPR001048","Reactome:REACT_13"
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
57 ##sequence-region 2
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
58 ...
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
59 >pep_AACH01000027_1_1347
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
60 LVLLAAFDCIDDTKLVKQIIISEIINSLPNIVNDKYGRKVLLYLLSPRDPAHTVREIIEV
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
61 LQKGDGNAHSKKDTEIRRREMKYKRIVFKVGTSSLTNEDGSLSRSKVKDITQQLAMLHEA
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
62 GHELILVSSGAIAAGFGALGFKKRPTKIADKQASAAVGQGLLLEEYTTNLLLRQIVSAQI
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
63 LLTQDDFVDKRRYKNAHQALSVLLNRGAIPIINENDSVVIDELKVGDNDTLSAQVAAMVQ
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
64 ADLLVFLTDVDGLYTGNPNSDPRAKRLERIETINREIIDMAGGAGSSNGTGGMLTKIKAA
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
65 TIATESGVPVYICSSLKSDSMIEAAEETEDGSYFVAQEKGLRTQKQWLAFYAQSQGSIWV
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
66 DKGAAEALSQYGKSLLLSGIVEAEGVFSYGDIVTVFDKESGKSLGKGRVQFGASALEDML
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
67 RSQKAKGVLIYRDDWISITPEIQLLFTEF
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
68 ...
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
69 >match$8_84_314
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
70 KRIVFKVGTSSLTNEDGSLSRSKVKDITQQLAMLHEAGHELILVSSGAIAAGFGALGFKK
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
71 RPTKIADKQASAAVGQGLLLEEYTTNLLLRQIVSAQILLTQDDFVDKRRYKNAHQALSVL
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
72 LNRGAIPIINENDSVVIDELKVGDNDTLSAQVAAMVQADLLVFLTDVDGLYTGNPNSDPR
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
73 AKRLERIETINREIIDMAGGAGSSNGTGGMLTKIKAATIATESGVPVYICS
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
74
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
75
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
76
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
77 ----------
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
78 References
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
79 ----------
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
80
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
81
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
82 If you use this Galaxy tool in work leading to a scientific publication please
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
83 cite the following papers:
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
84
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
85 Peter J.A. Cock, Björn A. Grüning, Konrad Paszkiewicz and Leighton Pritchard (2013).
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
86 Galaxy tools and workflows for sequence analysis with applications
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
87 in molecular plant pathology. PeerJ 1:e167
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
88 http://dx.doi.org/10.7717/peerj.167
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
89
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
90 Zdobnov EM, Apweiler R (2001)
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
91 InterProScan an integration platform for the signature-recognition methods in InterPro.
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
92 Bioinformatics 17, 847-848.
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
93 http://dx.doi.org/10.1093/bioinformatics/17.9.847
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
94
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
95 Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R (2005)
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
96 InterProScan: protein domains identifier.
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
97 Nucleic Acids Research 33 (Web Server issue), W116-W120.
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
98 http://dx.doi.org/10.1093/nar/gki442
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
99
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
100 Hunter S, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Das U, Daugherty L, Duquenne L, Finn RD, Gough J, Haft D, Hulo N, Kahn D, Kelly E, Laugraud A, Letunic I, Lonsdale D, Lopez R, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Mulder N, Natale D, Orengo C, Quinn AF, Selengut JD, Sigrist CJ, Thimma M, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C. (2009)
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
101 InterPro: the integrative protein signature database.
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
102 Nucleic Acids Research 37 (Database Issue), D224-228.
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
103 http://dx.doi.org/10.1093/nar/gkn785
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
104
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
105
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
106 This wrapper is available to install into other Galaxy Instances via the Galaxy Tool Shed at
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
107 http://toolshed.g2.bx.psu.edu/view/bgruening/interproscan5
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
108
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
109
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
110 **Galaxy Wrapper Author**::
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
111
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
112 * Fernando Pérez
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
113 * Ginés Almagro
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
114 * Laura Entrambasaguas
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
115 </help>
c342ebb50f0b Uploaded
fernando
parents:
diff changeset
116 </tool>