annotate tools/ncbi_blast_plus/ncbi_makeblastdb.xml @ 9:9dabbfd73c8a draft

Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database. Also includes other minor improvements.
author peterjc
date Thu, 25 Apr 2013 09:38:37 -0400
parents 393a7a35383c
children 70e7dcbf6573
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
9
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
1 <tool id="ncbi_makeblastdb" name="NCBI BLAST+ makeblastdb" version="0.0.4">
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
2 <description>Make BLAST database</description>
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
3 <requirements>
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
4 <requirement type="binary">makeblastdb</requirement>
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
5 <requirement type="package" version="2.2.26+">blast+</requirement>
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
6 </requirements>
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
7 <version_command>makeblastdb -version</version_command>
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
8 <command>
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
9 makeblastdb -out "${os.path.join($outfile.extra_files_path,'blastdb')}"
5
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
10 $parse_seqids
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
11 $hash_index
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
12 ## Single call to -in with multiple filenames space separated with outer quotes
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
13 ## (presumably any filenames with spaces would be a problem). Note this gives
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
14 ## some extra spaces, e.g. -in " file1 file2 file3 " but BLAST seems happy:
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
15 -in "
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
16 #for $i in $in
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
17 ${i.file} #end for
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
18 "
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
19 #if $title:
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
20 -title "$title"
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
21 #else:
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
22 ##Would default to being based on the cryptic Galaxy filenames, which is unhelpful
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
23 -title "BLAST Database"
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
24 #end if
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
25 -dbtype $dbtype
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
26 ## #set $sep = '-mask_data '
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
27 ## #for $i in $mask_data
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
28 ## $sep${i.file}
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
29 ## #set $set = ', '
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
30 ## #end for
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
31 ## #set $sep = '-gi_mask -gi_mask_name '
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
32 ## #for $i in $gi_mask
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
33 ## $sep${i.file}
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
34 ## #set $set = ', '
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
35 ## #end for
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
36 ## #if $tax.select == 'id':
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
37 ## -taxid $tax.id
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
38 ## #else if $tax.select == 'map':
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
39 ## -taxid_map $tax.map
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
40 ## #end if
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
41 </command>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
42 <stdio>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
43 <!-- Anything other than zero is an error -->
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
44 <exit_code range="1:" />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
45 <exit_code range=":-1" />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
46 <!-- In case the return code has not been set propery check stderr too -->
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
47 <regex match="Error:" />
9
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
48 <regex match="Exception:" />
5
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
49 </stdio>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
50 <inputs>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
51 <param name="dbtype" type="select" display="radio" label="Molecule type of input">
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
52 <option value="prot">protein</option>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
53 <option value="nucl">nucleotide</option>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
54 </param>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
55 <!-- TODO Allow merging of existing BLAST databases (conditional on the database type)
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
56 <repeat name="in" title="Blast or Fasta Database" min="1">
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
57 <param name="file" type="data" format="fasta,blastdbn,blastdbp" label="Blast or Fasta database" />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
58 </repeat>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
59 -->
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
60 <repeat name="in" title="FASTA file" min="1">
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
61 <param name="file" type="data" format="fasta" />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
62 </repeat>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
63 <param name="title" type="text" value="" label="Title for BLAST database" help="This is the database name shown in BLAST search output" />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
64 <param name="parse_seqids" type="boolean" truevalue="-parse_seqids" falsevalue="" checked="False" label="Parse the sequence identifiers" help="This is only advised if your FASTA file follows the NCBI naming conventions using pipe '|' symbols" />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
65 <param name="hash_index" type="boolean" truevalue="-hash_index" falsevalue="" checked="true" label="Enable the creation of sequence hash values." help="These hash values can then be used to quickly determine if a given sequence data exists in this BLAST database." />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
66
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
67 <!-- SEQUENCE MASKING OPTIONS -->
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
68 <!-- TODO
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
69 <repeat name="mask_data" title="Provide one or more files containing masking data">
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
70 <param name="file" type="data" format="asnb" label="File containing masking data" help="As produced by NCBI masking applications (e.g. dustmasker, segmasker, windowmasker)" />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
71 </repeat>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
72 <repeat name="gi_mask" title="Create GI indexed masking data">
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
73 <param name="file" type="data" format="asnb" label="Masking data output file" />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
74 </repeat>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
75 -->
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
76
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
77 <!-- TAXONOMY OPTIONS -->
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
78 <!-- TODO
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
79 <conditional name="tax">
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
80 <param name="select" type="select" label="Taxonomy options">
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
81 <option value="">Do not assign sequences to Taxonomy IDs</option>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
82 <option value="id">Assign all sequences to one Taxonomy ID</option>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
83 <option value="map">Supply text file mapping sequence IDs to taxnomy IDs</option>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
84 </param>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
85 <when value="">
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
86 </when>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
87 <when value="id">
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
88 <param name="id" type="integer" value="" label="NCBI taxonomy ID" help="Integer &gt;=0" />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
89 </when>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
90 <when value="map">
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
91 <param name="file" type="data" format="txt" label="Seq ID : Tax ID mapping file" help="Format: SequenceId TaxonomyId" />
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
92 </when>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
93 </conditional>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
94 -->
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
95 </inputs>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
96 <outputs>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
97 <!-- If we only accepted one FASTA file, we could use its human name here... -->
9
9dabbfd73c8a Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents: 5
diff changeset
98 <data name="outfile" format="data" label="${dbtype.value_label} BLAST database from ${on_string}">
5
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
99 <change_format>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
100 <when input="dbtype" value="nucl" format="blastdbn"/>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
101 <when input="dbtype" value="prot" format="blastdbp"/>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
102 </change_format>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
103 </data>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
104 </outputs>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
105 <help>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
106 **What it does**
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
107
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
108 Make BLAST database from one or more FASTA files and/or BLAST databases.
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
109
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
110 This is a wrapper for the NCBI BLAST+ tool 'makeblastdb', which is the
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
111 replacement for the 'formatdb' tool in the NCBI 'legacy' BLAST suite.
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
112
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
113 <!--
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
114 Applying masks to an existing BLAST database will not change the original database; a new database will be created.
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
115 For this reason, it's best to apply all masks at once to minimize the number of unnecessary intermediate databases.
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
116 -->
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
117
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
118 **Documentation**
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
119
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
120 http://www.ncbi.nlm.nih.gov/books/NBK1763/
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
121 </help>
393a7a35383c Uploaded v0.0.14 adding local BLAST database support.
peterjc
parents:
diff changeset
122 </tool>