Mercurial > repos > devteam > ncbi_blast_plus
annotate tools/ncbi_blast_plus/ncbi_makeblastdb.xml @ 9:9dabbfd73c8a draft
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
Also includes other minor improvements.
author | peterjc |
---|---|
date | Thu, 25 Apr 2013 09:38:37 -0400 |
parents | 393a7a35383c |
children | 70e7dcbf6573 |
rev | line source |
---|---|
9
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
1 <tool id="ncbi_makeblastdb" name="NCBI BLAST+ makeblastdb" version="0.0.4"> |
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
2 <description>Make BLAST database</description> |
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
3 <requirements> |
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
4 <requirement type="binary">makeblastdb</requirement> |
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
5 <requirement type="package" version="2.2.26+">blast+</requirement> |
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
6 </requirements> |
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
7 <version_command>makeblastdb -version</version_command> |
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
8 <command> |
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
9 makeblastdb -out "${os.path.join($outfile.extra_files_path,'blastdb')}" |
5 | 10 $parse_seqids |
11 $hash_index | |
12 ## Single call to -in with multiple filenames space separated with outer quotes | |
13 ## (presumably any filenames with spaces would be a problem). Note this gives | |
14 ## some extra spaces, e.g. -in " file1 file2 file3 " but BLAST seems happy: | |
15 -in " | |
16 #for $i in $in | |
17 ${i.file} #end for | |
18 " | |
19 #if $title: | |
20 -title "$title" | |
21 #else: | |
22 ##Would default to being based on the cryptic Galaxy filenames, which is unhelpful | |
23 -title "BLAST Database" | |
24 #end if | |
25 -dbtype $dbtype | |
26 ## #set $sep = '-mask_data ' | |
27 ## #for $i in $mask_data | |
28 ## $sep${i.file} | |
29 ## #set $set = ', ' | |
30 ## #end for | |
31 ## #set $sep = '-gi_mask -gi_mask_name ' | |
32 ## #for $i in $gi_mask | |
33 ## $sep${i.file} | |
34 ## #set $set = ', ' | |
35 ## #end for | |
36 ## #if $tax.select == 'id': | |
37 ## -taxid $tax.id | |
38 ## #else if $tax.select == 'map': | |
39 ## -taxid_map $tax.map | |
40 ## #end if | |
41 </command> | |
42 <stdio> | |
43 <!-- Anything other than zero is an error --> | |
44 <exit_code range="1:" /> | |
45 <exit_code range=":-1" /> | |
46 <!-- In case the return code has not been set propery check stderr too --> | |
47 <regex match="Error:" /> | |
9
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
48 <regex match="Exception:" /> |
5 | 49 </stdio> |
50 <inputs> | |
51 <param name="dbtype" type="select" display="radio" label="Molecule type of input"> | |
52 <option value="prot">protein</option> | |
53 <option value="nucl">nucleotide</option> | |
54 </param> | |
55 <!-- TODO Allow merging of existing BLAST databases (conditional on the database type) | |
56 <repeat name="in" title="Blast or Fasta Database" min="1"> | |
57 <param name="file" type="data" format="fasta,blastdbn,blastdbp" label="Blast or Fasta database" /> | |
58 </repeat> | |
59 --> | |
60 <repeat name="in" title="FASTA file" min="1"> | |
61 <param name="file" type="data" format="fasta" /> | |
62 </repeat> | |
63 <param name="title" type="text" value="" label="Title for BLAST database" help="This is the database name shown in BLAST search output" /> | |
64 <param name="parse_seqids" type="boolean" truevalue="-parse_seqids" falsevalue="" checked="False" label="Parse the sequence identifiers" help="This is only advised if your FASTA file follows the NCBI naming conventions using pipe '|' symbols" /> | |
65 <param name="hash_index" type="boolean" truevalue="-hash_index" falsevalue="" checked="true" label="Enable the creation of sequence hash values." help="These hash values can then be used to quickly determine if a given sequence data exists in this BLAST database." /> | |
66 | |
67 <!-- SEQUENCE MASKING OPTIONS --> | |
68 <!-- TODO | |
69 <repeat name="mask_data" title="Provide one or more files containing masking data"> | |
70 <param name="file" type="data" format="asnb" label="File containing masking data" help="As produced by NCBI masking applications (e.g. dustmasker, segmasker, windowmasker)" /> | |
71 </repeat> | |
72 <repeat name="gi_mask" title="Create GI indexed masking data"> | |
73 <param name="file" type="data" format="asnb" label="Masking data output file" /> | |
74 </repeat> | |
75 --> | |
76 | |
77 <!-- TAXONOMY OPTIONS --> | |
78 <!-- TODO | |
79 <conditional name="tax"> | |
80 <param name="select" type="select" label="Taxonomy options"> | |
81 <option value="">Do not assign sequences to Taxonomy IDs</option> | |
82 <option value="id">Assign all sequences to one Taxonomy ID</option> | |
83 <option value="map">Supply text file mapping sequence IDs to taxnomy IDs</option> | |
84 </param> | |
85 <when value=""> | |
86 </when> | |
87 <when value="id"> | |
88 <param name="id" type="integer" value="" label="NCBI taxonomy ID" help="Integer >=0" /> | |
89 </when> | |
90 <when value="map"> | |
91 <param name="file" type="data" format="txt" label="Seq ID : Tax ID mapping file" help="Format: SequenceId TaxonomyId" /> | |
92 </when> | |
93 </conditional> | |
94 --> | |
95 </inputs> | |
96 <outputs> | |
97 <!-- If we only accepted one FASTA file, we could use its human name here... --> | |
9
9dabbfd73c8a
Uploaded v0.0.19, adds wrappers for rpsblast and rpstblastn with new blastdb_d.loc file for their protein domain database.
peterjc
parents:
5
diff
changeset
|
98 <data name="outfile" format="data" label="${dbtype.value_label} BLAST database from ${on_string}"> |
5 | 99 <change_format> |
100 <when input="dbtype" value="nucl" format="blastdbn"/> | |
101 <when input="dbtype" value="prot" format="blastdbp"/> | |
102 </change_format> | |
103 </data> | |
104 </outputs> | |
105 <help> | |
106 **What it does** | |
107 | |
108 Make BLAST database from one or more FASTA files and/or BLAST databases. | |
109 | |
110 This is a wrapper for the NCBI BLAST+ tool 'makeblastdb', which is the | |
111 replacement for the 'formatdb' tool in the NCBI 'legacy' BLAST suite. | |
112 | |
113 <!-- | |
114 Applying masks to an existing BLAST database will not change the original database; a new database will be created. | |
115 For this reason, it's best to apply all masks at once to minimize the number of unnecessary intermediate databases. | |
116 --> | |
117 | |
118 **Documentation** | |
119 | |
120 http://www.ncbi.nlm.nih.gov/books/NBK1763/ | |
121 </help> | |
122 </tool> |