comparison tools/ncbi_blast_plus/blastxml_to_tabular.xml @ 8:1f546099212f draft

Uploaded v0.0.17, default to extended 24 column tabular output (rather than standard 12 column output). This should avoid many cases of repeated BLAST jobs being run due to later needing the extra columns.
author peterjc
date Tue, 19 Feb 2013 12:49:43 -0500
parents 643338ac83c0
children 70e7dcbf6573
comparison
equal deleted inserted replaced
7:4ce66a5401d0 8:1f546099212f
1 <tool id="blastxml_to_tabular" name="BLAST XML to tabular" version="0.0.8"> 1 <tool id="blastxml_to_tabular" name="BLAST XML to tabular" version="0.0.10">
2 <description>Convert BLAST XML output to tabular</description> 2 <description>Convert BLAST XML output to tabular</description>
3 <command interpreter="python"> 3 <command interpreter="python">
4 blastxml_to_tabular.py $blastxml_file $tabular_file $out_format 4 blastxml_to_tabular.py $blastxml_file $tabular_file $out_format
5 </command> 5 </command>
6 <stdio>
7 <!-- Anything other than zero is an error -->
8 <exit_code range="1:" />
9 <exit_code range=":-1" />
10 </stdio>
6 <inputs> 11 <inputs>
7 <param name="blastxml_file" type="data" format="blastxml" label="BLAST results as XML"/> 12 <param name="blastxml_file" type="data" format="blastxml" label="BLAST results as XML"/>
8 <param name="out_format" type="select" label="Output format"> 13 <param name="out_format" type="select" label="Output format">
9 <option value="std" selected="True">Tabular (standard 12 columns)</option> 14 <option value="std">Tabular (standard 12 columns)</option>
10 <option value="ext">Tabular (extended 24 columns)</option> 15 <option value="ext" selected="True">Tabular (extended 24 columns)</option>
11 </param> 16 </param>
12 </inputs> 17 </inputs>
13 <outputs> 18 <outputs>
14 <data name="tabular_file" format="tabular" label="BLAST results as tabular" /> 19 <data name="tabular_file" format="tabular" label="BLAST results as tabular" />
15 </outputs> 20 </outputs>
72 NCBI BLAST+ (and the older NCBI 'legacy' BLAST) can output in a range of 77 NCBI BLAST+ (and the older NCBI 'legacy' BLAST) can output in a range of
73 formats including tabular and a more detailed XML format. A complex workflow 78 formats including tabular and a more detailed XML format. A complex workflow
74 may need both the XML and the tabular output - but running BLAST twice is 79 may need both the XML and the tabular output - but running BLAST twice is
75 slow and wasteful. 80 slow and wasteful.
76 81
77 This tool takes the BLAST XML output and by default converts it into the 82 This tool takes the BLAST XML output and can convert it into the
78 standard 12 column tabular equivalent: 83 standard 12 column tabular equivalent:
79 84
80 ====== ========= ============================================ 85 ====== ========= ============================================
81 Column NCBI name Description 86 Column NCBI name Description
82 ------ --------- -------------------------------------------- 87 ------ --------- --------------------------------------------
97 The BLAST+ tools can optionally output additional columns of information, 102 The BLAST+ tools can optionally output additional columns of information,
98 but this takes longer to calculate. Most (but not all) of these columns are 103 but this takes longer to calculate. Most (but not all) of these columns are
99 included by selecting the extended tabular output. The extra columns are 104 included by selecting the extended tabular output. The extra columns are
100 included *after* the standard 12 columns. This is so that you can write 105 included *after* the standard 12 columns. This is so that you can write
101 workflow filtering steps that accept either the 12 or 22 column tabular 106 workflow filtering steps that accept either the 12 or 22 column tabular
102 BLAST output. 107 BLAST output. This tool now uses this extended 24 column output by default.
103 108
104 ====== ============= =========================================== 109 ====== ============= ===========================================
105 Column NCBI name Description 110 Column NCBI name Description
106 ------ ------------- ------------------------------------------- 111 ------ ------------- -------------------------------------------
107 13 sallseqid All subject Seq-id(s), separated by a ';' 112 13 sallseqid All subject Seq-id(s), separated by a ';'
119 ====== ============= =========================================== 124 ====== ============= ===========================================
120 125
121 Beware that the XML file (and thus the conversion) and the tabular output 126 Beware that the XML file (and thus the conversion) and the tabular output
122 direct from BLAST+ may differ in the presence of XXXX masking on regions 127 direct from BLAST+ may differ in the presence of XXXX masking on regions
123 low complexity (columns 21 and 22), and thus also calculated figures like 128 low complexity (columns 21 and 22), and thus also calculated figures like
124 the percentage idenity (column 3). 129 the percentage identity (column 3).
125 130
126 </help> 131 </help>
127 </tool> 132 </tool>