Mercurial > repos > devteam > ncbi_blast_plus
comparison tools/ncbi_blast_plus/blastxml_to_tabular.xml @ 8:1f546099212f draft
Uploaded v0.0.17, default to extended 24 column tabular output (rather than standard 12 column output). This should avoid many cases of repeated BLAST jobs being run due to later needing the extra columns.
author | peterjc |
---|---|
date | Tue, 19 Feb 2013 12:49:43 -0500 |
parents | 643338ac83c0 |
children | 70e7dcbf6573 |
comparison
equal
deleted
inserted
replaced
7:4ce66a5401d0 | 8:1f546099212f |
---|---|
1 <tool id="blastxml_to_tabular" name="BLAST XML to tabular" version="0.0.8"> | 1 <tool id="blastxml_to_tabular" name="BLAST XML to tabular" version="0.0.10"> |
2 <description>Convert BLAST XML output to tabular</description> | 2 <description>Convert BLAST XML output to tabular</description> |
3 <command interpreter="python"> | 3 <command interpreter="python"> |
4 blastxml_to_tabular.py $blastxml_file $tabular_file $out_format | 4 blastxml_to_tabular.py $blastxml_file $tabular_file $out_format |
5 </command> | 5 </command> |
6 <stdio> | |
7 <!-- Anything other than zero is an error --> | |
8 <exit_code range="1:" /> | |
9 <exit_code range=":-1" /> | |
10 </stdio> | |
6 <inputs> | 11 <inputs> |
7 <param name="blastxml_file" type="data" format="blastxml" label="BLAST results as XML"/> | 12 <param name="blastxml_file" type="data" format="blastxml" label="BLAST results as XML"/> |
8 <param name="out_format" type="select" label="Output format"> | 13 <param name="out_format" type="select" label="Output format"> |
9 <option value="std" selected="True">Tabular (standard 12 columns)</option> | 14 <option value="std">Tabular (standard 12 columns)</option> |
10 <option value="ext">Tabular (extended 24 columns)</option> | 15 <option value="ext" selected="True">Tabular (extended 24 columns)</option> |
11 </param> | 16 </param> |
12 </inputs> | 17 </inputs> |
13 <outputs> | 18 <outputs> |
14 <data name="tabular_file" format="tabular" label="BLAST results as tabular" /> | 19 <data name="tabular_file" format="tabular" label="BLAST results as tabular" /> |
15 </outputs> | 20 </outputs> |
72 NCBI BLAST+ (and the older NCBI 'legacy' BLAST) can output in a range of | 77 NCBI BLAST+ (and the older NCBI 'legacy' BLAST) can output in a range of |
73 formats including tabular and a more detailed XML format. A complex workflow | 78 formats including tabular and a more detailed XML format. A complex workflow |
74 may need both the XML and the tabular output - but running BLAST twice is | 79 may need both the XML and the tabular output - but running BLAST twice is |
75 slow and wasteful. | 80 slow and wasteful. |
76 | 81 |
77 This tool takes the BLAST XML output and by default converts it into the | 82 This tool takes the BLAST XML output and can convert it into the |
78 standard 12 column tabular equivalent: | 83 standard 12 column tabular equivalent: |
79 | 84 |
80 ====== ========= ============================================ | 85 ====== ========= ============================================ |
81 Column NCBI name Description | 86 Column NCBI name Description |
82 ------ --------- -------------------------------------------- | 87 ------ --------- -------------------------------------------- |
97 The BLAST+ tools can optionally output additional columns of information, | 102 The BLAST+ tools can optionally output additional columns of information, |
98 but this takes longer to calculate. Most (but not all) of these columns are | 103 but this takes longer to calculate. Most (but not all) of these columns are |
99 included by selecting the extended tabular output. The extra columns are | 104 included by selecting the extended tabular output. The extra columns are |
100 included *after* the standard 12 columns. This is so that you can write | 105 included *after* the standard 12 columns. This is so that you can write |
101 workflow filtering steps that accept either the 12 or 22 column tabular | 106 workflow filtering steps that accept either the 12 or 22 column tabular |
102 BLAST output. | 107 BLAST output. This tool now uses this extended 24 column output by default. |
103 | 108 |
104 ====== ============= =========================================== | 109 ====== ============= =========================================== |
105 Column NCBI name Description | 110 Column NCBI name Description |
106 ------ ------------- ------------------------------------------- | 111 ------ ------------- ------------------------------------------- |
107 13 sallseqid All subject Seq-id(s), separated by a ';' | 112 13 sallseqid All subject Seq-id(s), separated by a ';' |
119 ====== ============= =========================================== | 124 ====== ============= =========================================== |
120 | 125 |
121 Beware that the XML file (and thus the conversion) and the tabular output | 126 Beware that the XML file (and thus the conversion) and the tabular output |
122 direct from BLAST+ may differ in the presence of XXXX masking on regions | 127 direct from BLAST+ may differ in the presence of XXXX masking on regions |
123 low complexity (columns 21 and 22), and thus also calculated figures like | 128 low complexity (columns 21 and 22), and thus also calculated figures like |
124 the percentage idenity (column 3). | 129 the percentage identity (column 3). |
125 | 130 |
126 </help> | 131 </help> |
127 </tool> | 132 </tool> |