comparison tools/ncbi_blast_plus/blastxml_to_tabular.xml @ 11:4c4a0da938ff draft

Uploaded v0.0.22, now wraps BLAST+ 2.2.28 allowing extended tabular output to include the hit descriptions as column 25. Supports $GALAXY_SLOTS. Includes more tests and heavy use of macros.
author peterjc
date Thu, 05 Dec 2013 06:55:59 -0500
parents 70e7dcbf6573
children 623f727cdff1
comparison
equal deleted inserted replaced
10:70e7dcbf6573 11:4c4a0da938ff
1 <tool id="blastxml_to_tabular" name="BLAST XML to tabular" version="0.0.11"> 1 <tool id="blastxml_to_tabular" name="BLAST XML to tabular" version="0.0.22">
2 <description>Convert BLAST XML output to tabular</description> 2 <description>Convert BLAST XML output to tabular</description>
3 <version_command interpreter="python">blastxml_to_tabular.py --version</version_command> 3 <version_command interpreter="python">blastxml_to_tabular.py --version</version_command>
4 <command interpreter="python"> 4 <command interpreter="python">
5 blastxml_to_tabular.py $blastxml_file $tabular_file $out_format 5 blastxml_to_tabular.py $blastxml_file $tabular_file $out_format
6 </command> 6 </command>
15 <option value="std">Tabular (standard 12 columns)</option> 15 <option value="std">Tabular (standard 12 columns)</option>
16 <option value="ext" selected="True">Tabular (extended 24 columns)</option> 16 <option value="ext" selected="True">Tabular (extended 24 columns)</option>
17 </param> 17 </param>
18 </inputs> 18 </inputs>
19 <outputs> 19 <outputs>
20 <data name="tabular_file" format="tabular" label="BLAST results as tabular" /> 20 <data name="tabular_file" format="tabular" label="$blastxml_file.display_name (as tabular)" />
21 </outputs> 21 </outputs>
22 <requirements> 22 <requirements>
23 </requirements> 23 </requirements>
24 <tests> 24 <tests>
25 <test> 25 <test>
68 <param name="blastxml_file" value="blastp_human_vs_pdb_seg_no.xml" ftype="blastxml" /> 68 <param name="blastxml_file" value="blastp_human_vs_pdb_seg_no.xml" ftype="blastxml" />
69 <param name="out_format" value="ext" /> 69 <param name="out_format" value="ext" />
70 <!-- Note this has some white space differences from the actual blastp output --> 70 <!-- Note this has some white space differences from the actual blastp output -->
71 <output name="tabular_file" file="blastp_human_vs_pdb_seg_no_converted_ext.tabular" ftype="tabular" /> 71 <output name="tabular_file" file="blastp_human_vs_pdb_seg_no_converted_ext.tabular" ftype="tabular" />
72 </test> 72 </test>
73 <test>
74 <param name="blastxml_file" value="blastn_arabidopsis.xml" ftype="blastxml" />
75 <param name="out_format" value="std" />
76 <output name="tabular_file" file="blastn_arabidopsis.standard.tabular" ftype="tabular" />
77 </test>
78 <test>
79 <param name="blastxml_file" value="blastn_arabidopsis.xml" ftype="blastxml" />
80 <param name="out_format" value="ext" />
81 <output name="tabular_file" file="blastn_arabidopsis.extended.tabular" ftype="tabular" />
82 </test>
73 </tests> 83 </tests>
74 <help> 84 <help>
75 85
76 **What it does** 86 **What it does**
77 87
102 112
103 The BLAST+ tools can optionally output additional columns of information, 113 The BLAST+ tools can optionally output additional columns of information,
104 but this takes longer to calculate. Most (but not all) of these columns are 114 but this takes longer to calculate. Most (but not all) of these columns are
105 included by selecting the extended tabular output. The extra columns are 115 included by selecting the extended tabular output. The extra columns are
106 included *after* the standard 12 columns. This is so that you can write 116 included *after* the standard 12 columns. This is so that you can write
107 workflow filtering steps that accept either the 12 or 22 column tabular 117 workflow filtering steps that accept either the 12 or 25 column tabular
108 BLAST output. This tool now uses this extended 24 column output by default. 118 BLAST output. This tool now uses this extended 25 column output by default.
109 119
110 ====== ============= =========================================== 120 ====== ============= ===========================================
111 Column NCBI name Description 121 Column NCBI name Description
112 ------ ------------- ------------------------------------------- 122 ------ ------------- -------------------------------------------
113 13 sallseqid All subject Seq-id(s), separated by a ';' 123 13 sallseqid All subject Seq-id(s), separated by ';'
114 14 score Raw score 124 14 score Raw score
115 15 nident Number of identical matches 125 15 nident Number of identical matches
116 16 positive Number of positive-scoring matches 126 16 positive Number of positive-scoring matches
117 17 gaps Total number of gaps 127 17 gaps Total number of gaps
118 18 ppos Percentage of positive-scoring matches 128 18 ppos Percentage of positive-scoring matches
120 20 sframe Subject frame 130 20 sframe Subject frame
121 21 qseq Aligned part of query sequence 131 21 qseq Aligned part of query sequence
122 22 sseq Aligned part of subject sequence 132 22 sseq Aligned part of subject sequence
123 23 qlen Query sequence length 133 23 qlen Query sequence length
124 24 slen Subject sequence length 134 24 slen Subject sequence length
135 25 salltitles All subject title(s), separated by '&lt;&gt;'
125 ====== ============= =========================================== 136 ====== ============= ===========================================
126 137
127 Beware that the XML file (and thus the conversion) and the tabular output 138 Beware that the XML file (and thus the conversion) and the tabular output
128 direct from BLAST+ may differ in the presence of XXXX masking on regions 139 direct from BLAST+ may differ in the presence of XXXX masking on regions
129 low complexity (columns 21 and 22), and thus also calculated figures like 140 low complexity (columns 21 and 22), and thus also calculated figures like