Mercurial > repos > devteam > ncbi_blast_plus
comparison tools/ncbi_blast_plus/blastxml_to_tabular.xml @ 11:4c4a0da938ff draft
Uploaded v0.0.22, now wraps BLAST+ 2.2.28 allowing extended tabular output to include the hit descriptions as column 25.
Supports $GALAXY_SLOTS.
Includes more tests and heavy use of macros.
author | peterjc |
---|---|
date | Thu, 05 Dec 2013 06:55:59 -0500 |
parents | 70e7dcbf6573 |
children | 623f727cdff1 |
comparison
equal
deleted
inserted
replaced
10:70e7dcbf6573 | 11:4c4a0da938ff |
---|---|
1 <tool id="blastxml_to_tabular" name="BLAST XML to tabular" version="0.0.11"> | 1 <tool id="blastxml_to_tabular" name="BLAST XML to tabular" version="0.0.22"> |
2 <description>Convert BLAST XML output to tabular</description> | 2 <description>Convert BLAST XML output to tabular</description> |
3 <version_command interpreter="python">blastxml_to_tabular.py --version</version_command> | 3 <version_command interpreter="python">blastxml_to_tabular.py --version</version_command> |
4 <command interpreter="python"> | 4 <command interpreter="python"> |
5 blastxml_to_tabular.py $blastxml_file $tabular_file $out_format | 5 blastxml_to_tabular.py $blastxml_file $tabular_file $out_format |
6 </command> | 6 </command> |
15 <option value="std">Tabular (standard 12 columns)</option> | 15 <option value="std">Tabular (standard 12 columns)</option> |
16 <option value="ext" selected="True">Tabular (extended 24 columns)</option> | 16 <option value="ext" selected="True">Tabular (extended 24 columns)</option> |
17 </param> | 17 </param> |
18 </inputs> | 18 </inputs> |
19 <outputs> | 19 <outputs> |
20 <data name="tabular_file" format="tabular" label="BLAST results as tabular" /> | 20 <data name="tabular_file" format="tabular" label="$blastxml_file.display_name (as tabular)" /> |
21 </outputs> | 21 </outputs> |
22 <requirements> | 22 <requirements> |
23 </requirements> | 23 </requirements> |
24 <tests> | 24 <tests> |
25 <test> | 25 <test> |
68 <param name="blastxml_file" value="blastp_human_vs_pdb_seg_no.xml" ftype="blastxml" /> | 68 <param name="blastxml_file" value="blastp_human_vs_pdb_seg_no.xml" ftype="blastxml" /> |
69 <param name="out_format" value="ext" /> | 69 <param name="out_format" value="ext" /> |
70 <!-- Note this has some white space differences from the actual blastp output --> | 70 <!-- Note this has some white space differences from the actual blastp output --> |
71 <output name="tabular_file" file="blastp_human_vs_pdb_seg_no_converted_ext.tabular" ftype="tabular" /> | 71 <output name="tabular_file" file="blastp_human_vs_pdb_seg_no_converted_ext.tabular" ftype="tabular" /> |
72 </test> | 72 </test> |
73 <test> | |
74 <param name="blastxml_file" value="blastn_arabidopsis.xml" ftype="blastxml" /> | |
75 <param name="out_format" value="std" /> | |
76 <output name="tabular_file" file="blastn_arabidopsis.standard.tabular" ftype="tabular" /> | |
77 </test> | |
78 <test> | |
79 <param name="blastxml_file" value="blastn_arabidopsis.xml" ftype="blastxml" /> | |
80 <param name="out_format" value="ext" /> | |
81 <output name="tabular_file" file="blastn_arabidopsis.extended.tabular" ftype="tabular" /> | |
82 </test> | |
73 </tests> | 83 </tests> |
74 <help> | 84 <help> |
75 | 85 |
76 **What it does** | 86 **What it does** |
77 | 87 |
102 | 112 |
103 The BLAST+ tools can optionally output additional columns of information, | 113 The BLAST+ tools can optionally output additional columns of information, |
104 but this takes longer to calculate. Most (but not all) of these columns are | 114 but this takes longer to calculate. Most (but not all) of these columns are |
105 included by selecting the extended tabular output. The extra columns are | 115 included by selecting the extended tabular output. The extra columns are |
106 included *after* the standard 12 columns. This is so that you can write | 116 included *after* the standard 12 columns. This is so that you can write |
107 workflow filtering steps that accept either the 12 or 22 column tabular | 117 workflow filtering steps that accept either the 12 or 25 column tabular |
108 BLAST output. This tool now uses this extended 24 column output by default. | 118 BLAST output. This tool now uses this extended 25 column output by default. |
109 | 119 |
110 ====== ============= =========================================== | 120 ====== ============= =========================================== |
111 Column NCBI name Description | 121 Column NCBI name Description |
112 ------ ------------- ------------------------------------------- | 122 ------ ------------- ------------------------------------------- |
113 13 sallseqid All subject Seq-id(s), separated by a ';' | 123 13 sallseqid All subject Seq-id(s), separated by ';' |
114 14 score Raw score | 124 14 score Raw score |
115 15 nident Number of identical matches | 125 15 nident Number of identical matches |
116 16 positive Number of positive-scoring matches | 126 16 positive Number of positive-scoring matches |
117 17 gaps Total number of gaps | 127 17 gaps Total number of gaps |
118 18 ppos Percentage of positive-scoring matches | 128 18 ppos Percentage of positive-scoring matches |
120 20 sframe Subject frame | 130 20 sframe Subject frame |
121 21 qseq Aligned part of query sequence | 131 21 qseq Aligned part of query sequence |
122 22 sseq Aligned part of subject sequence | 132 22 sseq Aligned part of subject sequence |
123 23 qlen Query sequence length | 133 23 qlen Query sequence length |
124 24 slen Subject sequence length | 134 24 slen Subject sequence length |
135 25 salltitles All subject title(s), separated by '<>' | |
125 ====== ============= =========================================== | 136 ====== ============= =========================================== |
126 | 137 |
127 Beware that the XML file (and thus the conversion) and the tabular output | 138 Beware that the XML file (and thus the conversion) and the tabular output |
128 direct from BLAST+ may differ in the presence of XXXX masking on regions | 139 direct from BLAST+ may differ in the presence of XXXX masking on regions |
129 low complexity (columns 21 and 22), and thus also calculated figures like | 140 low complexity (columns 21 and 22), and thus also calculated figures like |