Mercurial > repos > peterjc > blast_rbh
annotate tools/blast_rbh/blast_rbh.xml @ 6:b2f91cbed8d9 draft default tip
v0.3.0 - Updated to BLAST+ 2.9.0 via conda; removed legacy tool packages.
author | peterjc |
---|---|
date | Tue, 06 Dec 2022 16:03:13 +0000 |
parents | 8f4500f6f2aa |
children |
rev | line source |
---|---|
6
b2f91cbed8d9
v0.3.0 - Updated to BLAST+ 2.9.0 via conda; removed legacy tool packages.
peterjc
parents:
5
diff
changeset
|
1 <tool id="blast_reciprocal_best_hits" name="BLAST Reciprocal Best Hits (RBH)" version="0.3.0"> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
2 <description>from two FASTA files</description> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
3 <requirements> |
6
b2f91cbed8d9
v0.3.0 - Updated to BLAST+ 2.9.0 via conda; removed legacy tool packages.
peterjc
parents:
5
diff
changeset
|
4 <requirement type="package" version="1.77">biopython</requirement> |
b2f91cbed8d9
v0.3.0 - Updated to BLAST+ 2.9.0 via conda; removed legacy tool packages.
peterjc
parents:
5
diff
changeset
|
5 <requirement type="package" version="2.9.0">blast</requirement> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
6 </requirements> |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
7 <version_command> |
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
8 python $__tool_directory__/blast_rbh.py --version |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
9 </version_command> |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
10 <command detect_errors="aggressive"> |
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
11 python $__tool_directory__/blast_rbh.py '$fasta_a' '$fasta_b' |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
12 -a $seq.dbtype |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
13 #if $seq.dbtype=="nucl" |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
14 -t $seq.nucl_type |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
15 #else |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
16 -t $seq.prot_type |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
17 #end if |
1 | 18 $make_nr |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
19 -i $identity |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
20 -c $q_cover |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
21 -o '$output' |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
22 </command> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
23 <inputs> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
24 <!-- Galaxy does not have sub-types for protein vs nucletide FASTA --> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
25 <param name="fasta_a" type="data" format="fasta" |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
26 label="Genes/proteins from species A" |
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
27 help="FASTA file, one sequence per gene/protein."/> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
28 <param name="fasta_b" type="data" format="fasta" |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
29 label="Genes/proteins from species B" |
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
30 help="FASTA file, one sequence per gene/protein."/> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
31 <conditional name="seq"> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
32 <param name="dbtype" type="select" label="Molecule type of FASTA inputs"> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
33 <option value="prot">protein</option> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
34 <option value="nucl">nucleotide</option> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
35 </param> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
36 <when value="prot"> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
37 <param name="prot_type" type="select" display="radio" label="Type of BLAST"> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
38 <option value="blastp">blastp - Traditional BLASTP to compare a protein query to a protein database</option> |
2 | 39 <option value="blastp-fast">blastp-fast - Uses longer words as described by Shiryev et al (2007)</option> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
40 <option value="blastp-short">blastp-short - BLASTP optimized for queries shorter than 30 residues</option> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
41 </param> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
42 </when> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
43 <when value="nucl"> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
44 <param name="nucl_type" type="select" display="radio" label="Type of BLAST"> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
45 <option value="megablast">megablast - Traditional megablast used to find very similar (e.g., intraspecies or closely related species) sequences</option> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
46 <option value="blastn">blastn - Traditional BLASTN requiring an exact match of 11, for somewhat similar sequences</option> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
47 <option value="blastn-short">blastn-short - BLASTN program optimized for sequences shorter than 50 bases</option> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
48 <option value="dc-megablast">dc-megablast - Discontiguous megablast used to find more distant (e.g., interspecies) sequences</option> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
49 <option value="tblastx">tblastx - TBLASTX program using translated query against translated database (protein level matches)</option> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
50 </param> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
51 </when> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
52 </conditional> |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
53 <param name="identity" type="float" value="70" min="0" max="100" |
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
54 label="Minimum percentage identity for BLAST matches" |
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
55 help="Default is 70%, use 0 for no filtering." /> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
56 <param name="q_cover" type="float" value="50" min="0" max="100" |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
57 label="Minimum percentage query coverage for BLAST matches" |
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
58 help="Default is 50%, use 0 for no filtering." /> |
1 | 59 <param name="make_nr" type="boolean" checked="false" truevalue="--nr" falsevalue="" |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
60 label="Process input FASTA files to collapse identical sequences" |
1 | 61 help="i.e. First make the input non-redundant" /> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
62 </inputs> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
63 <outputs> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
64 <data name="output" format="tabular" label="BLAST RBH: $fasta_a.name vs $fasta_b.name" /> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
65 </outputs> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
66 <tests> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
67 <test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
68 <param name="fasta_a" value="four_human_proteins.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
69 <param name="fasta_b" value="rhodopsin_proteins.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
70 <param name="dbtype" value="prot"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
71 <param name="nucl_type" value="blastp"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
72 <param name="identity" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
73 <param name="q_cover" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
74 <output name="output" file="rbh_blastp_four_human_vs_rhodopsin_proteins.tabular" ftype="tabular"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
75 </test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
76 <test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
77 <param name="fasta_a" value="rhodopsin_nucs.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
78 <param name="fasta_b" value="three_human_mRNA.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
79 <param name="dbtype" value="nucl"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
80 <param name="nucl_type" value="megablast"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
81 <param name="identity" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
82 <param name="q_cover" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
83 <output name="output" file="rbh_megablast_rhodopsin_nucs_vs_three_human_mRNA.tabular" ftype="tabular"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
84 </test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
85 <test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
86 <param name="fasta_a" value="rhodopsin_nucs.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
87 <param name="fasta_b" value="three_human_mRNA.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
88 <param name="dbtype" value="nucl"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
89 <param name="nucl_type" value="megablast"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
90 <param name="identity" value="92"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
91 <param name="q_cover" value="86"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
92 <output name="output" file="rbh_megablast_rhodopsin_nucs_vs_three_human_mRNA.tabular" ftype="tabular"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
93 </test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
94 <!-- push the percentage identity over the 92.07% level --> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
95 <test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
96 <param name="fasta_a" value="rhodopsin_nucs.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
97 <param name="fasta_b" value="three_human_mRNA.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
98 <param name="dbtype" value="nucl"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
99 <param name="nucl_type" value="megablast"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
100 <param name="identity" value="92.5"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
101 <param name="q_cover" value="86"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
102 <output name="output" file="rbh_none.tabular" ftype="tabular"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
103 </test> |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
104 <!-- push the coverage over the 86% level --> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
105 <test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
106 <param name="fasta_a" value="rhodopsin_nucs.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
107 <param name="fasta_b" value="three_human_mRNA.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
108 <param name="dbtype" value="nucl"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
109 <param name="nucl_type" value="megablast"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
110 <param name="identity" value="92"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
111 <param name="q_cover" value="87"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
112 <output name="output" file="rbh_none.tabular" ftype="tabular"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
113 </test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
114 <test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
115 <param name="fasta_a" value="rhodopsin_nucs.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
116 <param name="fasta_b" value="three_human_mRNA.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
117 <param name="dbtype" value="nucl"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
118 <param name="nucl_type" value="tblastx"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
119 <param name="identity" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
120 <param name="q_cover" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
121 <output name="output" file="rbh_tblastx_rhodopsin_nucs_vs_three_human_mRNA.tabular" ftype="tabular"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
122 </test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
123 <test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
124 <param name="fasta_a" value="three_human_mRNA.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
125 <param name="fasta_b" value="rhodopsin_nucs.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
126 <param name="dbtype" value="nucl"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
127 <param name="nucl_type" value="blastn"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
128 <param name="identity" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
129 <param name="q_cover" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
130 <output name="output" file="rbh_blastn_three_human_mRNA_vs_rhodopsin_nucs.tabular" ftype="tabular"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
131 </test> |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
132 <!-- this pair of examples test tied best hits --> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
133 <test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
134 <param name="fasta_a" value="k12_ten_proteins.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
135 <param name="fasta_b" value="k12_edited_proteins.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
136 <param name="dbtype" value="prot"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
137 <param name="nucl_type" value="blastp"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
138 <param name="identity" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
139 <param name="q_cover" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
140 <output name="output" file="rbh_blastp_k12.tabular" ftype="tabular"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
141 </test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
142 <test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
143 <param name="fasta_a" value="k12_edited_proteins.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
144 <param name="fasta_b" value="k12_ten_proteins.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
145 <param name="dbtype" value="prot"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
146 <param name="nucl_type" value="blastp"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
147 <param name="identity" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
148 <param name="q_cover" value="0.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
149 <output name="output" file="rbh_blastp_k12.tabular" ftype="tabular"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
150 </test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
151 <!-- this tests self-comparison --> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
152 <test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
153 <param name="fasta_a" value="k12_edited_proteins.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
154 <param name="fasta_b" value="k12_edited_proteins.fasta" ftype="fasta"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
155 <param name="dbtype" value="prot"/> |
2 | 156 <param name="nucl_type" value="blastp-fast"/> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
157 <param name="identity" value="80.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
158 <param name="q_cover" value="80.0"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
159 <output name="output" file="rbh_blastp_k12_self.tabular" ftype="tabular"/> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
160 </test> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
161 </tests> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
162 <help> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
163 **What it does** |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
164 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
165 Takes two FASTA files (*species A* and *species B*), builds a BLAST database |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
166 for each, runs reciprocal BLAST searchs (*A vs B*, and *B vs A*), optionally |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
167 filters the HSPs, and then compiles a list of the reciprocal best hits (RBH). |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
168 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
169 The output from this tool is a tabular file containing multiple columns, with |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
170 information about the BLAST matches used: |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
171 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
172 ====== ================================== |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
173 Column Description |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
174 ------ ---------------------------------- |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
175 1 ID from *species A* |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
176 2 ID from *species B* |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
177 3 Length of sequence *A* |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
178 4 Length of sequence *B* |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
179 5 Percentage of sequence *A* covered |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
180 6 Percentage of sequence *B* covered |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
181 7 HSP alignment length |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
182 8 HSP percentage identity |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
183 9 HSP bitscore |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
184 ====== ================================== |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
185 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
186 These values correspond to the ``qseqid``/``sseqid``, ``qlen``/``slen``, |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
187 ``qcovhsp``, ``length``, ``pident`` and ``bitscore`` values in the BLAST+ |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
188 tabular output. |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
189 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
190 For the alignment length, bitscore and percentage identity the values for |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
191 *A vs B* and *B vs A* are typically the same, so their minimum is shown. |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
192 The coverage values are given by the HSP alignment length divided by the |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
193 sequence length (adjusted by a factor of three for TBLASTX). |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
194 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
195 Note that if a sequence has equally scoring top BLAST matches to multiple |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
196 sequence in the other file, it will not be considered for an RBH. This |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
197 can happen following gene duplication, or for (near) identical gene |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
198 duplicates. |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
199 |
1 | 200 The tool can optionally make the FASTA files non-redundant by replacing |
201 repeated identical sequences with a single representative before building | |
202 the databases and running BLAST. | |
203 | |
204 Finally, the tool can be run using the same FASTA input file to look for | |
205 RBH within the dataset. In this case, self matches are discarded. | |
206 | |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
207 .. class:: warningmark |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
208 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
209 **Note** |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
210 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
211 If you are trying to use BLAST RBH matches to identify candidate orthologues |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
212 or transfer annotation, you *must* use a percentage identity and minimum |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
213 coverage threshold or similiar. See: |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
214 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
215 Punta and Ofran (2008) The Rough Guide to In Silico Function Prediction, |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
216 or How To Use Sequence and Structure Information To Predict Protein |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
217 Function. PLoS Comput Biol 4(10): e1000160. |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
218 https://doi.org/10.1371/journal.pcbi.1000160 |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
219 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
220 The defaults are to require 70% sequence identity over the aligned region |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
221 (using ``pident`` in the BLAST+ tabular output), and that the HSP alignment |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
222 covers at least 50% of the query sequence (using ``qcovhsp`` in the BLAST+ |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
223 tabular output). |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
224 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
225 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
226 **References** |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
227 |
2 | 228 Please cite: |
229 | |
230 P.J.A. Cock, J.M. Chilton, B. Gruening, J.E. Johnson, N. Soranzo (2015). | |
231 NCBI BLAST+ integrated into Galaxy. | |
232 *GigaScience* 4:39 | |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
233 https://doi.org/10.1186/s13742-015-0080-7 |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
234 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
235 Christiam Camacho et al. (2009). |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
236 BLAST+: architecture and applications. |
2 | 237 *BMC Bioinformatics* 15;10:421. |
5
8f4500f6f2aa
Refactored to use more than one Python file (internal change only).
peterjc
parents:
4
diff
changeset
|
238 https://doi.org/10.1186/1471-2105-10-421 |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
239 |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
240 This wrapper is available to install into other Galaxy Instances via the Galaxy |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
241 Tool Shed at http://toolshed.g2.bx.psu.edu/view/peterjc/blast_rbh |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
242 </help> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
243 <citations> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
244 <citation type="doi">10.1186/1471-2105-10-421</citation> |
2 | 245 <citation type="doi">10.1186/s13742-015-0080-7</citation> |
0
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
246 </citations> |
b828ca44a313
Uploaded v0.1.2 (previously only on the Test Tool Shed)
peterjc
parents:
diff
changeset
|
247 </tool> |