annotate tools/taxonomy/find_diag_hits.xml @ 1:cdcb0ce84a1b

Uploaded
author xuebing
date Fri, 09 Mar 2012 19:45:15 -0500
parents 9071e359b9a3
children
<tool id="find_diag_hits" name="Find diagnostic hits" version="1.0.0">
  <description></description>
  <requirements>
    <requirement type="package">taxonomy</requirement>
  </requirements>
  <command interpreter="python">find_diag_hits.py $input1 $id_col $rank_list $out_format $out_file1</command>
  <inputs>
    <param format="taxonomy" name="input1" type="data" label="Find diagnostic hits in"/>
    <param name="id_col" type="data_column" data_ref="input1" numerical="False" label="Select column with sequence id" />
    <param name="rank_list" type="select" display="checkboxes" multiple="true" label="select taxonomic ranks">
      <option value="superkingdom">Superkingdom</option>
      <option value="kingdom">Kingdom</option>
      <option value="subkingdom">Subkingdom</option>
      <option value="superphylum">Superphylum</option>
      <option value="phylum">Phylum</option>
      <option value="subphylum">Subphylum</option>
      <option value="superclass">Superclass</option>
      <option value="class">Class</option>
      <option value="subclass">Subclass</option>
      <option value="superorder">Superorder</option>
      <option value="order">Order</option>
      <option value="suborder">Suborder</option>
      <option value="superfamily">Superfamily</option>
      <option value="family">Family</option>
      <option value="subfamily">Subfamily</option>
      <option value="tribe">Tribe</option>
      <option value="subtribe">Subtribe</option>
      <option value="genus">Genus</option>
      <option value="subgenus">Subgenus</option>
      <option selected="true" value="species">Species</option>
      <option value="subspecies">Subspecies</option>
    </param>
    <param name="out_format" type="select" label="Select output format">
      <option value="reads">Diagnostic read list</option>
      <option value="counts">Number of diagnostic reads per taxonomic rank</option>
    </param>
  </inputs>
  <outputs>
    <data format="tabular" name="out_file1" />
  </outputs>
  <tests>
    <test>
      <param name="input1" value="taxonomyGI.taxonomy" ftype="taxonomy"/>
      <param name="id_col" value="1" />
      <param name="rank_list" value="order,genus" />
      <param name="out_format" value="counts" />
      <output name="out_file1" file="find_diag_hits.tabular" />
    </test>
  </tests>

  <help>

**What it does**

Metagenomic analyses often require identifying sequence reads that correspond to a single taxonomic group, in other words, reads that are diagnostic of a particular taxonomic rank. This tool performs that analysis. It takes data generated by *Taxonomy manipulation->Fetch Taxonomic Ranks* as input and outputs either a list of sequence reads unique to a particular taxonomic rank, or a list of taxonomic ranks with the count of unique reads corresponding to each rank.

------

**Example**

Suppose *Taxonomy manipulation->Fetch Taxonomic Ranks* generated the following taxonomy representation::

    read1 2 root Eukaryota Metazoa n n Chordata Craniata Gnathostomata Mammalia n Laurasiatheria n Ruminantia n Bovidae Bovinae n n Bos n Bos taurus n
    read2 12585 root Eukaryota Metazoa n n Chordata Craniata Gnathostomata Mammalia n Euarchontoglires Primates Haplorrhini Hominoidea Hominidae n n n Homo n Homo sapiens n
    read1 58615 root Eukaryota Metazoa n n Arthropoda n Hexapoda Insecta Neoptera Amphiesmenoptera Lepidoptera Glossata Papilionoidea Nymphalidae Nymphalinae Melitaeini Phyciodina Anthanassa n Anthanassa otanes n
    read3 56785 root Eukaryota Metazoa n n Chordata Craniata Gnathostomata Mammalia n Euarchontoglires Primates Haplorrhini Hominoidea Hominidae n n n Homo n Homo sapiens n

Running this tool with the following parameters:

* *Select column with sequence id* set to **c1**
* *Select taxonomic ranks* with **order** and **genus** checked
* *Select output format* set to **Diagnostic read list**

will return::

    read2 Primates order
    read3 Primates order
    read2 Homo genus
    read3 Homo genus

Changing *Select output format* to **Number of diagnostic reads per taxonomic rank** will produce::

    Primates 2 order
    Homo 2 genus

.. class:: infomark

Note that **read1** is omitted because it is non-unique: it hits both a mammal and an insect.
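
The uniqueness rule illustrated above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code of find_diag_hits.py: the names ``diagnostic_reads``, ``RANKS`` and ``EXAMPLE`` are hypothetical, the rank columns are assumed to follow the read id, tax id and root columns in the same order as the rank checkboxes, and a read is assumed to be diagnostic at a rank when all of its rows carry the same value at that rank and that value is not **n**.

```python
from collections import defaultdict

# Assumed column order: read id, tax id, "root", then these 21 ranks.
RANKS = [
    "superkingdom", "kingdom", "subkingdom", "superphylum", "phylum",
    "subphylum", "superclass", "class", "subclass", "superorder", "order",
    "suborder", "superfamily", "family", "subfamily", "tribe", "subtribe",
    "genus", "subgenus", "species", "subspecies",
]

# The four rows of the example, comma-separated here for readability.
EXAMPLE = [
    "read1,2,root,Eukaryota,Metazoa,n,n,Chordata,Craniata,Gnathostomata,Mammalia,n,Laurasiatheria,n,Ruminantia,n,Bovidae,Bovinae,n,n,Bos,n,Bos taurus,n",
    "read2,12585,root,Eukaryota,Metazoa,n,n,Chordata,Craniata,Gnathostomata,Mammalia,n,Euarchontoglires,Primates,Haplorrhini,Hominoidea,Hominidae,n,n,n,Homo,n,Homo sapiens,n",
    "read1,58615,root,Eukaryota,Metazoa,n,n,Arthropoda,n,Hexapoda,Insecta,Neoptera,Amphiesmenoptera,Lepidoptera,Glossata,Papilionoidea,Nymphalidae,Nymphalinae,Melitaeini,Phyciodina,Anthanassa,n,Anthanassa otanes,n",
    "read3,56785,root,Eukaryota,Metazoa,n,n,Chordata,Craniata,Gnathostomata,Mammalia,n,Euarchontoglires,Primates,Haplorrhini,Hominoidea,Hominidae,n,n,n,Homo,n,Homo sapiens,n",
]


def diagnostic_reads(rows, ranks):
    """Return (read id, taxon, rank) tuples for reads unique at each rank."""
    hits = []
    for rank in ranks:
        col = 3 + RANKS.index(rank)  # skip read id, tax id and root
        taxa_per_read = defaultdict(set)
        for fields in rows:
            taxa_per_read[fields[0]].add(fields[col])
        for read_id, taxa in taxa_per_read.items():
            # Diagnostic: exactly one distinct value, and it is not "n".
            if len(taxa) == 1 and "n" not in taxa:
                hits.append((read_id, next(iter(taxa)), rank))
    return hits


rows = [line.split(",") for line in EXAMPLE]
for hit in diagnostic_reads(rows, ["order", "genus"]):
    print("\t".join(hit))
```

For the example input this prints the four lines shown under **Diagnostic read list**; read1 is dropped because its two rows disagree at both the order and the genus rank.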

--------

.. class:: warningmark

This tool skips "**n**" entries, which correspond to ranks missing from the NCBI taxonomy. In the example above, the *Homo sapiens* lineage contains an order name (Primates) while the *Bos taurus* lineage does not.


  </help>
</tool>