annotate tools/metag_tools/short_reads_figure_score.xml @ 1:cdcb0ce84a1b

Uploaded
author xuebing
date Fri, 09 Mar 2012 19:45:15 -0500
parents 9071e359b9a3
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
1 <tool id="quality_score_distribution" name="Build base quality distribution" version="1.0.2">
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
2 <description></description>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
3
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
4 <command interpreter="python">short_reads_figure_score.py $input1 $output1 </command>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
5
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
6 <inputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
7 <page>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
8 <param name="input1" type="data" format="qualsolexa, qual454" label="Quality score file" help="No dataset? Read tip below"/>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
9 </page>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
10 </inputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
11
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
12 <outputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
13 <data name="output1" format="png" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
14 </outputs>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
15 <requirements>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
16 <requirement type="python-module">rpy</requirement>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
17 </requirements>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
18 <tests>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
19 <test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
20 <param name="input1" value="solexa.qual" ftype="qualsolexa" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
21 <output name="output1" file="solexaScore.png" ftype="png" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
22 </test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
23 <test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
24 <param name="input1" value="454.qual" ftype="qual454" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
25 <output name="output1" file="454Score.png" ftype="png" />
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
26 </test>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
27 </tests>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
28 <help>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
29
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
30 .. class:: warningmark
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
31
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
32 To use this tool, your dataset needs to be in the *Quality Score* format. Click the pencil icon next to your dataset to set the datatype to *Quality Score* (see below for examples).
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
33
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
34 -----
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
35
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
36 **What it does**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
37
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
38 This tool takes Quality Files generated by Roche (454), Illumina (Solexa), or ABI SOLiD machines and builds a graph showing score distribution like the one below. Such graph allows you to perform initial evaluation of data quality in a single pass.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
39
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
40 -----
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
41
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
42 **Examples of Quality Data**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
43
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
44 Roche (454) or ABI SOLiD data::
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
45
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
46 &gt;seq1
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
47 23 33 34 25 28 28 28 32 23 34 27 4 28 28 31 21 28
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
48
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
49 Illumina (Solexa) data::
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
50
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
51 -40 -40 40 -40 -40 -40 -40 40
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
52
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
53 -----
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
54
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
55 **Output example**
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
56
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
57 Quality scores are summarized as boxplot (Roche 454 FLX data):
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
58
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
59 .. image:: ./static/images/short_reads_boxplot.png
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
60
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
61 where the **X-axis** is coordinate along the read and the **Y-axis** is quality score adjusted to comply with the Phred score metric. Units on the X-axis depend on whether your data comes from Roche (454) or Illumina (Solexa) and ABI SOLiD machines:
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
62
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
63 - For Roche (454) X-axis (shown above) indicates **relative** position (in %) within reads as this technology produces reads of different lengths;
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
64 - For Illumina (Solexa) and ABI SOLiD X-axis shows **absolute** position in nucleotides within reads.
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
65
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
66 Every box on the plot shows the following values::
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
67
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
68 o &lt;---- Outliers
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
69 o
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
70 -+- &lt;---- Upper Extreme Value that is no more
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
71 | than box length away from the box
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
72 |
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
73 +--+--+ &lt;---- Upper Quartile
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
74 | |
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
75 +-----+ &lt;---- Median
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
76 | |
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
77 +--+--+ &lt;---- Lower Quartile
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
78 |
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
79 |
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
80 -+- &lt;---- Lower Extreme Value that is no more
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
81 than box length away from the box
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
82 o &lt;---- Outlier
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
83
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
84
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
85
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
86 </help>
9071e359b9a3 Uploaded
xuebing
parents:
diff changeset
87 </tool>