comparison FastQ_QualConverter.xml @ 1:3990d6b37e2d draft default tip

Uploaded
author geert-vandeweyer
date Thu, 13 Feb 2014 08:24:43 -0500
parents
children
comparison
equal deleted inserted replaced
0:c450731486c8 1:3990d6b37e2d
1 <tool id="fastq_qual_convert" name="FASTQ QualityConverter" version="1.0.4">
2 <description>convert from various base-FASTQ quality formats to fastqsanger</description>
3 <command interpreter="perl">FastQ_QualConverter.pl -i '$input_file' -f '$input_type' -o '$output_file'</command>
4 <inputs>
5 <param name="input_file" type="data" format="fastq" label="File to Convert" />
6 <param name="input_type" type="select" label="Input FASTQ quality scores type">
7 <option value='Auto' selected="True">Auto</option>
8 <option value="solexa">Solexa</option>
9 <option value="illumina">Illumina 1.3-1.7</option>
10 <option value="sanger">Sanger (does nothing)</option>
11 </param>
12 </inputs>
13 <outputs>
14 <data name="output_file" format="fastqsanger">
15 </data>
16 </outputs>
17 <tests>
18 <!-- These tests include test files adapted from supplemental material in Cock PJ, Fields CJ, Goto N, Heuer ML, Rice PM. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res. 2009 Dec 16. -->
19 <!-- Unfortunately, cannot test for expected failures -->
20 <!-- Test basic options -->
21 <test>
22 <param name="input_file" value="sanger_full_range_original_sanger.fastqsanger" ftype="fastq" />
23 <param name="input_type" value="sanger" />
24 <output name="output_file" file="sanger_full_range_original_sanger.fastqsanger" />
25 </test>
26 <test>
27 <param name="input_file" value="illumina_full_range_original_illumina.fastqillumina" ftype="fastq" />
28 <param name="input_type" value="illumina" />
29 <output name="output_file" file="illumina_full_range_as_sanger.fastqsanger" />
30 </test>
31 <test>
32 <param name="input_file" value="solexa_full_range_original_solexa.fastqsolexa" ftype="fastq" />
33 <param name="input_type" value="solexa" />
34 <output name="output_file" file="solexa_full_range_as_sanger.fastqsanger" />
35 </test>
36 </tests>
37 <help>
38 **What it does**
39
40 This tool offers several conversions options relating to the FASTQ format.Output is always fastqsanger. Input can be specified or auto detected (based on first 15000 reads).
41
42 Hopefully it is faster than the default fastq groomer.
43
44
45 -----
46
47 **Quality Score Comparison**
48
49 ::
50
51 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
52 ...............................IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII
53 ..........................XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
54 !"#$%&amp;'()*+,-./0123456789:;&lt;=&gt;?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~
55 | | | | | |
56 33 59 64 73 104 126
57
58 S - Sanger Phred+33, 93 values (0, 93) (0 to 60 expected in raw reads) (sanger = input)
59 I - Illumina 1.3 Phred+64, 62 values (0, 62) (0 to 40 expected in raw reads) (sanger = input - 31)
60 X - Solexa Solexa+64, 67 values (-5, 62) (-5 to 40 expected in raw reads) (sanger = 33 + 10 * log(1 + 10 ** (input) - 64) / 10.0)) / log(10);
61
62 Diagram adapted from http://en.wikipedia.org/wiki/FASTQ_format
63
64 .. class:: infomark
65
66 Output from Illumina 1.8+ pipelines are Sanger encoded.
67
68
69 </help>
70 </tool>