annotate short_reads_trim_seq.xml @ 0:f17a1585733b draft

Imported from capsule None
author devteam
date Mon, 19 May 2014 12:34:17 -0400
parents
children 25e6fe525306
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
1 <tool id="trim_reads" name="Select high quality segments" version="1.0.0">
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
2 <description></description>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
3
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
4 <command interpreter="python">
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
5 short_reads_trim_seq.py $trim $length $output1 $input1 $input2 $sequencing_method_choice.input3
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
6 </command>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
7 <inputs>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
8 <page>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
9 <param name="input1" type="data" format="fasta" label="Reads" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
10 <param name="input2" type="data" format="qualsolexa,qual454" label="Quality scores" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
11 <param name="trim" type="integer" size="5" value="20" label="Minimal quality score" help="bases scoring below this value will trigger splitting"/>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
12 <param name="length" type="integer" size="5" value="100" label="Minimal length of contiguous segment" help="report all high quality segments above this length. Setting this option to '0' will cause the program to return a single longest run of high quality bases per read" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
13 <conditional name="sequencing_method_choice">
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
14 <param name="sequencer" type="select" label="Select technology">
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
15 <option value="454">Roche (454) or ABI SOLiD</option>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
16 <option value="Solexa">Illumina (Solexa)</option>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
17 </param>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
18 <when value="454">
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
19 <param name="input3" type="select" label="Low quality bases in homopolymers" help="if set to 'DO NOT trigger splitting' the program will not count low quality bases that are within or adjacent to homonucleotide runs. This will significantly reduce fragmentation of 454 data">
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
20 <option value="yes">DO NOT trigger splitting </option>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
21 <option value="no">trigger splitting</option>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
22 </param>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
23 </when>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
24 <when value="Solexa">
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
25 <param name="input3" type="integer" size="5" value="0" label="Restrict length of each read to" help="('0' = do not trim) The quality of Solexa reads drops towards the end. This option allows selecting the specified number of nucleotides from the beginning and then running the tool." />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
26 </when>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
27 </conditional>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
28 </page>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
29 </inputs>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
30
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
31 <outputs>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
32 <data name="output1" format="fasta" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
33 </outputs>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
34
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
35 <tests>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
36 <test>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
37 <param name="sequencer" value="454" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
38 <param name="input1" value="454.fasta" ftype="fasta" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
39 <param name="input2" value="454.qual" ftype="qual454" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
40 <param name="input3" value="no" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
41 <param name="trim" value="20" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
42 <param name="length" value="0" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
43 <output name="output1" file="short_reads_trim_seq_out1.fasta" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
44 </test>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
45 <test>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
46 <param name="sequencer" value="Solexa" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
47 <param name="input1" value="solexa.fasta" ftype="fasta" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
48 <param name="input2" value="solexa.qual" ftype="qualsolexa" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
49 <param name="input3" value="0" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
50 <param name="trim" value="20" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
51 <param name="length" value="0" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
52 <output name="output1" file="short_reads_trim_seq_out2.fasta" />
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
53 </test>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
54 </tests>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
55
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
56 <help>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
57
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
58 .. class:: warningmark
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
59
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
60 To use this tool, your dataset needs to be in the *Quality Score* format. Click the pencil icon next to your dataset to set the datatype to *Quality Score* (see below for examples).
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
61
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
62 -----
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
63
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
64 **What it does**
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
65
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
66 This tool finds high quality segments within sequencing reads generated by by Roche (454), Illumina (Solexa), or ABI SOLiD machines.
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
67
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
68 -----
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
69
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
70 **Example**
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
71
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
72
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
73 Suppose this is your sequencing read::
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
74
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
75 5'---------*-------------*------**----3'
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
76
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
77 where **dashes** (-) are HIGH quality bases (above 20) and **asterisks** (*) are LOW quality bases (below 20). If the **Minimal length of contiguous segment** is set to **5** (of course, only for the purposes of this example), the tool will return::
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
78
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
79 5'---------
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
80 -------------
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
81 -------
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
82
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
83 you can see that the tool simply splits the read on low quality bases and then returns all segments longer than 5. **Note**, that the output of this tool will likely contain higher number of shorter sequences compared to the original input. If we set the **Minimal length of contiguous segment** to **0**, the tool will only return the single longest segment::
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
84
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
85 -------------
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
86
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
87
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
88
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
89
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
90
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
91
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
92 </help>
f17a1585733b Imported from capsule None
devteam
parents:
diff changeset
93 </tool>