annotate mytrimmer/trimPE.xml @ 0:68a3648c7d91 draft default tip

Uploaded
author matteoc
date Thu, 22 Dec 2016 04:45:31 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
1 <tool id="trimmer" name="Custom quality trimmer" version="0.">
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
2 <description>trimming tool for Illumina PE data</description>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
3 <command> /home/inmare/galaxy/tools/mytrimmer/trimPE $f1 $f2 $l $ne $disc $o1 $o2 $o3</command>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
4 <description> "approved by the boss" </description>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
5 <inputs>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
6 <param name="f1" type="data" format="fastq" label="R1 reads file" help="fastq only"/>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
7 <param name="f2" type="data" format="fastq" label="R2 reads file" help="fastq only"/>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
8 <param name="l" type="integer" label="minimum read length" value="100" help="don't go too short!"/>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
9 <param name="ne" type="integer" label="max number of errors" value="10" help="usually between 2 and 10"/>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
10 <param name="disc" type="integer" label="cut xx bp from the beginning of each read" value="15" help="have a look to the fastqc report"/>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
11 </inputs>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
12 <outputs>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
13 <data name="o1" ftype="fastq" format="fastq" label="r1 trimmed reads"/>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
14 <data name="o2" ftype="fastq" format="fastq" label="r2 trimmed reads"/>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
15 <data name="o3" ftype="fastq" format="fastq" label="unmated reads"/>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
16 </outputs>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
17 <test/>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
18 <help>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
19 Our custom script for quality trimming implements strict quality filters based on the provided base call quality scores. Reads are iteratively trimmed from the 3' end until all of the following conditions were satisfied:
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
20 1. the median quality score (Qscore) of upstream bases is ≥15
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
21 2. less than 10 bases with Qscore ≤10 and less than 15 bases with Qscore ≤ 20 are present in the upstream sequence
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
22 3. the cumulative error probability in the upstream region is below a user defined cutoff
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
23 4. the length of the trimmed read exceeds a user defined cutoff
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
24
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
25 The program is designed to work with paired end sequencing files only, the output consists in 3 distinct files containing the pairs where both mate passed the filters (output to 2 separate files) and a third file containing all the singleton reads, for which the corresponding mate did not pass the quality filters.
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
26 </help>
68a3648c7d91 Uploaded
matteoc
parents:
diff changeset
27 </tool>