comparison tools/fastq/fastq_paired_unpaired.xml @ 0:72e9fcaec61f

Migrated tool version 0.0.4 from old tool shed archive to new tool shed repository
author peterjc
date Tue, 07 Jun 2011 17:21:17 -0400
parents
children 7ed81e36fc1c
comparison
equal deleted inserted replaced
-1:000000000000 0:72e9fcaec61f
1 <tool id="fastq_paired_unpaired" name="Divide FASTQ file into paired and unpaired reads" version="0.0.4">
2 <description>using the read name suffices</description>
3 <command interpreter="python">
4 fastq_paired_unpaired.py $input_fastq.extension $input_fastq
5 #if $output_choice_cond.output_choice=="separate"
6 $output_forward $output_reverse
7 #elif $output_choice_cond.output_choice=="interleaved"
8 $output_paired
9 #end if
10 $output_singles
11 </command>
12 <inputs>
13 <param name="input_fastq" type="data" format="fastq" label="FASTQ file to divide into paired and unpaired reads"/>
14 <conditional name="output_choice_cond">
15 <param name="output_choice" type="select" label="How to output paired reads?">
16 <option value="separate">Separate (two FASTQ files, for the forward and reverse reads, in matching order).</option>
17 <option value="interleaved">Interleaved (one FASTQ file, alternating forward read then partner reverse read).</option>
18 </param>
19 <!-- Seems need these dummy entries here, compare this to indels/indel_sam2interval.xml -->
20 <when value="separate" />
21 <when value="interleaved" />
22 </conditional>
23 </inputs>
24 <outputs>
25 <data name="output_singles" format="input" label="Orphan or single reads"/>
26 <data name="output_forward" format="input" label="Forward paired reads">
27 <filter>output_choice_cond["output_choice"] == "separate"</filter>
28 </data>
29 <data name="output_reverse" format="input" label="Reverse paired reads">
30 <filter>output_choice_cond["output_choice"] == "separate"</filter>
31 </data>
32 <data name="output_paired" format="input" label="Interleaved paired reads">
33 <filter>output_choice_cond["output_choice"] == "interleaved"</filter>
34 </data>
35 </outputs>
36 <tests>
37 </tests>
38 <requirements>
39 <requirement type="python-module">Bio</requirement>
40 </requirements>
41 <help>
42
43 **What it does**
44
45 Using the common read name suffix conventions, it divides a FASTQ file into
46 paired reads, and orphan or single reads.
47
48 The input file should be a valid FASTQ file which has been sorted so that
49 any partner forward+reverse reads are consecutive. The output files all
50 preserve this sort order. Pairing are recognised based on standard name
51 suffices. See below or run the tool with no arguments for more details.
52
53 Any reads where the forward/reverse naming suffix used is not recognised
54 are treated as orphan reads. The tool supports the /1 and /2 convention
55 used by Illumina, the .f and .r convention, and the Sanger convention
56 (see http://staden.sourceforge.net/manual/pregap4_unix_50.html for details).
57
58 Note that this does support multiple forward and reverse reads per template
59 (which is quite common with Sanger sequencing), e.g. this which is sorted
60 alphabetically:
61
62 WTSI_1055_4p17.p1kapIBF
63 WTSI_1055_4p17.p1kpIBF
64 WTSI_1055_4p17.q1kapIBR
65 WTSI_1055_4p17.q1kpIBR
66
67 or this where the reads already come in pairs:
68
69 WTSI_1055_4p17.p1kapIBF
70 WTSI_1055_4p17.q1kapIBR
71 WTSI_1055_4p17.p1kpIBF
72 WTSI_1055_4p17.q1kpIBR
73
74 both become:
75
76 WTSI_1055_4p17.p1kapIBF paired with WTSI_1055_4p17.q1kapIBR
77 WTSI_1055_4p17.p1kpIBF paired with WTSI_1055_4p17.q1kpIBR
78
79 </help>
80 </tool>