annotate barcode_splitter-bc23f6946bb8/fastx_barcode_splitter.xml @ 0:2b6d577dd1ab default tip

Uploaded
author bccarstens
date Mon, 16 Jan 2012 22:38:10 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
1 <tool id="cshl_fastx_barcode_splitter" name="Barcode Splitter" force_history_refresh="True">
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
2 <description></description>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
3 <requirements><requirement type="package">fastx_toolkit</requirement></requirements>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
4 <command interpreter="python">fastx_barcode_splitter_galaxy_wrapper.py
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
5 ## params for galaxy wrapper
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
6 $output
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
7 "$output.id"
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
8 "$input.ext"
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
9 "$__new_file_path__"
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
10 --barcodes='$barcodes'
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
11 $BARCODE $input "$input.name" "$output.extra_files_path"
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
12 ## params for fastx_barcode_splitter
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
13 --mismatches $mismatches --partial $partial $EOL
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
14 </command>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
15
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
16 <inputs>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
17 <param format="txt" name="BARCODE" type="data" label="Barcodes to use" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
18 <param format="fasta,fastqsanger,fastqsolexa,fastqillumina" name="input" type="data" label="Library to split" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
19
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
20 <param name="EOL" type="select" label="Barcodes found at">
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
21 <option value="--bol">Start of sequence (5' end)</option>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
22 <option value="--eol">End of sequence (3' end)</option>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
23 </param>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
24
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
25 <param name="mismatches" type="integer" size="3" value="2" label="Number of allowed mismatches" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
26
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
27 <param name="partial" type="integer" size="3" value="0" label="Number of allowed barcodes nucleotide deletions" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
28
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
29 <param name="barcodes" type="select" multiple="true" label="Select barcodes to add as new datasets to history">
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
30 <options from_dataset="BARCODE">
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
31 <column name="name" index="0"/>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
32 <column name="value" index="0"/>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
33 <filter type="unique_value" name="unq_bc" column="0" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
34 <filter type="add_value" name="unmatched" value="unmatched"/>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
35 </options>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
36 </param>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
37 </inputs>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
38
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
39 <outputs>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
40 <data format="html" name="output" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
41 </outputs>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
42
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
43 <tests>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
44 <test>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
45 <!-- Split a FASTQ file -->
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
46 <param name="BARCODE" value="fastx_barcode_splitter1.txt" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
47 <param name="input" value="fastx_barcode_splitter1.fastq" ftype="fastqsolexa" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
48 <param name="EOL" value="Start of sequence (5' end)" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
49 <param name="mismatches" value="2" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
50 <param name="partial" value="0" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
51 <output name="output" file="fastx_barcode_splitter1.out" />
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
52 </test>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
53 </tests>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
54
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
55 <help>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
56
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
57 **What it does**
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
58
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
59 This tool splits a Solexa library (FASTQ file) or a regular FASTA file into several files, using barcodes as the split criteria.
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
60
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
61 --------
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
62
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
63 **Barcode file Format**
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
64
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
65 Barcode files are simple text files.
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
66 Each line should contain an identifier (descriptive name for the barcode), and the barcode itself (A/C/G/T), separated by a TAB character.
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
67 Example::
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
68
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
69 #This line is a comment (starts with a 'number' sign)
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
70 BC1 GATCT
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
71 BC2 ATCGT
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
72 BC3 GTGAT
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
73 BC4 TGTCT
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
74
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
75 For each barcode, a new FASTQ file will be created (with the barcode's identifier as part of the file name).
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
76 Sequences matching the barcode will be stored in the appropriate file.
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
77
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
78 One additional FASTQ file will be created (the 'unmatched' file), where sequences not matching any barcode will be stored.
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
79
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
80 The output of this tool is an HTML file, displaying the split counts and the file locations.
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
81
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
82 **Output Example**
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
83
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
84 .. image:: ./static/fastx_icons/barcode_splitter_output_example.png
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
85
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
86 </help>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
87 </tool>
2b6d577dd1ab Uploaded
bccarstens
parents:
diff changeset
88 <!-- FASTX-barcode-splitter is part of the FASTX-toolkit, by A.Gordon (gordon@cshl.edu) -->