annotate fastq_name_affixer.xml @ 3:e320ef2d105a draft

Uploaded
author petr-novak
date Thu, 05 Sep 2019 09:04:56 -0400
parents
children c2c69c6090f0
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
3
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
1 <tool id="names_affixer" name="FASTQ Read name affixer" version="1.0.0">
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
2 <description> Tool appending suffix and prefix to sequences names </description>
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
3 <command interpreter="python">
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
4 ${__tool_directory__}/name_affixer.py -f $input -p "$prefix" -s "$suffix" -n $nspace > $output
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
5 </command>
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
6
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
7 <inputs>
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
8 <param format="fastq" type="data" name="input" label="Choose your fastq file" />
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
9 <param name="prefix" type="text" size="10" value="" label="Prefix" help="Enter prefix which will be added to all sequences names" />
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
10 <param name="suffix" type="text" size="10" value="" label="Suffix" help="Enter suffix which will be added to all sequences names"/>
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
11 <param name="nspace" type="integer" size="10" value="0" min="0" max="1000" label="Number of spaces in name to ignore" help="Sequence name is a string before the first space. If you want name to include spaces in name, enter positive integer. All other characters beyond ignored spaces are omitted"/>
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
12 </inputs>
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
13
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
14
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
15 <outputs>
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
16 <data format="fastq" name="output" label="fastq dataset ${input.hid} with modified sequence names" />
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
17 </outputs>
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
18
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
19 <help>
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
20 **What is does**
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
21
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
22 Tool for appending prefix and suffix to sequences names in fastq formated sequences.
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
23
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
24 **Example**
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
25
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
26 The following Solexa-FASTQ file:
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
27
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
28 ::
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
29
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
30 @CSHL_4_FC042GAMMII_2_1_517_596
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
31 GGTCAATGATGAGTTGGCACTGTAGGCACCATCAAT
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
32 +CSHL_4_FC042GAMMII_2_1_517_596
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
33 40 40 40 40 40 40 40 40 40 40 38 40 40 40 40 40 14 40 40 40 40 40 36 40 13 14 24 24 9 24 9 40 10 10 15 40
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
34
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
35 is renamed to:
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
36
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
37 ::
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
38
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
39 @prefixCSHL_4_FC042GAMMII_2_1_517_596suffix
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
40 GGTCAATGATGAGTTGGCACTGTAGGCACCATCAAT
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
41 +prefixCSHL_4_FC042GAMMII_2_1_517_596suffix
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
42 40 40 40 40 40 40 40 40 40 40 38 40 40 40 40 40 14 40 40 40 40 40 36 40 13 14 24 24 9 24 9 40 10 10 15 40
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
43
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
44 different format:
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
45
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
46
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
47 ::
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
48
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
49 @HISEQ1:92:c0190acxx:8:1101:1252:2230 2:N:0:CGATGT
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
50 AGAGGAAAAAACATAGTTCTTGTCTAAAAAAATCCCTTGAAAAAGGGCAGATGTATAGAAATAGAAAATTTCAAAGAAAAACTCTCTACAAATGGAAGAGA
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
51 +
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
52 CCCFFFFFHHHHHJJJJIJJJJJJJJJJJJJJJIJJJJJIIJJJJJJGIJIJIHHHHHHHHFFFFFFDEEEEEDCDDDDDDDCCDDDEDDDDD>CCCCB@9
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
53
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
54 is renamed to:
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
55
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
56 ::
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
57
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
58 @prefixHISEQ1:92:c0190acxx:8:1101:1252:2230suffix
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
59 AGAGGAAAAAACATAGTTCTTGTCTAAAAAAATCCCTTGAAAAAGGGCAGATGTATAGAAATAGAAAATTTCAAAGAAAAACTCTCTACAAATGGAAGAGA
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
60 +
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
61 CCCFFFFFHHHHHJJJJIJJJJJJJJJJJJJJJIJJJJJIIJJJJJJGIJIJIHHHHHHHHFFFFFFDEEEEEDCDDDDDDDCCDDDEDDDDD>CCCCB@9
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
62
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
63 note that string after first space is omitted!
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
64
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
65 Because sequence names sometimes containg spaces which delimit the actual name. By default, anything after spaces is
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
66 excluded from sequences name. In example sequence:
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
67
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
68 ::
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
69
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
70 @SRR352150.23846180 HWUSI-EAS1786:7:119:15910:19280/1
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
71 CTGGATTCTATACCTTTGGCAACTACTTCTTGGTTGATCAGGAAATTAACACTAGTAGTTTAGGCAATTTGGAATGGTGCCAAAGATGTATAGAACTTTC
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
72 +
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
73 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIGIIIHIIIIIFIIIIIIHDHBBIHFIHIIBHHDDHIFHIHIIIHIHGGDFDEI@EGEGFGFEFB@ECG
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
74
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
75 when **Number of spaces in name to ignore** is set to 0 (default) the output will be:
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
76
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
77 ::
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
78
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
79 @prefixSRR352150.23846180suffix
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
80 CTGGATTCTATACCTTTGGCAACTACTTCTTGGTTGATCAGGAAATTAACACTAGTAGTTTAGGCAATTTGGAATGGTGCCAAAGATGTATAGAACTTTC
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
81 +
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
82 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIGIIIHIIIIIFIIIIIIHDHBBIHFIHIIBHHDDHIFHIHIIIHIHGGDFDEI@EGEGFGFEFB@ECG
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
83
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
84 If you want to keep spaces the setting **Number of spaces in name to ignore** to 1 will yield
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
85
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
86 ::
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
87
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
88 @prefixSRR352150.23846180 HWUSI-EAS1786:7:119:15910:19280/1suffix
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
89 CTGGATTCTATACCTTTGGCAACTACTTCTTGGTTGATCAGGAAATTAACACTAGTAGTTTAGGCAATTTGGAATGGTGCCAAAGATGTATAGAACTTTC
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
90 +
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
91 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIGIIIHIIIIIFIIIIIIHDHBBIHFIHIIBHHDDHIFHIHIIIHIHGGDFDEI@EGEGFGFEFB@ECG
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
92
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
93
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
94 </help>
e320ef2d105a Uploaded
petr-novak
parents:
diff changeset
95 </tool>