0
|
1 <tool id="fasta_affixer" name="FASTA read name affixer" version="1.0.0">
|
9
|
2 <description> Appending suffix and prefix to the read names </description>
|
0
|
3 <command interpreter="python3">
|
|
4 fasta_affixer.py -f $input -p "$prefix" -s "$suffix" -n $nspace -o $output
|
|
5 </command>
|
|
6
|
|
7 <inputs>
|
9
|
8 <param format="fasta" type="data" name="input" label="Choose your FASTA file" />
|
|
9 <param name="prefix" type="text" size="10" value="" label="Prefix" help="Enter prefix which will be added to all read names" />
|
|
10 <param name="suffix" type="text" size="10" value="" label="Suffix" help="Enter suffix which will be added to all read names"/>
|
|
11 <param name="nspace" type="integer" size="10" value="0" min="0" max="1000" label="Number of spaces in the name to ignore" help="By default, a string before the first space is considered to be the read name, and all characters following the space are discarded. If you want to keep the information following the space(s) in the name, enter positive integer."/>
|
0
|
12 </inputs>
|
|
13
|
|
14
|
|
15 <outputs>
|
9
|
16 <data format="fasta" name="output" label="FASTA dataset ${input.hid} with modified sequence names" />
|
0
|
17 </outputs>
|
|
18
|
|
19 <tests>
|
|
20 <test>
|
|
21 <param name="input" value="single_output.fasta" />
|
|
22 <param name="prefix" value="TEST" />
|
|
23 <param name="suffux" value="OK"/>
|
|
24 <param name="nspace" value="0" />
|
|
25 <output name="output" value="prefix_suffix.fasta" />
|
|
26 </test>
|
|
27 </tests>
|
|
28 <help>
|
|
29 **What is does**
|
|
30
|
|
31 Tool for appending prefix and suffix to sequences names in fasta formated sequences. This tool is useful
|
|
32 if you want to do comparative analysis with RepeatExplorer and need to
|
|
33 append sample codes to sequence identifiers
|
|
34
|
|
35 **Example**
|
|
36 The following fasta file:
|
|
37
|
|
38 ::
|
|
39
|
|
40 >123454
|
|
41 acgtactgactagccatgacg
|
|
42 >234235
|
|
43 acgtactgactagccatgacg
|
|
44
|
|
45 is renamed to:
|
|
46
|
|
47 ::
|
|
48
|
|
49 >prefix123454suffix
|
|
50 acgtactgactagccatgacg
|
|
51 >prefix234235suffix
|
|
52 acgtactgactagccatgacg
|
|
53
|
|
54
|
|
55 By default, anything after spaces is
|
|
56 excluded from sequences name. In example sequence:
|
|
57
|
|
58 ::
|
|
59
|
|
60 >SRR352150.23846180 HWUSI-EAS1786:7:119:15910:19280/1
|
|
61 CTGGATTCTATACCTTTGGCAACTACTTCTTGGTTGATCAGGAAATTAACACTAGTAGTTTAGGCAATTTGGAATGGTGCCAAAGATGTATAGAACTTTC
|
|
62 IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIGIIIHIIIIIFIIIIIIHDHBBIHFIHIIBHHDDHIFHIHIIIHIHGGDFDEI@EGEGFGFEFB@ECG
|
|
63
|
|
64 when **Number of spaces in name to ignore** is set to 0 (default) the output will be:
|
|
65
|
|
66 ::
|
|
67
|
|
68 >prefixSRR352150.23846180suffix
|
|
69 CTGGATTCTATACCTTTGGCAACTACTTCTTGGTTGATCAGGAAATTAACACTAGTAGTTTAGGCAATTTGGAATGGTGCCAAAGATGTATAGAACTTTC
|
|
70
|
|
71
|
|
72 If you want to keep spaces the setting **Number of spaces in name to ignore** to 1 will yield
|
|
73
|
|
74 ::
|
|
75
|
|
76 >prefixSRR352150.23846180 HWUSI-EAS1786:7:119:15910:19280/1suffix
|
|
77 CTGGATTCTATACCTTTGGCAACTACTTCTTGGTTGATCAGGAAATTAACACTAGTAGTTTAGGCAATTTGGAATGGTGCCAAAGATGTATAGAACTTTC
|
|
78
|
|
79
|
|
80 </help>
|
|
81 </tool>
|