annotate Scan_IUPAC_output_each_match.xml @ 2:2f48cf393d25 draft

Add Perl scripts missing from previous upload.
author pjbriggs
date Mon, 09 Apr 2018 04:56:28 -0400
parents b67ea47730d3
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
1 <?xml version="1.0" encoding="utf-8"?>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
2 <tool id="fasta_scan_iupac_each" name="IUPAC scan and output each match" version="@VERSION@">
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
3 <description>Returns all matches to a given IUPAC in GFF format</description>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
4 <macros>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
5 <import>motif_tools_macros.xml</import>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
6 </macros>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
7 <expand macro="requirements" />
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
8 <command><![CDATA[
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
9 perl $__tool_directory__/Scan_IUPAC_output_each_match.pl $iupac $fasta $output $label $strand
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
10 ]]></command>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
11 <inputs>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
12 <param name="iupac" type="text" label="IUPAC string" value="e.g. WGATAR" help="Enter an IUPAC string." size="20"/>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
13 <param format="fasta" name="fasta" type="data" label="FASTA file" help="Select a FASTA file containing the sequences to be scanned."/>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
14 <param name="label" type="text" label="Attribute in GFF output" value="IUPAC_or_name" help="The label will be included at the end (attibute) section of each GFF line. This could be the IUPAC string used or the name of the motif." size="20"/>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
15 <param name="strand" type="select" label="Select sequence strands to scan" help="Scan either both strands or only the forward strand.">
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
16 <option value="0">Scan both strands</option>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
17 <option value="1">Only scan forward strand</option>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
18 </param>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
19 </inputs>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
20 <outputs>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
21 <data format="gff" name="output" />
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
22 </outputs>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
23 <tests>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
24 <test>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
25 <param name="iupac" value="WGATAR" />
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
26 <param name="fasta" value="phix.fa" />
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
27 <param name="label" value="IUPAC_or_name" />
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
28 <param name="strand" value="0" />
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
29 <output name="output" file="iupac_each_match.gff" />
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
30 </test>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
31 </tests>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
32
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
33 <help>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
34 .. class:: infomark
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
35
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
36 **What it does**
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
37
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
38 This tool will find all matches to a DNA pattern in the input DNA sequence, represented by an IUPAC string. The matches are non-overlapping, so searching with 'TTTT' in 'TTTTTTTT' will find two hits to the IUPAC. The output is in GFF format and the last 'attribute' field can be specified using the 'Label' option.
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
39
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
40 IUPAC = Nucleotide(s):
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
41
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
42 A = A
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
43
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
44 C = C
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
45
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
46 G = G
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
47
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
48 T = T
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
49
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
50 M = A/C
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
51
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
52 R = A/G
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
53
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
54 W = A/T
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
55
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
56 S = C/G
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
57
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
58 Y = C/T
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
59
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
60 K = G/T
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
61
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
62 V = A/C/G
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
63
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
64 H = A/C/T
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
65
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
66 D = A/G/T
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
67
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
68 B = C/G/T
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
69
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
70 N = A/C/G/T
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
71
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
72 ----
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
73
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
74 .. class:: infomark
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
75
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
76 **Options**
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
77
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
78 'IUPAC string' - can be entered as upper- or lower-case as the tool will force them to become upper-case, but will only accept the IUPAC codes listed above.
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
79
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
80 'Attribute in GFF output' - the last field of each GFF line 'attribute' can be specified using the 'Label' option, this should only include letters/numbers, but without spaces.
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
81
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
82 'Select sequence strands to scan' - Only scanning the forward strand of the input sequence is useful if the IUPAC is a palindrome (e.g. CANNTG).
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
83
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
84 ----
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
85
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
86 .. class:: infomark
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
87
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
88 **Credits**
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
89
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
90 This Galaxy tool has been developed within the Bioinformatics Core Facility at the University of Manchester. It runs the Scan_IUPAC_output_each_match.pl Perl script that was written by Ian Donaldson.
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
91
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
92 Please kindly acknowledge both this Galaxy tool and Scan_IUPAC_output_each_match.pl if you use it.
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
93 </help>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
94
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
95 </tool>
b67ea47730d3 Version 1.0.1.
pjbriggs
parents:
diff changeset
96