annotate tools/RepeatMasker.xml @ 0:13df908a02b0

Initial commit
author bjoern-gruening
date Wed, 11 Jan 2012 04:50:59 -0500
parents
children
rev   line source
0
13df908a02b0 Initial commit
bjoern-gruening
parents:
diff changeset
<tool id="repeatmasker_wrapper" name="RepeatMasker" version="0.1">
    <description>Masks different kinds of repeats</description>
    <command>
        ## The command is a Cheetah template, which allows some Python-based syntax.
        ## Lines starting with a double hash are comments. Galaxy turns newlines into spaces.

        ## Create a temporary directory
        #import tempfile, os
        #set $dirname = os.path.abspath(tempfile.mkdtemp())
        #set $input_filename = os.path.split(str($query))[-1]
        #set $output_basename = os.path.join($dirname, $input_filename)

        RepeatMasker
        -parallel 8

        $nolow
        $noint
        $norna

        #if str($species)!="all":
            $species
        #end if

        -dir $dirname

        #if $adv_opts.adv_opts_selector=="advanced":

            #if str($adv_opts.gc)!="0":
                -gc $adv_opts.gc
            #end if

            $adv_opts.gccalc

            #set $output_files_list = str($adv_opts.output_files).split(',')
            #if "gff" in $output_files_list:
                -gff
            #end if
            #if "html" in $output_files_list:
                -html
            #end if

            $adv_opts.slow_search
            $adv_opts.quick_search
            $adv_opts.rush_search
            $adv_opts.only_alus
            $adv_opts.is_only

        #else:
            ## Set defaults
            -gff

        ## End of advanced options:
        #end if

        $query

        > /dev/null 2> /dev/null;
        ## Copy the output files to Galaxy
        #if $adv_opts.adv_opts_selector=="advanced":

            #if "summary" in $output_files_list:
                ## Write out the summary file (default)
                #set $summary_file = $output_basename + '.tbl'
                cp $summary_file $output_summary;
            #end if

            #if "gff" in $output_files_list:
                ## Write out the GFF file (default)
                #set $gff_file = $output_basename + '.out.gff'
                cp $gff_file $output_gff;
            #end if

            #if "html" in $output_files_list:
                ## Write out the HTML file
                #set $html_file = $output_basename + '.out.html'
                cp $html_file $output_html;
            #end if

        #else:

            ## Write out the summary file (default)
            #set $summary_file = $output_basename + '.tbl'
            cp $summary_file $output_summary;

            ## Write out the GFF file (default)
            #set $gff_file = $output_basename + '.out.gff'
            cp $gff_file $output_gff;

        ## End of advanced options:
        #end if

        ## Write out the masked sequence file
        #set $mask_sequence_file = $output_basename + '.masked'
        cp $mask_sequence_file $output_mask;

        ## Write out the standard file (default)
        ## The default '.out' file from RepeatMasker has a 3-line header and uses spaces
        ## rather than tabs. Remove the header and replace runs of spaces with a tab.
        #set $standard_file = $output_basename + '.out'
        tail -n +4 $standard_file | tr -s ' ' '\t' > $output_std;

        ## Delete all temporary files
        rm -r $dirname;

    </command>
    <inputs>
        <param name="query" type="data" format="fasta" label="Nucleotide query sequence(s)"/>

        <param name="nolow" type="boolean" label="No low complexity DNA" truevalue="-nolow" falsevalue="" checked="false" help="Does not mask low-complexity DNA or simple repeats."/>
        <param name="noint" type="boolean" label="No interspersed repeats" truevalue="-noint" falsevalue="" checked="false" help="Only masks low-complexity/simple repeats (no interspersed repeats)."/>

        <param name="norna" type="boolean" label="No small RNA genes" truevalue="-norna" falsevalue="" checked="false" help="Does not mask small RNA (pseudo) genes."/>

        <!--
            Specify the species or clade of the input sequence. The species name
            must be a valid NCBI Taxonomy Database species name and be contained
            in the RepeatMasker repeat database. The following collection is not complete.
        -->
        <param name="species" type="select" label="Species" help="The list is not complete; if you need other species, contact your administrator.">
            <option value="-species anopheles">anopheles</option>
            <option value="-species arabidopsis">arabidopsis</option>
            <option value="-species artiodactyl">artiodactyl</option>
            <option value="-species aspergillus">aspergillus</option>
            <option value="-species carnivore">carnivore</option>
            <option value="-species cat">cat</option>
            <option value="-species chicken">chicken</option>
            <option value="-species 'ciona intestinalis'">ciona intestinalis</option>
            <option value="-species 'ciona savignyi'">ciona savignyi</option>
            <option value="-species cow">cow</option>
            <option value="-species danio">danio</option>
            <option value="-species diatoaea">diatoaea</option>
            <option value="-species dog">dog</option>
            <option value="-species drosophila">drosophila</option>
            <option value="-species elegans">elegans</option>
            <option value="-species fugu">fugu</option>
            <option value="-species fungi" selected="true">fungi</option>
            <option value="-species human">human</option>
            <option value="-species maize">maize</option>
            <option value="-species mammal">mammal</option>
            <option value="-species mouse">mouse</option>
            <option value="-species pig">pig</option>
            <option value="-species rat">rat</option>
            <option value="-species rice">rice</option>
            <option value="-species rodentia">rodentia</option>
            <option value="-species wheat">wheat</option>
        </param>

        <conditional name="adv_opts">
            <param name="adv_opts_selector" type="select" label="Advanced Options">
                <option value="basic" selected="True">Hide Advanced Options</option>
                <option value="advanced">Show Advanced Options</option>
            </param>
            <when value="basic" />
            <when value="advanced">

                <param name="is_only" type="boolean" label="Mask only E. coli insertion elements" truevalue="-is_only" falsevalue="" checked="false" help="Only clips E. coli insertion elements out of FASTA and .qual files."/>

                <param name="slow_search" type="boolean" label="Slow search" truevalue="-s" falsevalue="" checked="false" help="0-5% more sensitive, 2-3 times slower than default."/>
                <param name="quick_search" type="boolean" label="Quick search" truevalue="-q" falsevalue="" checked="false" help="5-10% less sensitive, 2-5 times faster than default."/>
                <param name="rush_search" type="boolean" label="Rush search" truevalue="-qq" falsevalue="" checked="false" help="About 10% less sensitive, 4-10 times faster than default."/>

                <param name="only_alus" type="boolean" label="Only Alus" truevalue="-alu" falsevalue="" checked="false" help="Only masks Alus (and 7SLRNA, SVA and LTR5); only for primate DNA."/>

                <param name="gccalc" type="boolean" label="Use GC-dependent matrices automatically" truevalue="-gccalc" falsevalue="" checked="true" help="RepeatMasker calculates the GC content even for batch files/small seqs."/>

                <param name="output_files" type="select" multiple="true" label="Additional output files">
                    <option selected="true" value="summary">Summary file</option>
                    <option value="gff">GFF file</option>
                    <option value="html">HTML file</option>
                    <option value="mask">Mask FastA file</option>
                </param>

                <param name="gc" type="integer" value="0" label="Use GC-dependent matrices" help="Use matrices calculated for 'number' percentage background GC level.">
                    <validator type="in_range" min="0" />
                    <validator type="in_range" max="100" />
                </param>

            </when>
        </conditional>

    </inputs>
    <outputs>
        <data name="output_std" format="tabular" label="${tool.name} on ${on_string}: Standard" />
        <data name="output_mask" format="fasta" label="${tool.name} on ${on_string}: Mask sequence">
            <filter>
                (adv_opts['adv_opts_selector'] == 'advanced' and 'mask' in adv_opts['output_files'])
            </filter>
        </data>
        <data name="output_summary" format="txt" label="${tool.name} on ${on_string}: Summary">
            <filter>(
                (adv_opts['adv_opts_selector'] == 'advanced' and 'summary' in adv_opts['output_files'])
                or
                (adv_opts['adv_opts_selector'] == 'basic')
                )
            </filter>
        </data>
        <data name="output_html" format="html" label="${tool.name} on ${on_string}: HTML">
            <filter>(adv_opts['adv_opts_selector'] == 'advanced' and 'html' in adv_opts['output_files'])</filter>
        </data>
        <data name="output_gff" format="gff" label="${tool.name} on ${on_string}: GFF">
            <filter>
                (adv_opts['adv_opts_selector'] == 'advanced' and 'gff' in adv_opts['output_files'])
            </filter>
        </data>
    </outputs>
    <requirements>
        <requirement type="binary">RepeatMasker</requirement>
    </requirements>
    <help>

.. class:: warningmark

-----

**What it does**

RepeatMasker is a program that screens DNA sequences for *interspersed repeats*
and *low complexity* DNA sequences. The output of the program is a detailed
annotation of the repeats that are present in the query sequence as well as a
modified version of the query sequence in which all the annotated repeats have
been masked (default: replaced by Ns).

-----

**How to read the results**

The annotation file contains the cross_match output lines. It lists all best matches
(above a set minimum score) between the query sequence and any of the sequences in
the repeat database or with low complexity DNA. The term "best matches" reflects
that a match is not shown if its domain is over 80% contained within the domain
of a higher scoring match, where the "domain" of a match is the region in
the query sequence that is defined by the alignment start and stop. These domains
have been masked in the returned masked sequence file. In the output, matches are
ordered by query name, and for each query by position of the start of the alignment.

Example:

======== ========= ========= ========= ========== =========== ========= ========= ============ =============== =================== ================ ============== ======= ==
SW score perc div. perc del. perc ins. query seq. q-pos begin q-pos end (left)    w complement matching repeat repeat class/family repeat-pos begin repeat-pos end (left)  ID
======== ========= ========= ========= ========== =========== ========= ========= ============ =============== =================== ================ ============== ======= ==
1306     15.6      6.2       0.0       HSU08988   6563        6781      \(22462)  C            MER7A           DNA/MER2_type       336              103            \(0)    1
12204    10.0      2.4       1.8       HSU08988   6782        7714      \(21529)  C            TIGGER1         DNA/MER2_type       2418             1493           \(0)    2
279      3.0       0.0       0.0       HSU08988   7719        7751      \(21492)  +            (TTTTA)n        Simple_repeat       1                33             \(0)    3
1765     13.4      6.5       1.8       HSU08988   7752        8022      \(21221)  C            AluSx           SINE/Alu            289              1              \(23)   4
12204    10.0      2.4       1.8       HSU08988   8023        8694      \(20549)  C            TIGGER1         DNA/MER2_type       1493             827            \(925)  5
1984     11.1      0.3       0.7       HSU08988   8695        9000      \(20243)  C            AluSg           SINE/Alu            305              1              \(5)    6
12204    10.0      2.4       1.8       HSU08988   9001        9695      \(19548)  C            TIGGER1         DNA/MER2_type       827              2              \(1591) 7
711      21.2      1.4       0.0       HSU08988   9696        9816      \(19427)  C            MER7A           DNA/MER2_type       122              2              \(224)  8
======== ========= ========= ========= ========== =========== ========= ========= ============ =============== =================== ================ ============== ======= ==

This is a sequence in which a Tigger1 DNA transposon has integrated into a MER7 DNA transposon copy.
Subsequently, two Alus integrated into the Tigger1 sequence. The simple repeat is derived from the
poly A of the Alu element. The first line is interpreted like this:

:Table description:

1. **1306** = Smith-Waterman score of the match, usually complexity adjusted.
   The SW scores are not always directly comparable. Sometimes
   the complexity adjustment has been turned off, and a variety of
   scoring matrices are used.

#. **15.6** = % substitutions in matching region compared to the consensus
#. **6.2** = % of bases opposite a gap in the query sequence (deleted bp)
#. **0.0** = % of bases opposite a gap in the repeat consensus (inserted bp)
#. **HSU08988** = name of query sequence
#. **6563** = starting position of match in query sequence
#. **6781** = ending position of match in query sequence
#. **(22462)** = no. of bases in query sequence past the ending position of match
#. **C** = match is with the Complement of the consensus sequence in the database
#. **MER7A** = name of the matching interspersed repeat
#. **DNA/MER2_type** = the class of the repeat, in this case a DNA transposon fossil of the MER2 group (see below for list and references)
#. **336** = starting position of match in database sequence (using top-strand numbering)
#. **103** = ending position of match in database sequence
#. **(0)** = no. of bases in (complement of) the repeat consensus sequence prior to beginning of the match (so 0 means that the match extended all the way to the end of the repeat consensus sequence)
#. **1** = Identifier

An asterisk (\*) in the final column (no example shown) indicates that there is
a higher-scoring match whose domain partly (&lt;80%) includes the domain of this match.

Note that the SW score and divergence numbers for the three Tigger1 lines are identical.
This is because the information is derived from a single alignment (the Alus were deleted
from the query before the alignment with the Tigger element was performed).
The program makes educated guesses about whether fragments are derived from
the same element (e.g. it knows that the MER7A fragments represent one insert).
A future version could identify each element with a unique ID, if there is interest
(this could help to represent repeats more cleanly in graphic displays).

-------

**References**

Smit, AFA, Hubley, R and Green, P. RepeatMasker Open-3.0.

http://www.repeatmasker.org/

    </help>
</tool>
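
The wrapper turns RepeatMasker's space-aligned `.out` report into the tabular "Standard" output with `tail -n +4 ... | tr -s ' ' '\t'`. The Python sketch below illustrates the same idea (skip the 3-line header, collapse runs of whitespace into single tabs). It is not part of the tool: the function names `out_to_rows` and `rows_to_tsv` are hypothetical. One difference from the shell pipeline is that `str.split()` discards leading whitespace, whereas `tr` would turn leading spaces on a line into a leading tab.

```python
from typing import List


def out_to_rows(text: str, header_lines: int = 3) -> List[List[str]]:
    """Split a RepeatMasker-style '.out' report into rows of fields.

    Assumes the report starts with `header_lines` header lines (3 in the
    wrapper above) and that fields are separated by runs of whitespace.
    """
    rows = []
    for line in text.splitlines()[header_lines:]:
        fields = line.split()  # collapses whitespace runs, drops leading blanks
        if fields:  # skip empty lines
            rows.append(fields)
    return rows


def rows_to_tsv(rows: List[List[str]]) -> str:
    """Join each row with tabs, one row per line."""
    return "\n".join("\t".join(fields) for fields in rows)


if __name__ == "__main__":
    # Made-up header lines plus the first data row from the help table.
    sample = (
        "   SW   perc perc perc  query\n"
        "score   div. del. ins.  sequence\n"
        "\n"
        " 1306   15.6  6.2  0.0  HSU08988  6563  6781 (22462) C  MER7A  "
        "DNA/MER2_type  336  103  (0)  1\n"
    )
    print(rows_to_tsv(out_to_rows(sample)))
```

Each returned row is a plain list of strings, so a caller can index columns in the order described in the help table, for example `row[4]` for the query name.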