comparison segalign.xml @ 23:e9bf680c292b draft default tip

planemo upload for repository https://github.com/richard-burhans/galaxytools/tree/main/tools/segalign commit a2f684fa36d99b0d8d03f2b802d08b56b8545bc6
author richard-burhans
date Mon, 19 Aug 2024 18:45:15 +0000
parents cefa7625d6cb
children
comparison
equal deleted inserted replaced
22:860e1f6f3de4 23:e9bf680c292b
161 </assert_contents> 161 </assert_contents>
162 </output> 162 </output>
163 </test> 163 </test>
164 </tests> 164 </tests>
165 <help><![CDATA[ 165 <help><![CDATA[
166 SegAlign is a scalable, GPU-accelerated system for computing pairwise WGA. SegAlign is based on the standard seed-filter-extend heuristic, in which the filtering stage dominates the runtime (e.g. 98% for human-mouse WGA), and is accelerated using GPU(s). 166 SegAlign is a scalable, GPU-accelerated system for computing pairwise WGA. SegAlign is based on the standard seed-filter-extend heuristic, in which the filtering stage dominates the runtime (e.g. 98% for human-mouse WGA), and is accelerated using GPU(s). SegAlign was designed as a faster replacement for lastz pairwise aligner.
167 167
168 https://github.com/gsneha26 168 **Using this tool**
169
170 .. class:: warningmark
171
172 This tool is the first part of a two-step process for generation of paiwrise alignments. The output of this tool is used as an input to **Batched LASTZ** tool. The *Batched LASTZ** can be found in the list of tool of this Galaxy instance.
173
174 **What it does**
175
176 SegAlign processes **Target** and **Query** sequences to identify highly similar regions where gapped extension will be performed to create actual alignments. The actual alignments are generated by **Batched LASTZ** that should be run on the output of this tool.
177
178 .. class:: infomark
179
180 Although this tool is only the first part of the alignment process all parameters related to generation of alignments are set **during this stage**.
181
182 **Scoring Options**
183
184 By default the HOXD70 substitution scores are used (from `Chiaromonte et al. 2002 <https://www.ncbi.nlm.nih.gov/pubmed/11928468>`_)::
185
186 bad_score = X:-1000 # used for sub['X'][*] and sub[*]['X']
187 fill_score = -100 # used when sub[*][*] is not defined
188 gap_open_penalty = 400
189 gap_extend_penalty = 30
190
191 A C G T
192 A 91 -114 -31 -123
193 C -114 100 -125 -31
194 G -31 -125 100 -114
195 T -123 -31 -114 91
196
197 Matrix can be supplied as an input to **Read the substitution scores** parameter in *Scoring* section. Substitution matrix can be inferred from your data using another LASTZ-based tool (LASTZ_D: Infer substitution scores).
198
199 **Output Options**
200
201 .. class:: infomark
202
203 The final format in which alignmnets will be generated by **Batched LASTZ** are set here.
204
205 The default output is a MAF alignment file. Other formats can be configured in *Output Options* section. See LASTZ manual <https://lastz.github.io/lastz/#formats>`_ for description of possible formats.
169 ]]></help> 206 ]]></help>
170 <expand macro="citations"/> 207 <expand macro="citations"/>
171 </tool> 208 </tool>