comparison SMART/galaxy/mapperAnalyzer.xml @ 15:440ceca58672

Uploaded
author m-zytnicki
date Mon, 22 Apr 2013 11:08:07 -0400
parents 769e306b7933
children 94ab73e8a190
comparison
equal deleted inserted replaced
14:c79b9ae3f65f 15:440ceca58672
1 <tool id="mapperAnalyzer" name="mapper analyzer"> 1 <tool id="mapperAnalyzer" name="mapper analyzer">
2 <description>Read the output of an aligner, print statistics and possibly translate into BED or GBrowse formats. </description> 2 <description>Read the output of an aligner, print statistics and possibly translate into GFF, BED or GBrowse formats. </description>
3 <command interpreter="python"> 3 <command interpreter="python">
4 ../Java/Python/mapperAnalyzer.py -i $formatType.inputFileName1 4 ../Java/Python/mapperAnalyzer.py -i $formatType.inputFileName1
5 #if $formatType.FormatInputFileName1 == 'bed': 5 #if $formatType.FormatInputFileName1 == 'bed':
6 -f bed 6 -f bed
7 #elif $formatType.FormatInputFileName1 == 'gff3': 7 #elif $formatType.FormatInputFileName1 == 'gff3':
87 <when value="fastq"> 87 <when value="fastq">
88 <param name="inputFileName2" format="fastq" type="data" label="Reference sequence File Format"/> 88 <param name="inputFileName2" format="fastq" type="data" label="Reference sequence File Format"/>
89 </when> 89 </when>
90 </conditional> 90 </conditional>
91 91
92
93 <conditional name="optionnumber"> 92 <conditional name="optionnumber">
94 <param name="number" type="select" label="max. number of occurrences of a sequence"> 93 <param name="number" type="select" label="max. number of occurrences of a sequence">
95 <option value="Yes">Yes</option> 94 <option value="Yes">Yes</option>
96 <option value="No" selected="true">No</option> 95 <option value="No" selected="true">No</option>
97 </param> 96 </param>
181 180
182 <outputs> 181 <outputs>
183 <data name="outputFileGFF" format="gff3" label="[mapperAnalyzer] out file"/> 182 <data name="outputFileGFF" format="gff3" label="[mapperAnalyzer] out file"/>
184 </outputs> 183 </outputs>
185 184
185 <help>
186 Maybe the first program you may use. It reads a set of mapping given by the tool you have used to map your data on the reference genome and translate it to a set of genomic coordinates. You also have the possibility to extract only those that you are interested in (few matches in the genome, few errors in the mapping, etc.). You can also select those reads which map less than a given of times in the genome. Moreover, you can output the data in various different formats, which you can use to visualize them *via* UCSC genome browser or GBrowse. Unmatched reads can be written in an other file, in case you would like to try to map them with another tool (may sometimes work!).
187
188 You can filter your data according to:
189
190 - number of errors in the mapping
191
192 - number of occurrences of the mapping in the genome
193
194 - size of the read mapped
195
196 - number of gaps in the mapping
197
198 The script needs an input file (your mapped reads) together with its format and the read sequences file together with its format (FASTA or FASTQ). If you want, you can also append the results of this script to another GFF3 file. This is useful when the GFF3 file is the result of the mapping using another tool.
199
200 By default, any gap in the alignment to the reference sequence is treated like an exon. You can decide to remove this feature by merging short introns (actually, gaps).
201 </help>
186 </tool> 202 </tool>