diff SMART/galaxy/mapperAnalyzer.xml @ 15:440ceca58672

Uploaded
author m-zytnicki
date Mon, 22 Apr 2013 11:08:07 -0400
parents 769e306b7933
children 94ab73e8a190
line wrap: on
line diff
--- a/SMART/galaxy/mapperAnalyzer.xml	Fri Apr 19 10:13:11 2013 -0400
+++ b/SMART/galaxy/mapperAnalyzer.xml	Mon Apr 22 11:08:07 2013 -0400
@@ -1,5 +1,5 @@
 <tool id="mapperAnalyzer" name="mapper analyzer">
-	<description>Read the output of an aligner, print statistics and possibly translate into BED or GBrowse formats. </description>
+	<description>Read the output of an aligner, print statistics and possibly translate into GFF, BED or GBrowse formats. </description>
 	<command interpreter="python">
 		../Java/Python/mapperAnalyzer.py -i $formatType.inputFileName1
 		#if $formatType.FormatInputFileName1 == 'bed':
@@ -89,7 +89,6 @@
 			</when>
 		</conditional>
 
-
 		<conditional name="optionnumber">
 			<param name="number" type="select" label="max. number of occurrences of a sequence">
 					<option value="Yes">Yes</option>
@@ -183,4 +182,21 @@
 		<data name="outputFileGFF" format="gff3" label="[mapperAnalyzer] out file"/>
 	</outputs> 
 	
+	<help>
+Maybe the first program you may use. It reads a set of mapping given by the tool you have used to map your data on the reference genome and translate it to a set of genomic coordinates. You also have the possibility to extract only those that you are interested in (few matches in the genome, few errors in the mapping, etc.). You can also select those reads which map less than a given of times in the genome. Moreover, you can output the data in various different formats, which you can use to visualize them *via* UCSC genome browser or GBrowse. Unmatched reads can be written in an other file, in case you would like to try to map them with another tool (may sometimes work!).
+
+You can filter your data according to:
+
+- number of errors in the mapping
+
+- number of occurrences of the mapping in the genome
+
+- size of the read mapped
+
+- number of gaps in the mapping
+
+The script needs an input file (your mapped reads) together with its format and the read sequences file together with its format (FASTA or FASTQ). If you want, you can also append the results of this script to another GFF3 file. This is useful when the GFF3 file is the result of the mapping using another tool.
+
+By default, any gap in the alignment to the reference sequence is treated like an exon. You can decide to remove this feature by merging short introns (actually, gaps).
+	</help>
 </tool>