view space2underscore_readname.xml @ 1:99ec84eb0bab draft default tip

Uploaded
author arkarachai-fungtammasan
date Wed, 01 Apr 2015 17:00:21 -0400
parents 70f8259b0b30
children
line wrap: on
line source

<tool id="space2underscore_readname" name="Read name modifier" version="1.0.0">
  <description>--change space to underscore of a specific column</description>
  <command interpreter="python">changespacetounderscore_readname.py  $input $output $column_n </command>

  <inputs>
    <param name="input" type="data" label="Select input" />
    <param name="column_n" type="integer" value="6" label="Select column to modify" />
  </inputs>
  <outputs>
    <data format="tabular" name="output" />
    
  </outputs>
  <tests>
    <!-- Test data with valid values -->
    <test>
      <param name="input" value="samplefq.snoope"/>
      <param name="column_n" value="6"/>
      <output name="output" file="samplefq.snoope.new"/>
    </test>
    
  </tests>
  <help>


.. class:: infomark

**What it does**

This tool is used to change space to underscore. For TRFM pipeline (profiling microsatellites in short read data), this tool is used to change space in read name to underscore to prevent the downstream tools which might recognize incorrect column number due to space in read name. If the input do not have space in read name, this step can be skipped.

**Citation**

When you use this tool, please cite **Fungtammasan A, Ananda G, Hile SE, Su MS, Sun C, Harris R, Medvedev P, Eckert K, Makova KD. 2015. Accurate Typing of Short Tandem Repeats from Genome-wide Sequencing Data and its Applications, Genome Research**
 
**Input**

The input files can be any tab delimited file. 

If this tool is used in TRFM microsatellite profiling, it should be in the same format as output from **microsatellite detection program**. This format contains **length of repeat**, **length of left flanking region**, **length of right flanking region**, **repeat motif**, **hamming (editing) distance**, **read name**, **read sequence**, **read quality score**

**Output**

The same as input format.


</help>
</tool>