view tools/ilmn_pacbio/smrtpipe_filter.xml @ 1:cdcb0ce84a1b

author xuebing
date Fri, 09 Mar 2012 19:45:15 -0500
parents 9071e359b9a3
line wrap: on
line source

<tool id="smrtpipe_filter" name="SMRTpipe Filter" version="1.0.0">
  <description>Produce filtered reads from a set of PacBio primary analysis outputs.</description>
  <command interpreter="python"> --output=data/filtered_subreads.fasta --galaxy_output=${outfile} ${iniFile}
    <conditional name="source">
      <param name="input_source" type="select" label="Choose the source for the analysis inputs">
        <option value="path">Path to fofn or multiple bas.h5 paths</option>
        <option value="history">History</option>
      <when value="path">
        <repeat name="inputFiles" title="Input files">
          <param name="path" type="text" label="File path" size="75"/>
      <when value="history">
        <param name="input1" type="data" format="tabular" label="File containing input paths" />
    <param name="minimum_readlength" type="integer" value="50" label="Minimum raw readlength" />
    <param name="minimum_readscore" type="float" value="0.75" label="Minimum read quality" />
    <configfile name="iniFile">
#if $source.input_source=="history":
#for $l in open($source.input1.file_name,'r'):
#end for
#for $p in $source.inputFiles
#end for
#end if

    <data name="outfile" format="fasta" label="Filtered subreads" />

**What it does**

Filters PacBio bas.h5 files and produces a FASTA file of filtered subreads.

In PacBio SMRT sequencing, the template format is a SMRTbell: a circular
molecule with adapters at two locations in the circle.  The subreads are the
portions of the read between adapters.

**Parameter list**

Minimum readlength
    Only keep reads from ZMWs that produced this many bases or more.

Minimum read quality
    Only keep reads with overall quality scores of this value or more.  The read quality score is a *de novo* prediction of the accuracy of the read.


FASTA file of filtered reads.