view CADDSuite/galaxyconfigs/tools/InputPartitioner.xml @ 0:bac3c274238f

Migrated tool version 0.93 from old tool shed archive to new tool shed repository
author marcel
date Tue, 07 Jun 2011 16:43:30 -0400


<!--This is a configuration file for the integration of a CADDSuite tool into Galaxy (http://usegalaxy.org). This file was generated automatically by GalaxyConfigGenerator, so extensive manual modifications are not recommended.-->
<tool id="inputpartitioner" name="InputPartitioner" version="1.1">
    <description>split QSAR data set</description>
    <command interpreter="bash"><![CDATA[../../InputPartitioner 
#if str( $i ) != ''  and str( $i ) != 'None' :
   -i "$i"
#end if
#if str( $o ) != ''  and str( $o ) != 'None' :
   -o "$o"
#end if
#if str( $n ) != ''  and str( $n ) != 'None' :
   -n "$n"
#end if
 | tail -n 5
]]></command>
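    <!-- Illustrative sketch, not part of the generated configuration: assuming all three parameters are set, the Cheetah template above would render roughly to the command line below (whitespace collapsed). The file names and the partition count 5 are hypothetical placeholders; Galaxy substitutes its own dataset paths.
         ../../InputPartitioner -i "input.dat" -o "output.dat" -n "5" | tail -n 5
    -->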
    <inputs>
        <param name="i" label="input data-file" optional="false" type="data" format="dat"/>
        <param name="n" label="number of partitions" optional="false" type="text" area="true" size="1x5" value=""/>
    </inputs>
    <outputs>
        <data name="o" format="dat"/>
    </outputs>
    <help>InputPartitioner splits a given QSAR data set into n partitions whose response values are evenly distributed.
This makes the tool useful as part of a nested validation pipeline.
Input is a data file as generated by InputReader.
For each of the n partitions, output is written to files with the suffixes '_TRAIN&lt;i&gt;.dat' and '_TEST&lt;i&gt;.dat', where &lt;i&gt; is the ID of the respective partition. For each partition, the training set contains only those compounds that were not selected for the respective test set.</help>
</tool>