view sucos_max.xml @ 3:bf99565cec1f draft

"planemo upload for repository commit 8542cbcae3ebed4cb9a6c20b1fabd418a6efb7e8"
author bgruening
date Sat, 28 Mar 2020 05:16:25 -0400
parents 2f110aef9b53
children 85fad59f8168
line wrap: on
line source

<tool id="sucos_max_score" name="Max SuCOS score" version="0.2.1">
    <description>- determine maximum SuCOS score of ligands against clustered fragment hits</description>
    <expand macro="requirements"/>
    <command detect_errors="exit_code"><![CDATA[
        python '$__tool_directory__/'
            -i '$input'
            -o '$output'
        #for $cluster in $clusters
        #end for
        <param name="input" type="data" format="sdf" label="Ligands to be scored" help="Input in SDF format." />
        <param name="clusters" type="data" format="sdf" multiple="true" label="Set of clusters to score against" help="Clusters in SDF format." />
        <data format="sdf" name="output" label="The scored ligands"/>
            <param name="input" ftype="sdf" value="sucos_cluster.sdf"/>
            <param name="clusters" ftype="sdf" value="cluster1.sdf,cluster2.sdf,cluster3.sdf,cluster4.sdf,cluster5.sdf,cluster6.sdf"/>
            <output name="output" ftype="sdf">
                    <has_text text="Max_SuCOS_Score" />
                    <has_text text="Cum_SuCOS_Score" />

.. class:: infomark

**What it does**

This tool determines the maximum SuCOS score of ligands, presumed to be potential follow on compounds, compared to a
set of clustered reference compounds, presumed to be fragment screening hits. Each ligand to be scored is compared to
all of the reference compounds with the highest score being recorded, along with the cluster it came from and the index
of the molecule within that cluster.

The original SuCOS code is on GitHub_ under a MIT license. The SuCOS work is described here_.

.. _GitHub:
.. _here:

.. class:: infomark


The clustered reference compounds are likely to have been generated using the "Cluster ligands using SuCOS" tool and
will comprise a SDF format file for each cluster. The ligands to be scored are supplied in a SDF file.

.. class:: infomark


The same SD file as the input ligands with  the following properties added:

* Max_SuCOS_Score - the best (maximum) SuCOS score
* Max_SuCOS_FeatureMap_Score - the corresponding FeatureMap score
* Max_SuCOS_Protrude_Score - the corresponding Protrude score
* Max_SuCOS_Cluster - the file name of the cluster that contained the max score
* Max_SuCOS_Index - the index of the cluster that contained the max score (first record is index 1)
* Cum_SuCOS_Score - the cumulative SuCOS score for all overlays (the sum of the individual scores)
* Cum_SuCOS_FeatureMap_Score - the corresponding FeatureMap score
* Cum_SuCOS_Protrude_Score - the corresponding Protrude score

    <expand macro="citations"/>