view raceid_inspectclusters.xml @ 0:9fec5dd8fbb9 draft

planemo upload for repository commit f880060c478d42202df5b78a81329f8af56b1138
author iuc
date Thu, 22 Nov 2018 04:45:41 -0500
children 20f522154663
line wrap: on
line source

<tool id="raceid_inspectclusters" name="Cluster Inspection using RaceID" version="@VERSION_RACEID@.@VERSION_PACKAGE@.1" >
    <description>examines gene expression within clusters</description>
    <expand macro="requirements" />
Rscript '$__tool_directory__/scripts/clusterinspect.R' @GET_VERSION@

    <command detect_errors="exit_code"><![CDATA[
#set bin = 'clusterinspect.R'
Rscript '$__tool_directory__/scripts/$bin' '$userconf' 2>&1 > '$outlog'
        <configfile name="userconf" ><![CDATA[

        <expand macro="inspectclusters_inputs" />

        <data name="outpdf" format="pdf" label="${} on ${on_string}: PDF Report" />
        <data name="outlog" format="txt" label="${} on ${on_string}: Log" >
        <expand macro="inspectclusters_tests" />


RaceID is a clustering algorithm for the identification of cell types from single-cell RNA-sequencing data. It was specifically designed for the detection of rare cells which correspond to outliers in conventional clustering methods.

This module inspects the clusters generated from the previous clustering step (and requires the output RData file from it as input).

The tool offers three modes of inspection which can all be activated at the same time, resulting in a single PDF report:

 * Plot All Clusters:
   * Produces a tSNE of all clusters, as well as a force-directed Fruchterman-Reingold (F-R) layout which may have tidier plots.
 * Perform Subset Analysis:
   * tSNE and F-R plots with cells whose name match the specified regex highlighted
 * Examine Genes of Interest:
   * Expression plots highlighting a gene or genes of interest across all clusters

 * Differential Gene Testing: 
   * Examining the expression between 

 * A list of the most differentially expressed genes in each cluster
 * An output PDF that provides heatmaps for:
    * The initial and final clustering (as determined using random forest)
    * Heatmaps for each of the most differentially expressed genes in each cluster

The tool requires the RData input from the previous filtering / normalisation / confounder removal step to work.

    <expand macro="citations" />