D) Plot Cladogram (version 1.0)

What it does

This module produces cladograms representing the LEfSe results (obtained with the corresponding module) on the hierarchy induced by the label names (the levels of the tree must be denoted with "|" in the original input file). The module cannot be used for features that do not have a tree structure.
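As a rough illustration of how "|"-separated label names induce a hierarchy, the sketch below builds a nested dictionary from a few labels. The function name build_hierarchy and the sample labels are hypothetical and are not part of LEfSe; the module's own parsing may differ.

```python
def build_hierarchy(labels, sep="|"):
    """Build a nested dict of clades from separator-delimited label names.

    Hypothetical helper for illustration only; not the LEfSe implementation.
    """
    tree = {}
    for label in labels:
        node = tree
        # Each level of the label becomes one step down the tree.
        for level in label.split(sep):
            node = node.setdefault(level, {})
    return tree


# Made-up example labels using "|" to denote tree levels.
labels = [
    "Bacteria|Firmicutes|Clostridia",
    "Bacteria|Firmicutes|Bacilli",
    "Bacteria|Proteobacteria",
]
tree = build_hierarchy(labels)
```

A label with no "|" simply becomes a single top-level node, which is why features without a tree structure give nothing meaningful to draw as a cladogram.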

A couple of example plots are shown in the Examples section below.


Input format

The module accepts the output of the LEfSe module.


Output format

The module generates images in png, svg or pdf format. The png format is recommended for exploratory runs because it can be viewed directly within Galaxy, whereas the vector formats svg and pdf are recommended for the final, publication-ready image to be downloaded.


Parameters

In addition to the output format and the dpi resolution, three sets of parameters can be tuned: structural parameters affecting the properties of the circular cladogram, text and label options regulating the clade annotation, and graphical options for customizing the appearance of the plot. The default parameter settings should give satisfactory cladograms in the great majority of cases.

Advanced parameter settings

Structural parameters
  • Root of the tree: selects any taxon of the tree to be the root of the cladogram (only the clades below the root will be visualized). Taxa levels are separated by ".", so, for example, Bacteria.Firmicutes will generate only the cladogram of Firmicutes.
  • Expand terminal non-leaf levels: whether to expand a non-leaf taxon that has no children down to the level of the leaves, naming the new levels after the expanded taxon.
  • Maximum number of taxonomic levels: limits the depth of the cladogram to the desired number of levels.
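The effect of the root and maximum-level parameters can be sketched as a filter over "."-separated taxon names. This is only an illustration: the function select_subtree is hypothetical, and two details are assumptions rather than documented behavior, namely that the root itself is kept and that the level limit is counted from the root.

```python
def select_subtree(names, root, max_levels=None, sep="."):
    """Keep taxa at or below `root`, optionally limiting the depth.

    Hypothetical helper, not the LEfSe implementation.
    Assumptions: the root itself is kept, and `max_levels` counts
    levels below the root.
    """
    root_depth = len(root.split(sep))
    kept = []
    for name in names:
        # Match the root exactly or as a full-level prefix, so that
        # "Bacteria.Firmicutes" does not match "Bacteria.Firmicutesx".
        if name != root and not name.startswith(root + sep):
            continue
        depth_below_root = len(name.split(sep)) - root_depth
        if max_levels is not None and depth_below_root > max_levels:
            continue
        kept.append(name)
    return kept


# Made-up taxon names for illustration.
names = [
    "Bacteria",
    "Bacteria.Firmicutes",
    "Bacteria.Firmicutes.Clostridia",
    "Bacteria.Firmicutes.Clostridia.Clostridiales",
    "Bacteria.Proteobacteria",
]
firmicutes_only = select_subtree(names, "Bacteria.Firmicutes")
firmicutes_shallow = select_subtree(names, "Bacteria.Firmicutes", max_levels=1)
```

Here firmicutes_only holds the three Firmicutes entries, and firmicutes_shallow drops the deepest one.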
Text and label options
  • The title of the cladogram: optional title for the plot (default is no title).
  • Title font size: set the font size of the title only.
  • Label font size: set the font size of the labels (and of the label legend) used in the cladogram to denote taxa.
  • Class font size: set the font size of the legend for the class names and colors.
  • Starting level for drawing the labels: you can avoid naming the more internal clades if they are not informative.
  • Last level for drawing the labels: you may want to remove the most external labels as they may be too dense or overlapping.
  • First level for abbreviating the labels: set the starting level for substituting the full clade names with an abbreviation supported by a legend table (recommended for the most external taxa).
  • Last level for abbreviating the labels: set the most external level for substituting the full clade names with an abbreviation supported by a legend table.
Graphical options
  • Number of external levels drawn in the radial representation: the connection between each taxon in the last level and its parent is drawn as a straight edge. The same representation may be used for more internal levels as well.
  • Max dimension of the circles representing taxa: the size of each taxon circle represents its highest logarithmic abundance across classes, and this option sets the maximum diameter of the circles.
  • Min dimension of the circles representing taxa: the diameter of the circle for the taxon with the smallest logarithmic abundance.
  • Width of the edges of the circles representing taxa: the taxon circles have an external border whose thickness is regulated by this option.
  • Width of the lines connecting sibling taxa: set the thickness of the lines connecting sibling taxa in the non-radial representation.
  • Width of the lines connecting parent with son taxa: set the line thickness of the child-parent connection in both the radial and non-radial representations.
  • The alpha value for the transparency of clade labeling: the alpha value controls the transparency of the differential clade highlighting. Since the transparency is additive, the alpha value should not be higher than 1/s, where s is the number of levels with differential clades.
  • Ratio of the horizontal space to be given to the right side: if the label legend requires more space (because of long labels), you can enlarge the right panel by increasing this value.
  • Whether to write the labels with the class color or in black: set whether the clade names inside the cladogram will be displayed with the class color or in black.
  • Background color: whether to generate plots with a black or white background, adjusting the other colors accordingly.
  • Set the space between clades at the lower level: set the separation between low-level taxa belonging to different super-clades. This parameter is adjusted automatically depending on the density of the leaf level.
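The 1/s guideline for the alpha value can be worked through with a small calculation. The helper max_alpha below is hypothetical and only restates the rule given for the transparency option: with s overlapping levels of highlighting, the summed opacity s * alpha should stay at or below 1.

```python
def max_alpha(levels_with_differential_clades):
    """Upper bound on alpha so that stacked highlights stay at or below full opacity.

    Hypothetical helper restating the 1/s guideline; not part of LEfSe.
    """
    if levels_with_differential_clades < 1:
        raise ValueError("need at least one level with differential clades")
    return 1.0 / levels_with_differential_clades


# Example: differential clades found at 4 nested levels.
s = 4
alpha = max_alpha(s)
stacked_opacity = s * alpha  # additive transparency across the 4 levels
```

With four levels of differential clades, alpha should therefore be at most 0.25.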

Examples

The dataset provided here and described in the "Introduction" module produces the following image (the alpha values of LEfSe - step B - are set to 0.01):

Focusing the cladogram on the Firmicutes phylum only and playing a bit with the graphical options, we can obtain the following plot: