Galaxy | Tool Preview

dRep compare (version 3.4.5+galaxy0)
Genome comparison and clusterings
Warnings

dRep compare

dRep performs rapid pair-wise comparison of genome sets.

dRep compare can rapidly and accurately compare a list of genomes in a pair-wise manner. This allows identification of groups of organisms that share similar DNA content in terms of Average Nucleotide Identity (ANI).

dRep performs this in two steps:

  • first with a rapid primary algorithm (Mash)
  • second with a more sensitive algorithm (ANIm).

Mash alone is not enough because, while incredibly fast, it is not robust to genome incompleteness (see Choosing parameters and Module Descriptions) and only provides an “estimate” of ANI. ANIm is robust to genome incompleteness and is more accurate, but it is too slow to perform pair-wise comparisons of longer genome lists.

dRep first compares all genomes using Mash, and then only runs the secondary algorithm (ANIm or gANI) on sets of genomes that have at least 90% Mash ANI. This greatly reduces the number of (slow) secondary comparisons that need to be run while maintaining the sensitivity of ANIm.
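
The effect of the primary filter on the number of secondary comparisons can be illustrated with a small sketch. This is not dRep's code: the function names (primary_clusters, pair_count), the genome labels, and the ANI values are all made up for illustration, and the grouping uses simple single-linkage (any pair at or above the threshold joins two groups), which is an assumption and may differ from the clustering dRep actually applies to Mash distances.

```python
from itertools import combinations


def primary_clusters(genomes, mash_ani, threshold=0.90):
    """Group genomes linked by a Mash ANI >= threshold (single-linkage sketch)."""
    parent = {g: g for g in genomes}

    def find(g):
        # Follow parent links to the group representative, compressing the path.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for a, b in combinations(genomes, 2):
        # Missing pairs are treated as 0.0 ANI (hypothetical convention).
        ani = mash_ani.get((a, b), mash_ani.get((b, a), 0.0))
        if ani >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for g in genomes:
        clusters.setdefault(find(g), []).append(g)
    return sorted(sorted(c) for c in clusters.values())


def pair_count(n):
    """Number of pair-wise comparisons among n genomes."""
    return n * (n - 1) // 2


# Made-up example data: five genomes and a few Mash ANI estimates.
genomes = ["A", "B", "C", "D", "E"]
mash_ani = {
    ("A", "B"): 0.97,
    ("B", "C"): 0.93,
    ("D", "E"): 0.95,
    ("A", "D"): 0.80,
}

clusters = primary_clusters(genomes, mash_ani)
all_pairs = pair_count(len(genomes))
secondary_pairs = sum(pair_count(len(c)) for c in clusters)

print(clusters)         # primary clusters passed on to the secondary step
print(all_pairs)        # comparisons needed without the primary filter
print(secondary_pairs)  # comparisons needed within primary clusters only
```

In this toy case the slow algorithm would only be run within each primary cluster rather than on every pair of genomes; the saving grows quickly as the genome list gets longer.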

INPUTS

  • Genome sets in FASTA format.

OUTPUTS

  • Figures that show the relationships among the input genomes.
  • A warnings report covering two things: similarity among de-replicated genomes and secondary clusters that were almost different.