Mash extends the MinHash dimensionality-reduction technique to include a pairwise mutation distance and P value significance test, enabling the efficient clustering and search of massive sequence collections. Mash reduces large sequences and sequence sets to small, representative sketches, from which global mutation distances can be rapidly estimated. |
hg clone https://toolshed.g2.bx.psu.edu/repos/iuc/mash_sketch
Name | Description | Version | Minimum Galaxy Version |
---|---|---|---|
Create a reduced representation of a sequence or set of sequences, based on min-hashes | 2.3+galaxy0 | 19.01 |