CD-HIT is a widely used program for clustering and comparing protein or nucleotide sequences. CD-HIT is very fast and can handle extremely large databases. CD-HIT helps to reduce sequence redundancy and improve the performance of other sequence analyses. |
hg clone https://toolshed.g2.bx.psu.edu/repos/iuc/cd_hit
Name | Description | Version | Minimum Galaxy Version |
---|---|---|---|
Cluster or compare biological sequence datasets | 4.8.1+galaxy0 | 20.01 |