Removes duplicate sequences using one of two modes (below), from the Usearch-Tool-Suite.
File of reads in FASTA format.
A FASTA file containing only unique sequences according to the criteria chosen for the duplicate detection. The identifier line for each sequence states the representative sequence followed by the number of identical sequences found.
e.g. >sequenceXXXX;size=1443;
sequenceXXXX is the representative of 1443 identical sequences.
Author
Robert C. Edgar (bob@drive5.com)
Wrapper Author
QFAB Bioinformatics (support@qfab.org)