Galaxy | Tool Preview

Cluster OTU (version 1.0.0)
Size annotation is required; see the description below.
Radius: default is 0.97, which corresponds to a minimum sequence identity of 97%.

Description

Performs OTU clustering using the USEARCH tool suite.


Input

FASTA file with a size annotation in the identifier line of each sequence, e.g. >sequenceXXXX;size=1443;

Size annotation is required.
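
Because the size annotation is required, it may help to check a FASTA file before submitting it. The following is a minimal sketch, not part of the tool or of USEARCH: the function name read_sizes and the regular expression are hypothetical, and it assumes the annotation always takes the form ";size=<integer>" followed by ";" or the end of the identifier line, as in the example above.

```python
import re

# Assumed annotation form: ";size=<integer>" followed by ";" or end of line.
SIZE_RE = re.compile(r";size=(\d+)(?:;|$)")


def read_sizes(fasta_text):
    """Map each sequence label to its size annotation.

    Raises ValueError on the first identifier line without a size annotation.
    """
    sizes = {}
    for line in fasta_text.splitlines():
        if not line.startswith(">"):
            continue  # sequence line, not an identifier line
        match = SIZE_RE.search(line)
        if match is None:
            raise ValueError(f"missing size annotation: {line}")
        # The label is the text between ">" and the first ";".
        label = line[1:].split(";")[0]
        sizes[label] = int(match.group(1))
    return sizes


example = (
    ">sequence0001;size=1443;\n"
    "ACGTACGT\n"
    ">sequence0002;size=12;\n"
    "ACGTTCGT\n"
)
print(read_sizes(example))
```

A file in which any identifier line lacks the annotation causes the sketch to raise a ValueError naming the offending line.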

Parameters

radius
(USEARCH v7.0.1002 and earlier) Specifies the OTU 'radius': the minimum identity between an OTU member sequence and its representative sequence, given as a fractional identity from 0.0 to 1.0. The default is 0.97, which corresponds to a minimum identity of 97%.
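
To make the meaning of the radius concrete, the sketch below compares a member sequence with a representative sequence and tests the result against the threshold. It is illustrative only and is not how USEARCH computes identity: it assumes the two sequences are already aligned to the same length, and it defines identity as the number of matching columns divided by the alignment length. The function names fractional_identity and within_radius are hypothetical.

```python
def fractional_identity(aligned_a, aligned_b):
    """Fraction of alignment columns in which both sequences share a residue.

    Assumes both inputs are already aligned (same length, "-" for gaps).
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return matches / len(aligned_a)


def within_radius(identity, radius=0.97):
    """True if the identity meets the minimum set by the radius (0.0 to 1.0)."""
    if not 0.0 <= radius <= 1.0:
        raise ValueError("radius must be a fraction between 0.0 and 1.0")
    return identity >= radius


representative = "ACGT" * 25            # 100 columns
member = "ACGT" * 24 + "ACGA"           # one mismatch in the last column
identity = fractional_identity(representative, member)
print(identity, within_radius(identity))
```

With one mismatch in 100 columns the identity is 0.99, which meets the default radius of 0.97; four mismatches in 100 columns (0.96) would not.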

Output

FASTA output of the representative sequences.

If the identifier line of the output file contains a "size=XX" annotation, note that this is NOT the number of sequences in the cluster. The annotation is carried over from the input file.


Resources

UPARSE (OTU Clustering)

Author

Robert C. Edgar (bob@drive5.com)

Wrapper Author

QFAB Bioinformatics (support@qfab.org)