Dereplicate sequences.
Dereplicate sequence data and create a feature table and feature
representative sequences. Feature identifiers in the resulting artifacts
will be the sha1 hash of the sequence defining each feature. If clustering
of features into OTUs is desired, the resulting artifacts can be passed to
the cluster_features_* methods in this plugin.
Parameters
- sequences : SampleData[Sequences] | SampleData[SequencesWithQuality] | SampleData[JoinedSequencesWithQuality]
- The sequences to be dereplicated.
- derep_prefix : Bool, optional
- Merge sequences with identical prefixes. If a sequence is identical to
the prefix of two or more longer sequences, it is clustered with the
shortest of them. If they are equally long, it is clustered with the
most abundant.
Returns
- dereplicated_table : FeatureTable[Frequency]
- The table of dereplicated sequences.
- dereplicated_sequences : FeatureData[Sequence]
- The dereplicated sequences.