Deblur sequences using a 16S positive filter.
Perform sequence quality control for Illumina data using the Deblur
workflow with a 16S reference as a positive filter. Only forward reads are
supported at this time. The specific reference used is the 88% OTUs from
Greengenes 13_8. This mode of operation should only be used when data were
generated from a 16S amplicon protocol on an Illumina platform. The
reference is only used to assess whether each sequence is likely to be 16S
by a local alignment using SortMeRNA with a permissive e-value; the
reference is not used to characterize the sequences.
Parameters
- demultiplexed_seqs : SampleData[SequencesWithQuality | PairedEndSequencesWithQuality | JoinedSequencesWithQuality]
- The demultiplexed sequences to be denoised.
- trim_length : Int
- Sequence trim length, specify -1 to disable trimming.
- left_trim_len : Int % Range(0, None), optional
- Sequence trimming from the 5' end. A value of 0 will disable this trim.
- sample_stats : Bool, optional
- If true, gather stats per sample.
- mean_error : Float, optional
- The mean per nucleotide error, used for original sequence estimate.
- indel_prob : Float, optional
- Insertion/deletion (indel) probability (same for N indels).
- indel_max : Int, optional
- Maximum number of insertion/deletions.
- min_reads : Int, optional
- Retain only features appearing at least min_reads times across all
samples in the resulting feature table.
- min_size : Int, optional
- In each sample, discard all features with an abundance less than
min_size.
- hashed_feature_ids : Bool, optional
- If true, hash the feature IDs.
Returns
- table : FeatureTable[Frequency]
- The resulting denoised feature table.
- representative_sequences : FeatureData[Sequence]
- The resulting feature sequences.
- stats : DeblurStats
- Per-sample stats if requested.