Galaxy | Tool Preview

dada2: mergePairs (version 1.34.0+galaxy1)
despite the parameter name the sequences don't need to be dereplicated
despite the parameter name the sequences don't need to be dereplicated
The forward and reverse-complemented reverse read are concatenated rather than merged, with a 10 Ns spacer inserted between them
Overhangs are when the reverse read extends past the start of the forward read, and vice-versa, as can happen when reads are longer than the amplicon and read into the other-direction primer region
Contains detailed information on the merged sequences, e.g. number of matched/mismatches

Description

This function attempts to merge each denoised pair of forward and reverse reads, rejecting any pairs which do not sufficiently overlap (at least 12bp by default) or which contain too many (>0 by default) mismatches in the overlap region.

Usage

Input

  • reads and learned error rates of the forward reads
  • reads and learned error rates of the reverse reads

Output

  • a data set of type dada2_mergepairs (which is a RData file containing the output of dada2's mergePairs function).

Details

Overview

The intended use of the dada2 tools for paired sequencing data is shown in the following image.

/repository/static/images/9cdb4e6d6e82ef7f/pairpipe.png

Note: In particular for the analysis of paired collections the collections should be sorted lexicographical before the analysis.

For single end data you the steps "Unzip collection" and "mergePairs" are not necessary.

More information may be found on the dada2 homepage:: https://benjjneb.github.io/dada2/index.html (in particular tutorials) or the documentation of dada2's R package https://bioconductor.org/packages/release/bioc/html/dada2.html (in particular the pdf which contains the full documentation of all parameters)