Galaxy | Tool Preview

Nucmer (version 4.0.0+galaxy1)
FastA or multi-FastA
FastA or multi-FastA
Select delta format if a plot is needed. Jbrowse is a good choice to view cram and bam tracks
Choose a match anchoring strategy
Set the distance an alignment extension will attempt to extend poor scoring regions before giving up.
Sets the minimum length of a cluster of matches.
Set the maximum diagonal difference between two adjacent anchors in a cluster.
Set the maximum diagonal difference between two adjacent anchors in a cluster as a differential fraction of the gap length.
Do not perform cluster extension step.
Choose a direction of Query Sequence to Use
Set the maximum gap between two adjacent matches in a cluster.
Set the minimum length of a single exact match.
Minimum length of an alignment, after clustering and extension.
No alignment score optimization, i.e. if an alignment extension reaches the end of a sequence, it will not backtrack to optimize the alignment score and instead terminate the alignment at the end of the sequence. (--nooptimize)
Don't simplify alignments by removing shadowed clusters. Use this option when aligning a sequence to itself to look for repeats.

nucmer is for the all-vs-all comparison of nucleotide sequences contained in multi-FastA data files. It is best used for highly similar sequence that may have large rearrangements. Common use cases are: comparing two unfinished shotgun sequencing assemblies, mapping an unfinished sequencing assembly to a finished genome, and comparing two fairly similar genomes that may have large rearrangements and duplications.

All output coordinates reference the forward strand of the involved sequence, regardless of the match direction. Also, nucmer now uses only matches that are unique in the reference sequence by default, use different Anchoring options to change this behavior.

Options::

Defaults in parentheses

nucmer

--sam-long      The original output format of nucmer, the delta format, contains only the minimum information necessary to quickly recreate the alignment.
It contains the name of the matching sequences, the length of the match, number of errors and positions of indels.
With --sam-long, it additionally reports the MD string (which specifies the mismatching positions), the sequence and, if applicable,
the quality values of the matching sequence. The long format is more expensive to compute and it generates larger output files,
but this option allows nucmer4 to match the behavior of other aligners such as Bowtie2 or BWA.

--mum             Use anchor matches that are unique in both the reference and query (false)

--maxmatch        Use all anchor matches regardless of their uniqueness (false)

-b                Set the distance an alignment extension will attempt to extend poor scoring regions
                  before giving up (200)

-c                Sets the minimum length of a cluster of matches (65)

-D                Set the maximum diagonal difference between two adjacent anchors in a cluster (5)

-d                Set the maximum diagonal difference between two adjacent anchors in a cluster as a
                  differential fraction of the gap length (0.12)

--noextend        Do not perform cluster extension step (false)

-f                Use only the forward strand of the Query sequences (false)

-r                Use only the reverse complement of the Query sequences (false)

-g                Set the maximum gap between two adjacent matches in a cluster (90)

-l                Set the minimum length of a single exact match (20)

-L                Minimum length of an alignment, after clustering and extension (0)

--nooptimize      No alignment score optimization, i.e. if an alignment extension reaches the end of a
                  sequence, it will not backtrack to optimize the alignment score and instead terminate
                  the alignment at the end of the sequence (false)

--nosimplify      Don't simplify alignments by removing shadowed clusters. Use this option when aligning
                  a sequence to itself to look for repeats (false)

--banded          Enforce absolute banding of dynamic programming matrix based on diagdiff parameter (false)

--large           Force the use of large offsets (false)

-G                Map genome to genome (long query sequences) (false)

-M                Max chunk. Stop adding sequence for a thread if more than MAX already. (50000)

mummerplot

-b             Highlight alignments with breakpoints further than breaklen nucleotides from the nearest
               sequence end

-color         Color plot lines with a percent similarity gradient or turn off all plot color (default
               color by match dir) If the plot is very sparse, edit the .gp script to plot with
               'linespoints' instead of 'lines'

-c             Generate a reference coverage plot (default for .tiling)

--filter       Only display .delta alignments which represent the "best" hit to any particular spot on
               either sequence, i.e. a one-to-one mapping of reference and query subsequences

--fat          Layout sequences using fattest alignment only

-IdR           Plot a particular reference sequence ID on the X-axis

-IdQ           Plot a particular query sequence ID on the Y-axis

-s             Set the output size to small, medium or large (--small) (--medium) (--large) (default 'small')

--SNP          Highlight SNP locations in each alignment

-title         Specify the gnuplot plot title (default none)

-x             Set the xrange for the plot '[min:max]'

-y             Set the yrange for the plot '[min:max]'

-R             Plot an ordered set of reference sequences from Rfile

-Q             Plot an ordered set of query sequences from Qfile

--layout       Layout a .delta multiplot in an intelligible fashion, this option requires the -R -Q options