Galaxy | Tool Preview

Show-Coords (version 4.0.0+galaxy1)
Merges overlapping alignments regardless of match dir or frame and does not display any idenitity information. (-b)
Set minimum percent identity to display (-I)
Display the alignment direction in the additional FRM columns (-d)
Set minimum alignment length to display (-L)
Annotate maximal alignments between two sequences, i.e. overlaps between reference and query sequences (-o)

This program parses the delta alignment output of nucmer and displays the coordinates, and other useful information about the alignments.

Output is tabular. Below is a description of each column (Default):
  • [S1] Start of the alignment region in the reference sequence
  • [E1] End of the alignment region in the reference sequence
  • [S2] Start of the alignment region in the query sequence
  • [E2] End of the alignment region in the query sequence
  • [LEN 1] Length of the alignment region in the reference sequence, measured in nucleotides
  • [LEN 2] Length of the alignment region in the query sequence, measured in nucleotides
  • [% IDY] Percent identity of the alignment, calculated as (number of exact matches) / ([LEN 1] + insertions in the query)
  • [LEN R] Length of the reference sequence
  • [LEN Q] Length of the query sequence
  • [COV R] Percent coverage of the alignment on the reference sequence, calculated as [LEN 1] / [LEN R]
  • [COV Q] Percent coverage of the alignment on the query sequence, calculated as [LEN 2] / [LEN Q]
  • [FRM R] Reading frame for the reference sequence (only with -d)
  • [FRM Q] Reading frame for the query sequence (only with -d)
  • [REF TAG] The reference FastA ID
  • [QUERY TAG] The query FastA ID

There is also an optional final column (turned on with the "Annotate" parameter) that will contain some 'annotations'. The Annotate option will annotate alignments that represent overlaps between two sequences. Sometimes, nucmer will extend adjacent clusters past one another, thus causing a somewhat redundant output, this option will notify users of such rare occurrences.

The Percent Coverage and Sequence Length options are useful when comparing two sets of assembly contigs, in that these options help determine if an alignment spans an entire contig, or is just a partial hit to a different read. The Merge option is useful when the user wishes to identify sytenic regions between two genomes, but is not particularly interested in the actual alignment similarity or appearance. This option also disregards match orientation, so should not be used if this information is needed.

Options::

-b      Merges overlapping alignments regardless of match dir or frame and does not display any
        identity information.
-d      Display the alignment direction in the additional FRM columns
-I      Set minimum percent identity to display

-L      Set minimum alignment length to display

-o      Annotate maximal alignments between two sequences, i.e. overlaps between reference and query
        sequences

-q      Sort output lines by query IDs and coordinates

-r      Sort output lines by reference IDs and coordinates