Galaxy | Tool Preview

MLST (version 2.22.0)

What it does

Given a genome file in FASTA or Genbank format, MLST will scan the file against PubMLST typing schemes.

Output

MLST will produce a tab-seperated output file which contains: - the filename - the closest PubMLST scheme name - the ST (sequence type) - the allele IDs

Example Output

genomes/6008.fna        saureus         239  arcc(2)   aroe(3)   glpf(1)   gmk_(1)   pta_(4)   tpi_(4)   yqil(3)

Without auto-detection

If you choose to manually set the MLST scheme, it will print a fixed tabular output with a heading containing allele names specific to that scheme (adjustable with Include allele header). To view a list of schemes, use the MLST List tool.

FILE      SCHEME     ST    abcZ  adk  aroE  fumC  gdh  pdhC  pgm
NM003.fa  neisseria  11    2     3    4     3       8     4    6

Missing data

MLST does not just look for exact matches to full length alleles. It attempts to tell you as much as possible about what it found using the notation below:

Symbol Meaning
n Exact intact allele
~n Novel full length allele similar to n
n? Partial match to known allele
n,m Multiple alleles
- Allele missing

Setting Output novel alleles to true will produce an additional novel_alleles.fasta file containing the novel alleles.

Galaxy wrapper maintained by Simon Gladman.