cgMLST


Last updated 11 months ago

About

cgMLST schemes are based around a community-agreed set of gene loci present in all strains of the species. A database of validated allele sequences is maintained for each locus, and a code is assigned to each allele. An "ST" code is then generated from the unique combination of alleles. The schemes supported by Pathogenwatch are provided by PubMLST, the Pasteur Institute, Enterobase, and the cgMLST.org Nomenclature Server, while an in-house search tool is used to assign the cgMLST profile rapidly and accurately.
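The relationship between allele codes and the "ST" code can be illustrated with a small lookup. This is only a sketch under stated assumptions: the three locus names, the allele codes, and the profile table are hypothetical, and real cgMLST schemes cover far more loci. It is not the Pathogenwatch implementation.

```python
from typing import Dict, Optional, Tuple

# Hypothetical three-locus scheme, for illustration only.
LOCI: Tuple[str, ...] = ("locus_a", "locus_b", "locus_c")

# Hypothetical profile table: combination of allele codes (in locus order) -> ST code.
PROFILES: Dict[Tuple[int, ...], str] = {
    (1, 1, 1): "1",
    (1, 2, 1): "2",
    (3, 2, 5): "3",
}


def assign_st(alleles: Dict[str, int]) -> Optional[str]:
    """Return the ST code for an allele combination, or None if it is not in the table."""
    key = tuple(alleles[locus] for locus in LOCI)
    return PROFILES.get(key)


known = assign_st({"locus_a": 1, "locus_b": 2, "locus_c": 1})
novel = assign_st({"locus_a": 3, "locus_b": 2, "locus_c": 1})
print(known, novel)
```

A combination that is missing from the table has no ST code in this sketch, which corresponds to a novel profile.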

If your profile includes novel alleles or a novel MLST code, we recommend visiting the source database linked in the results page to submit your genome there. The assignments generated there will subsequently be imported into Pathogenwatch at the next update.

Method

The assembly is searched for exact matches to known alleles. A representative set of alleles for each locus is then searched for using BLAST. The results of these searches are combined and filtered based on the similarity and length of each match. Novel alleles are hashed using the SHA-1 algorithm, and the resulting hash is used as their unique identifier. Profiles are assigned based on the combination of alleles detected. Novel profiles are also given a unique identifier using the SHA-1 hash algorithm.
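A minimal sketch of how SHA-1 can yield stable identifiers for novel alleles and profiles is shown below. The function names are hypothetical, and the upper-casing of the sequence and the underscore-joined profile string are assumptions made for illustration; the exact input that Pathogenwatch hashes is not described here.

```python
import hashlib
from typing import Sequence


def allele_id(sequence: str) -> str:
    """SHA-1 hex digest of an allele sequence (upper-casing is an assumption)."""
    return hashlib.sha1(sequence.upper().encode("ascii")).hexdigest()


def profile_id(allele_codes: Sequence[str]) -> str:
    """SHA-1 hex digest of the allele codes joined in locus order.

    The underscore separator is an assumption, not a documented format.
    """
    return hashlib.sha1("_".join(allele_codes).encode("ascii")).hexdigest()


novel_allele = allele_id("ATGGCTAAAGGTTAA")
# A profile mixing known allele codes with the hash of a novel allele.
novel_profile = profile_id(["1", "7", novel_allele])
print(novel_allele)
print(novel_profile)
```

Because the identifier is derived from the content alone, the same novel sequence receives the same identifier wherever it is found, without needing a central registry to issue a code.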

Results

The cgMLST results are not displayed directly, but are available as a download from both collection and genome selection download menus (for more details see "Downloads"). The results also serve as the basis for the cgMLST clustering method for quickly finding closely related assemblies.

How to cite

Please cite the resource which hosts the cgMLST scheme. The host of the scheme should be linked in individual genome reports. Please contact us if you have any questions.

The software is available under an OSS licence from https://github.com/pathogenwatch-oss/mlst and https://github.com/pathogenwatch-oss/typing-databases.