Pathogenwatch
  • Welcome to Pathogenwatch
  • 🎉Announcements
  • ▶️A "Getting Started" Tutorial
  • 🎦Video Tutorials
  • 🧐Useful Links
  • 📖How to use Pathogenwatch
    • Uploading Genomes
    • Genome Reports
    • Browsing Genomes
    • Editing Metadata
    • 🚮Deleting genomes
    • Downloads
    • Creating A Collection
    • Browsing Collections
    • Sharing a collection
    • Genomic Context Search
    • Using The Interactive Collection Views
      • The Map View
      • The Tree Viewer
      • The Filter Bar
      • The Metadata Tables
        • Uploaded Metadata
        • Typing Results
        • Genome Statistics
        • Antimicrobial Resistance
    • Private Metadata
  • 📖Technical Descriptions
    • Species Assignment
      • Speciator
    • Sequence Typing Methods
      • cgMLST
      • Genotyphi
      • Kaptive
      • Kleborate
      • Klebsiella LIN Codes
      • MLST
      • NG-MAST
      • Pangolin
      • PopPUNK
      • SeroBA
      • Vista
      • SISTR
    • Antimicrobial Resistance Prediction
      • SPN-PBP-AMR
      • Kleborate
      • Pathogenwatch AMR
    • Inctyper
    • cgMLST Clustering
    • SARS-CoV-2 Notable Mutations
    • SARS-CoV-2 Genome Tree
    • Core Genome Tree
      • Core Assignment
      • Reference Assignment
      • Core Filter
      • Tree Construction
    • Short Read Assembly
  • ❓FAQ
  • 💾Public data downloads
  • 💊WHO bacterial priority pathogens
  • 📜Release Notes 2025
  • Release Notes 2024
  • Release Notes 2023
  • Release Notes 2022
  • Release Notes 2019-2021
  • ⚠️Privacy and Terms Of Service
  • 📣How to cite
  • 🙏Acknowledgements
  • ❗Report an Issue
Powered by GitBook
On this page
  • About cgMLST-based Neighbour Finding
  • Using the Clustering Panel
  1. How to use Pathogenwatch

Genomic Context Search

Using the cgMLST clustering facility to identify related strains

PreviousSharing a collectionNextUsing The Interactive Collection Views

Last updated 3 years ago

About cgMLST-based Neighbour Finding

For species where there is a cgMLST scheme available, we also provide a single-linkage clustering-based service for finding strains within a specified threshold. The cgMLST search tool can be found in the pop-up genome report. For technical details about how cgMLST allele and ST codes are assigned, follow the links to the and .

Click on the genome name in any view to pop up the Genome Report and if there is a cgMLST scheme available the clustering panel will be visible in it.

Using the Clustering Panel

The first time a genome is uploaded, it is often necessary to first run the clustering by clicking on the "Cluster Now" button. This can take a few minutes, depending on server usage and the size of the data set involved. If you're looking at an old record but you've uploaded new genomes since you last ran the clustering, you can add these in by pressing the "Re-cluster" button.

The dark purple node includes the query sequence. The lighter purple nodes are directly connected to the query. Grey nodes are linked to the query via at least one other node.

The resulting network shows genomes linked to the query genome by fewer differences than the selected distance threshold (number of different alleles). This threshold can be modified using the slider under the chart showing the number of genomes at each value. Each node in the network represents one or more genomes that either connect directly to the query genome at less or equal to the threshold or can be linked to it via a chain of connected genomes.

This network can be opened in a collection view and the related genomes investigated ("View Cluster"). Alternatively the list of linked genomes can be opened in the view, from where detailed analysis downloads and are available ("List Genomes"). Note that you will need to click the "Clear Filters" button to reset the Genome List view afterwards.

📖
cgMLST
clustering documentation
tree-based collections
Genome List