Bacterial strain nomenclature in the genomic era: Life Identification Numbers using a gene-by-gene approach-Reference-Cited by-同舟云学术

Bacterial strain nomenclature in the genomic era: Life Identification Numbers using a gene-by-gene approach

Published:2024-03-12 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Palma Federica^ORCID,Hennart Melanie,Jolley Keith A.,Crestani Chiara^ORCID,Wyres Kelly L.^ORCID,Bridel Sebastien,Yeats Corin A.,Brancotte Bryan,Raffestin Brice,David Sophia,Lam Margaret M. C.,Izdebski Radosław,Passet Virginie,Rodrigues Carla,Rethoret-Pasty Martin,Maiden Martin C. J.,Aanensen David M.,Holt Kathryn E.^ORCID,Criscuolo Alexis,Brisse Sylvain^ORCID

Abstract

AbstractUnified strain taxonomies are crucial for fostering international communication in microbiological research and for the epidemiological surveillance of bacterial pathogens. While multilocus sequence typing (MLST) has served as a foundation of strain taxonomy for two decades, whole genome sequencing enables more precise classifications and significantly improves discriminatory resolution. The core genome-wide extension of MLST (known as cgMLST) thus holds great promise for strain genotyping and classification, but its implementation faces challenges that include missing data, potential instability of cluster-based nomenclatures, and the necessity to ensure backwards compatibility with MLST identifiers. Life Identification Number (LIN) codes offer a solution by providing multi-level classification groups that are inherently stable. Here, we present, consolidate, and extend the cgMLST-based LIN code approach. We first develop a nicknaming system for LIN code prefixes, which enables flexible human-readable strain nomenclatures. UsingKlebsiella pneumoniae(Kp) as an example, LIN code nicknames were attributed by inheritance from MLST identifiers, thus perpetuating the legacy of MLST nomenclatures in the genomic era. We show that while 7-gene MLST sometimes conflates unrelated sublineages into the same ST, cgMLST-based LIN codes are highly concordant with phylogenetic relationships. We implement this novel LIN code-based nomenclature in the BIGSdb platform, and illustrate, with Pathogenwatch, how it can also be used in other genomic epidemiology platforms. Finally, we demonstrate the value of LIN codes for tracking the strain diversity within high-risk internationally disseminated clonal groups of Kp and protracted outbreaks. Given its stability, precision, and flexibility, we recommend the adoption of the cgMLST-based LIN code taxonomic approach for Kp and suggest that this approach is widely applicable to other bacterial pathogens.

Publisher

Cold Spring Harbor Laboratory

Reference48 articles.

1. The multilocus sequence typing network: mlst.net

2. Evolution, Population Structure, and Phylogeography of Genetically Monomorphic Bacterial Pathogens

3. EnteroBase: hierarchical clustering of 100 000s of bacterial genomes into species/subspecies and populations

4. Rapid Genomic Characterization and Global Surveillance of Klebsiella Using Pathogenwatch

5. Genomic Definition of Hypervirulent and Multidrug-ResistantKlebsiella pneumoniaeClonal Groups

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Development of the Pneumococcal Genome Library, a core genome multilocus sequence typing scheme, and a taxonomic life identification number barcoding system to investigate and define pneumococcal population structure;Microbial Genomics;2024-08-13

2. Multi-country and intersectoral assessment of cluster congruence between different bioinformatics pipelines for genomics surveillance of foodborne bacterial pathogens;2024-07-25