Bridging the gap between molecular and genomic epidemiology in tuberculosis: inferring MIRU-VNTR patterns from genomic data

Author:

Buenestado-Serrano Sergio123ORCID,Martínez-Lirola Miguel4,Dippenaar Anzaan56ORCID,Sanz-Pérez Amadeo12,Garrido-Cárdenas José Antonio7,Esteban-García Ana Belén8,García-Toledo Adriana Justine19,Rodríguez-Grande Cristina12,Herranz-Martín Marta12,Saleeb Sheri M.12,Muñoz Patricia12910ORCID,Warren Robin M.6ORCID,Pérez-Lago Laura12ORCID,García de Viedma Darío129ORCID

Affiliation:

1. Servicio de Microbiología Clínica y Enfermedades Infecciosas, Hospital General Universitario Gregorio Marañón, Madrid, Spain

2. Instituto de Investigación Sanitaria Gregorio Marañón (IiSGM), Madrid, Spain

3. Escuela de Doctorado, Universidad de Alcalá, Plaza de San Diego, Alcalá de Henares, Madrid, Spain

4. Unidad de Gestión de Laboratorios, UGMI, Complejo Hospitalario Torrecárdenas, Almería, Spain

5. Department of Family Medicine and Population Health, Global Health Institute, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium

6. South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa

7. Departamento de Biología y Geología, Universidad de Almería, Almería, Spain

8. Servicio de Análisis de Ácidos Nucleicos, Servicios Centrales de Investigación de la Universidad de Almería, Almería, Spain

9. CIBER de Enfermedades Respiratorias (CIBERES), Servicio Madrileño de Salud, Hospital General Universitario Gregorio Marañón, Madrid, Spain

10. Departamento de Medicina, Universidad Complutense, Madrid, Spain

Abstract

ABSTRACT The transition from MIRU-VNTR-based epidemiology studies in tuberculosis (TB) to genomic epidemiology has transformed how we track transmission. However, short-read sequencing is poor at analyzing repetitive regions such as the MIRU-VNTR loci. This causes a gap between the new genomic data and the large amount of information stored in historical databases. Long-read sequencing could bridge this knowledge gap by allowing analysis of repetitive regions. However, the feasibility of extracting MIRU-VNTRs from long reads and linking them to historical data has not been evaluated. In our study, an in silico arm, consisting of inference of MIRU patterns from long-read sequences (using MIRUReader program), was compared with an experimental arm, involving standard amplification and fragment sizing. We analyzed overall performance on 39 isolates from South Africa and confirmed reproducibility in a sample enriched with 62 clustered cases from Spain. Finally, we ran 25 consecutive incident cases, demonstrating the feasibility of correctly assigning new clustered/orphan cases by linking data inferred from genomic analysis to MIRU-VNTR databases. Of the 3,024 loci analyzed, only 11 discrepancies (0.36%) were found between the two arms: three attributed to experimental error and eight to misassigned alleles from long-read sequencing. A second round of analysis of these discrepancies resulted in agreement between the experimental and in silico arms in all but one locus. Adjusting the MIRUReader program code allowed us to flag potential in silico misassignments due to suboptimal coverage or unfixed double alleles. Our study indicates that long-read sequencing could help address potential chronological and geographical gaps arising from the transition from molecular to genomic epidemiology of tuberculosis. IMPORTANCE The transition from molecular epidemiology in tuberculosis (TB), based on the analysis of repetitive regions (VNTR-based genotyping), to genomic epidemiology transforms in the precision with which we track transmission. However, short-read sequencing, the most common method for performing genomic analysis, is poor at analyzing repetitive regions. This means that we face a gap between the new genomic data and the large amount of information stored in historical databases, which is also an obstacle to cross-national surveillance involving settings where only molecular data are available. Long-read sequencing could help bridge this knowledge gap by allowing analysis of repetitive regions. Our study demonstrates that MIRU-VNTR patterns can be successfully inferred from long-read sequences, allowing the correct assignment of new cases as clustered/orphan by linking new data extracted from genomic analysis to historical MIRU-VNTR databases. Our data may provide a starting point for bridging the knowledge gap between the molecular and genomic eras in tuberculosis epidemiology.

Funder

MEC | Instituto de Salud Carlos III

EC | European Regional Development Fund

Publisher

American Society for Microbiology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3