Using big sequencing data to identify chronic SARS-Coronavirus-2 infections

Author:

Harari Sheri,Miller Danielle,Fleishon Shay,Burstein David,Stern AdiORCID

Abstract

AbstractThe evolution of SARS-Coronavirus-2 (SARS-CoV-2) has been characterized by the periodic emergence of highly divergent variants, many of which may have arisen during chronic infections of immunocompromised individuals. Here, we harness a global phylogeny of ∼11.7 million SARS-CoV-2 genomes and search for clades composed of sequences with identical metadata (location, age, and sex) spanning more than 21 days. We postulate that such clades represent repeated sampling from the same chronically infected individual. A set of 271 such chronic-like clades was inferred, and displayed signatures of an elevated rate of adaptive evolution, in line with validated chronic infections. More than 70% of adaptive mutations present in currently circulating variants are found in BA.1 chronic-like clades that predate the circulating variants by months, demonstrating the predictive nature of such clades. We find that in chronic-like clades the probability of observing adaptive mutations is approximately 10-20 higher than that in global transmission chains. We next employ language models to find mutations most predictive of chronic infections and use them to infer hundreds of additional chronic-like clades in the absence of metadata and phylogenetic information. Our proposed approach presents an innovative method for mining extensive sequencing data and providing valuable insights into future evolutionary patterns.

Publisher

Cold Spring Harbor Laboratory

Reference51 articles.

1. SARS-CoV-2 Variants of Interest and Concern naming scheme conducive for global discourse;Nat. Microbiol,2021

2. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool;Virus Evol,2021

3. Rambaut, A. et al. Preliminary genomic characterisation of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations. Virological https://virological.org/t/preliminary-genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-the-uk-defined-by-a-novel-set-of-spike-mutations/563 (2020).

4. Drivers of adaptive evolution during chronic SARS-CoV-2 infections;Nat. Med,2022

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3