Large scale genomic analysis of 3067 SARS-CoV-2 genomes reveals a clonal geo-distribution and a rich genetic variations of hotspots mutations

Author:

Laamarti Meriem,Alouane Tarek,Kartti SouadORCID,Chemao-Elfihri M. W.,Hakmi Mohammed,Essabbar Abdelomunim,Laamarti Mohamed,Hlali Haitam,Bendani Houda,Boumajdi Nassma,Benhrif Oussama,Allam Loubna,El Hafidi Naima,El Jaoudi RachidORCID,Allali Imane,Marchoudi Nabila,Fekkak Jamal,Benrahma Houda,Nejjari Chakib,Amzazi Saaid,Belyamani Lahcen,Ibrahimi AzeddineORCID

Abstract

In late December 2019, an emerging viral infection COVID-19 was identified in Wuhan, China, and became a global pandemic. Characterization of the genetic variants of SARS-CoV-2 is crucial in following and evaluating it spread across countries. In this study, we collected and analyzed 3,067 SARS-CoV-2 genomes isolated from 55 countries during the first three months after the onset of this virus. Using comparative genomics analysis, we traced the profiles of the whole-genome mutations and compared the frequency of each mutation in the studied population. The accumulation of mutations during the epidemic period with their geographic locations was also monitored. The results showed 782 variants sites, of which 512 (65.47%) had a non-synonymous effect. Frequencies of mutated alleles revealed the presence of 68 recurrent mutations, including ten hotspot non-synonymous mutations with a prevalence higher than 0.10 in this population and distributed in six SARS-CoV-2 genes. The distribution of these recurrent mutations on the world map revealed that certain genotypes are specific to geographic locations. We also identified co-occurring mutations resulting in the presence of several haplotypes. Moreover, evolution over time has shown a mechanism of mutation co-accumulation which might affect the severity and spread of the SARS-CoV-2. The phylogentic analysis identified two major Clades C1 and C2 harboring mutations L3606F and G614D, respectively and both emerging for the first time in China. On the other hand, analysis of the selective pressure revealed the presence of negatively selected residues that could be taken into considerations as therapeutic targets. We have also created an inclusive unified database (http://covid-19.medbiotech.ma) that lists all of the genetic variants of the SARS-CoV-2 genomes found in this study with phylogeographic analysis around the world.

Funder

EnsSup-covid-07, IRC

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference39 articles.

1. World Health Organization Coronavirus disease (COVID-19) Situation Report– 102, 01 Mai 2020. World Health Organization. 2020. Available from: https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200501-covid-19-sitrep.pdf?sfvrsn=742f4a18_4

2. Coronavirus host divergence and novel coronavirus (Sars-CoV-2) outbreak;K Yeşilbağ;Clinical and Experimental Ocular Trauma and Infection,2020

3. The proximal origin of SARS-CoV-2;KG Andersen;Nat Med,2020

4. A new coronavirus associated with human respiratory disease in China;F Wu;Nature,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3