A Systematic Bioinformatics Approach for Mapping the Minimal Set of a Viral Peptidome

Author:

Chong Li Chuin123ORCID,Khan Asif M.124ORCID

Affiliation:

1. Centre for Bioinformatics, School of Data Sciences Perdana University Kuala Lumpur Malaysia

2. Beykoz Institute of Life Sciences and Biotechnology Bezmialem Vakif University Beykoz Turkey

3. Current affiliation: Institute for Experimental Virology TWINCORE Centre for Experimental and Clinical Infection Research, a Medical School Hannover (MHH) and Helmholtz Centre for Infection Research (HZI) joint venture Hannover Germany

4. Current affiliation: College of Computing and Information Technology University of Doha for Science and Technology Doha Qatar

Abstract

AbstractSequence changes in viral genomes generate protein sequence diversity that enables viruses to evade the host immune system, hindering the development of effective preventive and therapeutic interventions. The massive proliferation of sequence data provides unprecedented opportunities to study viral adaptation and evolution. An alignment‐free approach removes various restrictions posed by an alignment‐dependent approach for studying sequence diversity. The publicly available tool, UNIQmin, offers an alignment‐free approach for studying viral sequence diversity at any given rank of taxonomy lineage and is big data ready. The tool performs an exhaustive search to determine the minimal set of sequences required to capture the peptidome diversity within a given dataset. This compression is possible through the removal of identical sequences and unique sequences that do not contribute effectively to the peptidome diversity pool. Herein, we describe a detailed four‐part protocol utilizing UNIQmin to generate the minimal set for the purpose of viral diversity analyses, alignment‐free at any rank of the taxonomy lineage, using the recent global public health threat Monkeypox virus (MPX) sequence data as a case study. The protocol enables a systematic bioinformatics approach to study sequence diversity across taxonomic lineages, which is crucial for our future preparedness against viral epidemics. This is particularly important when data are abundant, freely available, and alignment is not an option. © 2024 Wiley Periodicals LLC.Basic Protocol 1: Tool installation and input file preparationBasic Protocol 2: Generation of a minimal set of sequences for a given datasetBasic Protocol 3: Comparative minimal set analysis across taxonomic lineage ranksBasic Protocol 4: Factors affecting the minimal set of sequences

Publisher

Wiley

Reference45 articles.

1. Dynamics of Influenza A (H5N1) virus protein sequence diversity

2. UniProt: the universal protein knowledgebase in 2021

3. Preclinical evaluation of a vaccine based on conserved region of M protein that prevents group A streptococcal infection;Batzloff M.;The Indian Journal of Medical Research,2004

4. Data quality-aware genomic data integration

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3