INSaFLU-TELEVIR: an open web-based bioinformatics suite for viral metagenomic detection and routine genomic surveillance

Author:

Santos João Dourado1,Sobral Daniel1,Pinheiro Miguel2,Isidro Joana1,Bogaardt Carlijn3,Pinto Miguel1,Eusébio Rodrigo1,Santos André1,Mamede Rafael4,Horton Daniel L3,Gomes João Paulo1,consortium* TELEVIR5,Borges Vítor1ORCID

Affiliation:

1. Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal

2. Institute of Biomedicine-iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro, Portugal

3. University of Surrey, Department of Comparative Biomedical Sciences, School of Veterinary Medicine, Surrey, The United Kingdom

4. Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal

5. https://onehealthejp.eu/projects/emerging-threats/jrp-tele-vir (See Declarations section for complete list of consortium authors)

Abstract

Abstract Background Implementation of clinical metagenomics and pathogen genomic surveillance can be particularly challenging due to the lack of bioinformatics tools and/or expertise. In order to face this challenge, we have previously developed INSaFLU (https://insaflu.insa.pt/), a free web-based bioinformatics platform for virus next-generation sequencing data analysis. Here, we considerably expanded its genomic surveillance component and developed a new module (TELEVIR) for metagenomic virus identification. Results The routine genomic surveillance component was strengthened with new workflows and functionalities, including: i) a reference-based genome assembly pipeline for Oxford Nanopore technologies (ONT) data; ii) automated SARS-CoV-2 lineage classification; iii) Nextclade analysis; iv) Nextstrain phylogeographic and temporal analysis (SARS-CoV-2, human and avian influenza, monkeypox, respiratory syncytial virus (RSV A/B), as well as a “generic” build for other viruses); and, v) algn2pheno (https://github.com/insapathogenomics/algn2pheno) for screening mutations of interest. Both INSaFLU pipelines for reference-based consensus generation (Illumina and ONT) were benchmarked against commonly used command line bioinformatics workflows for SARS-CoV-2, and an INSaFLU snakemake version was released. In parallel, a new module (TELEVIR) for virus detection was developed, after extensive benchmarking of state-of-the-art metagenomics software and following up-to-date recommendations and practices in the field. TELEVIR allows running complex workflows, covering several combinations of steps (e.g., with/without viral enrichment or host depletion), classification software (e.g., Kaiju, Kraken2, Centrifuge, FastViromeExplorer) and databases (RefSeq viral genome, Virosaurus, etc), while culminating in user- and diagnosis-oriented reports. Finally, to potentiate real-time virus detection during ONT runs, we developed findONTime (https://github.com/INSaFLU/findONTime), a tool aimed at reducing costs and the time between sample reception and diagnosis. Conclusion The accessibility, versatility and functionality of INSaFLU-TELEVIR is expected to supply public and animal health laboratories and researchers with a user-oriented and pan-viral bioinformatics framework that promotes a strengthened and timely viral metagenomic detection and routine genomics surveillance. INSaFLU-TELEVIR is compatible with Illumina, Ion Torrent and ONT data and is freely available at https://insaflu.insa.pt/ (online tool) and https://github.com/INSaFLU (code).

Funder

Horizon 2020 Framework Programme

Publisher

Research Square Platform LLC

Reference77 articles.

1. Struelens MJ, Brisse S. From molecular to genomic epidemiology: transforming surveillance and control of infectious diseases. Eurosurveillance [Internet]. 2013;18. Available from: https://www.eurosurveillance.org/content/10.2807/ese.18.04.20386-en

2. European Centre for Disease Prevention and Control (ECDC). Expert opinion on whole genome sequencing for public health surveillance. Stockholm: ECDC; 2016.

3. Eyre DW. Infection prevention and control insights from a decade of pathogen whole-genome sequencing. J Hosp Infect [Internet]. 2022;122:180–6. Available from: https://linkinghub.elsevier.com/retrieve/pii/S019567012200041X

4. Chen Z, Azman AS, Chen X, Zou J, Tian Y, Sun R, et al. Global landscape of SARS-CoV-2 genomic surveillance and data sharing. Nat Genet [Internet]. 2022;54:499–507. Available from: https://www.nature.com/articles/s41588-022-01033-y

5. Gardy JL, Loman NJ. Towards a genomics-informed, real-time, global pathogen surveillance system. Nat Rev Genet [Internet]. 2018;19:9–20. Available from: https://www.nature.com/articles/nrg.2017.88

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3