AYUKA: A toolkit for fast viral genotyping using whole genome sequencing

Author:

Guerra-Assunção José AfonsoORCID,Goldstein Richard,Breuer Judith

Abstract

AbstractTechnological advances enabled the frequent use of whole genome sequencing in the clinical microbiology laboratory. While generating data is now easier than ever, the computational resources and expertise required for analysis are still a challenge for clinical applications. Since it is not always possible to collect clinical specimens at the peak viral load, sequencing results are also not always amenable for analysis with bioinformatics pipelines that always require high quality data.Here we present a fast and reliable method, we named AYUKA, for analysis of viral sequencing data that does not require data pre-processing and provides quality control metrics including estimates for sequencing depth and genome coverage, as well as identifying the viral genotypes in a sample and distinguishing mixed infection from recombinants.This method can be applied to any virus where a classification by genotype is employed and determining it is relevant. We generated a validation dataset composed of cultured and sequenced reference adenoviruses from distinct species, that we compared with the gold standard clinical processing pipeline currently implemented to demonstrate reliability. The validation shows better sensitivity than mapping and perfect specificity in detecting the correct genotypes and in a wide range of adenovirus species. Run time was consistently under one minute per sample on a standard laptop, allowing the analysis of more than 100 samples per hour.This open-source method is available at https://github.com/afonsoguerra/AYUKA and precomputed databases are available at https://zenodo.org/record/6521576 allowing analysis of raw data straight from the sequencer within minutes on a standard computer, with minimum setup or expertise required to perform the analysis.The information contained within the AYUKA report can be of use for both the clinical team that collected the sample, but also for guiding the bioinformatics analysis team in the in-depth downstream analyses and genetic epidemiology investigations.

Publisher

Cold Spring Harbor Laboratory

Reference18 articles.

1. ‘BBMap’. 2022. SourceForge. 2022. https://sourceforge.net/projects/bbmap/.

2. Norovirus Transmission Dynamics in a Pediatric Hospital Using Full Genome Sequences

3. ‘Development and Implementation of a Cleaning Standard Algorithm to Monitor the Efficiency of Terminal Cleaning in Removing Adenovirus within a Pediatric Hematopoietic Stem Cell Transplantation Unit’;American Journal of Infection Control,2015

4. ‘GENOMIC INVESTIGATIONS OF ACUTE HEPATITIS OF UNKNOWN AETIOLOGY IN CHILDREN | MedRxiv’. n.d. Accessed 14 August 2022. https://www.medrxiv.org/content/10.1101/2022.07.28.22277963v1.

5. Use of Whole-Genome Sequencing of Adenovirus in Immunocompromised Pediatric Patients to Identify Nosocomial Transmission and Mixed-Genotype Infection

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3