LoReTTA, a user-friendly tool for assembling viral genomes from PacBio sequence data

Author:

Al Qaffas Ahmed1,Nichols Jenna2,Davison Andrew J2,Ourahmane Amine1,Hertel Laura3,McVoy Michael A1,Camiolo Salvatore2ORCID

Affiliation:

1. Department of Pediatrics, Virginia Commonwealth University, Richmond, VA, USA

2. MRC-University of Glasgow Centre for Virus Research, Glasgow, UK

3. Department of Pediatrics, School of Medicine, University of California San Francisco, Oakland, CA, USA

Abstract

Abstract Long-read, single-molecule DNA sequencing technologies have triggered a revolution in genomics by enabling the determination of large, reference-quality genomes in ways that overcome some of the limitations of short-read sequencing. However, the greater length and higher error rate of the reads generated on long-read platforms make the tools used for assembling short reads unsuitable for use in data assembly and motivate the development of new approaches. We present LoReTTA (Long Read Template-Targeted Assembler), a tool designed for performing de novo assembly of long reads generated from viral genomes on the PacBio platform. LoReTTA exploits a reference genome to guide the assembly process, an approach that has been successful with short reads. The tool was designed to deal with reads originating from viral genomes, which feature high genetic variability, possible multiple isoforms, and the dominant presence of additional organisms in clinical or environmental samples. LoReTTA was tested on a range of simulated and experimental datasets and outperformed established long-read assemblers in terms of assembly contiguity and accuracy. The software runs under the Linux operating system, is designed for easy adaptation to alternative systems, and features an automatic installation pipeline that takes care of the required dependencies. A command-line version and a user-friendly graphical interface version are available under a GPLv3 license at https://bioinformatics.cvr.ac.uk/software/ with the manual and a test dataset.

Funder

National Institutes of Health

Wellcome Trust

Medical Research Council

Publisher

Oxford University Press (OUP)

Subject

Virology,Microbiology

Reference43 articles.

1. Genome Sequence of Human Cytomegalovirus Ig-KG-H2, a Variant of Strain KG Propagated in the Presence of Neutralizing Antibodies;Al;Microbiology Resource Announcements,2020

2. Basic Local Alignment Search Tool;Altschul;Journal of Molecular Biology,1990

3. Opportunities and Challenges in Long-Read Sequencing Data Analysis;Amarasinghe;Genome Biology,2020

4. SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing;Bankevich;Journal of Computational Biology: A Journal of Computational Molecular Cell Biology,2012

5. Structural Basis for Potent Antibody-Mediated Neutralization of Human Cytomegalovirus;Chandramouli;Science Immunology,2017

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3