Author:
Zheng Vicky, Sariyuce Ahmet Erdem, Zola Jaroslaw
Abstract
With the emergence of portable DNA sequencers, such as the Oxford Nanopore Technologies MinION, metagenomic DNA sequencing can be performed in real time and directly in the field. However, because metagenomic DNA analysis is computationally and memory intensive, and current methods are designed for batch processing, existing metagenomic tools are not well suited for mobile devices. In this paper, we propose a new memory-efficient method to identify Operational Taxonomic Units (OTUs) in metagenomic DNA streams. Our method is based on finding connected components in overlap graphs constructed over a real-time stream of long DNA reads, as produced by the MinION platform. We propose an efficient algorithm to maintain connected components when an overlap graph is streamed, and show how redundant information can be removed from the stream by transitive closures. Through experiments on simulated and real-world metagenomic data, we demonstrate that the resulting solution is able to recover OTUs with high precision while remaining suitable for mobile computing devices.
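The idea of maintaining connected components over a streamed overlap graph can be illustrated with a union-find (disjoint-set) structure. The sketch below is not the authors' algorithm; it is a minimal illustration under stated assumptions: each overlap arrives as a pair of read identifiers, an overlap whose endpoints already share a component is treated as redundant and dropped, and the names StreamingComponents, add_overlap, and components are hypothetical.

```python
class StreamingComponents:
    """Illustrative union-find over a stream of read overlaps (hypothetical names)."""

    def __init__(self):
        self.parent = {}   # read id -> parent read id
        self.size = {}     # root read id -> number of reads in its component
        self.skipped = 0   # overlaps that added no new connectivity

    def find(self, read_id):
        # Register reads lazily, the first time they appear in the stream.
        if read_id not in self.parent:
            self.parent[read_id] = read_id
            self.size[read_id] = 1
            return read_id
        # Locate the root of the component.
        root = read_id
        while self.parent[root] != root:
            root = self.parent[root]
        # Path compression: point every visited read directly at the root.
        while self.parent[read_id] != root:
            next_id = self.parent[read_id]
            self.parent[read_id] = root
            read_id = next_id
        return root

    def add_overlap(self, a, b):
        """Process one overlap edge; return True if it merged two components."""
        root_a, root_b = self.find(a), self.find(b)
        if root_a == root_b:
            # Assumption: an edge inside an existing component is redundant.
            self.skipped += 1
            return False
        # Union by size keeps the trees shallow.
        if self.size[root_a] < self.size[root_b]:
            root_a, root_b = root_b, root_a
        self.parent[root_b] = root_a
        self.size[root_a] += self.size[root_b]
        return True

    def components(self):
        """Return the current components as a list of sets of read ids."""
        groups = {}
        for read_id in list(self.parent):
            groups.setdefault(self.find(read_id), set()).add(read_id)
        return list(groups.values())


# Usage: overlaps arrive one at a time; only the parent and size maps are kept.
stream = [("r1", "r2"), ("r2", "r3"), ("r1", "r3"), ("r4", "r5")]
tracker = StreamingComponents()
for a, b in stream:
    tracker.add_overlap(a, b)
```

In this sketch each component stands in for a candidate OTU, and memory grows with the number of reads rather than the number of overlaps, since edges are discarded once processed.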
Publisher
Cold Spring Harbor Laboratory