Deep Learning Encoding for Rapid Sequence Identification on Microbiome Data-Reference-Cited by-同舟云学术

Deep Learning Encoding for Rapid Sequence Identification on Microbiome Data

Published:2022-06-24 Issue: Volume:2 Page:
ISSN:2673-7647
Container-title:Frontiers in Bioinformatics
language:
Short-container-title:Front. Bioinform.

Author:

Borgman Jacob,Stark Karen,Carson Jeremy,Hauser Loren

Abstract

We present a novel approach for rapidly identifying sequences that leverages the representational power of Deep Learning techniques and is applied to the analysis of microbiome data. The method involves the creation of a latent sequence space, training a convolutional neural network to rapidly identify sequences by mapping them into that space, and we leverage the novel encoded latent space for denoising to correct sequencing errors. Using mock bacterial communities of known composition, we show that this approach achieves single nucleotide resolution, generating results for sequence identification and abundance estimation that match the best available microbiome algorithms in terms of accuracy while vastly increasing the speed of accurate processing. We further show the ability of this approach to support phenotypic prediction at the sample level on an experimental data set for which the ground truth for sequence identities and abundances is unknown, but the expected phenotypes of the samples are definitive. Moreover, this approach offers a potential solution for the analysis of data from other types of experiments that currently rely on computationally intensive sequence identification.

Publisher

Frontiers Media SA

Subject

General Medicine

Reference52 articles.

1. Microbiome 101: Studying, Analyzing, and Interpreting Gut Microbiome Data for Clinicians;Allaband;Clin. Gastroenterol. Hepatol.,2019

2. Basic Local Alignment Search Tool;Altschul;J. Mol. Biol.,1990

3. Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns;Amir;mSystems,2017

4. MicroPheno: Predicting Environments and Host Phenotypes from 16S rRNA Gene Sequencing Using a K-Mer Based Representation of Shallow Sub-samples;Asgari;Bioinformatics,2018

5. Seeker: Alignment-free Identification of Bacteriophage Genomes by Deep Learning;Auslander;Nucleic Acids Res.,2020

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep learning methods in metagenomics: a review;Microbial Genomics;2024-04-17

2. Artificial intelligence-driven microbiome data analysis for estimation of postmortem interval and crime location;Frontiers in Microbiology;2024-01-19

3. Deep learning methods in metagenomics: a review;2023-08-08

4. 1D Barcode Detection: Novel Benchmark Datasets and Comprehensive Comparison of Deep Convolutional Neural Network Approaches;Sensors;2022-11-14