Optimized model architectures for deep learning on genomic data

Published: 2024-04-30 Volume: 7 Issue: 1
ISSN: 2399-3642
Container-title: Communications Biology
Language: en
Short-container-title: Commun Biol

Author:

Gündüz Hüseyin Anil, Mreches René, Moosbauer Julia, Robertson Gary, To Xiao-Yin, Franzosa Eric A., Huttenhower Curtis, Rezaei Mina, McHardy Alice C., Bischl Bernd, Münch Philipp C., Binder Martin

Abstract

The success of deep learning in various applications depends on task-specific architecture design choices, including the types, hyperparameters, and number of layers. In computational biology, there is no consensus on the optimal architecture design, and decisions are often made using insights from more well-established fields such as computer vision. These may not consider the domain-specific characteristics of genome sequences, potentially limiting performance. Here, we present GenomeNet-Architect, a neural architecture design framework that automatically optimizes deep learning models for genome sequence data. It optimizes the overall layout of the architecture, with a search space specifically designed for genomics. Additionally, it optimizes hyperparameters of individual layers and the model training procedure. On a viral classification task, GenomeNet-Architect reduced the read-level misclassification rate by 19%, with 67% faster inference and 83% fewer parameters, and achieved similar contig-level accuracy with ~100 times fewer parameters compared to the best-performing deep learning baselines.

Funder

Deutsche Forschungsgemeinschaft

Bundesministerium für Bildung und Forschung

Deutsches Zentrum für Infektionsforschung

U.S. Department of Health & Human Services | NIH | National Institute of Allergy and Infectious Diseases

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s42003-024-06161-1.pdf

References: 42 articles.

1. LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).

2. AlQuraishi, M. AlphaFold at CASP13. Bioinformatics 35, 4862–4865 (2019).

3. Ronneberger, O., Fischer, P. & Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. in Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015 234–241 (Springer International Publishing, 2015).

4. Daoud, M. & Mayo, M. A survey of neural network-based cancer prediction models from microarray data. Artif. Intell. Med. 97, 204–214 (2019).

5. Patterson, J. & Gibson, A. Deep Learning: A Practitioner’s Approach. (O’Reilly Media, Inc., 2017).

Cited by 1 article.

1. Author Correction: Optimized model architectures for deep learning on genomic data. Communications Biology (2024-05-23).