Evaluating generalizability of artificial intelligence models for molecular datasets-Reference-Cited by-同舟云学术

Evaluating generalizability of artificial intelligence models for molecular datasets

Published:2024-02-28 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Ektefaie Yasha^ORCID,Shen Andrew,Bykova Daria,Marin Maximillian,Zitnik Marinka^ORCID,Farhat Maha

Abstract

Deep learning has made rapid advances in modeling molecular sequencing data. Despite achieving high performance on benchmarks, it remains unclear to what extent deep learning models learn general principles and generalize to previously unseen sequences. Benchmarks traditionally interrogate model generalizability by generating metadata based (MB) or sequence-similarity based (SB) train and test splits of input data before assessing model performance. Here, we show that this approach mischaracterizes model generalizability by failing to consider the full spectrum of cross-split overlap,i.e., similarity between train and test splits. We introduce SPECTRA, a spectral framework for comprehensive model evaluation. For a given model and input data, SPECTRA plots model performance as a function of decreasing cross-split overlap and reports the area under this curve as a measure of generalizability. We apply SPECTRA to 18 sequencing datasets with associated phenotypes ranging from antibiotic resistance in tuberculosis to protein-ligand binding to evaluate the generalizability of 19 state-of-the-art deep learning models, including large language models, graph neural networks, diffusion models, and convolutional neural networks. We show that SB and MB splits provide an incomplete assessment of model generalizability. With SPECTRA, we find as cross-split overlap decreases, deep learning models consistently exhibit a reduction in performance in a task- and model-dependent manner. Although no model consistently achieved the highest performance across all tasks, we show that deep learning models can generalize to previously unseen sequences on specific tasks. SPECTRA paves the way toward a better understanding of how foundation models generalize in biology.

Publisher

Cold Spring Harbor Laboratory

Reference101 articles.

1. A convolutional neural network highlights mutations relevant to antimicrobial resistance in mycobacterium tuberculosis;Nat. Commun,2022

2. Lite-SeqCNN: A light-weight deep CNN architecture for protein function prediction;IEEE/ACM Trans. Comput. Biol. Bioinform,2023

3. ProteInfer, deep neural networks for protein functional inference

4. Using deep learning to annotate the protein universe;Nature Biotechnology,2022

5. Parrot is a flexible recurrent neural network framework for analysis of large protein datasets;eLife,2021

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. TDC-2: Multimodal Foundation for Therapeutic Science;2024-06-14