Author:
Li Yi-Fan,Pan Xiaoyong,Shen Hong-Bin
Abstract
AbstractNuclear localization signals (NLSs) are essential peptide fragments within proteins that play a decisive role in guiding proteins into the cell nucleus. Determining the existence and precise locations of NLSs experimentally is time-consuming and complicated, resulting in a scarcity of experimentally validated NLS fragments. Consequently, annotated NLS datasets are relatively small, presenting challenges for data-driven methods. In this study, we propose an innovative interpretable approach, NLSExplorer, which leverages large-scale protein language models to capture crucial biological information with a novel attention-based deep network for NLS identification. By utilizing the knowledge retrieved from protein language models, NLSExplorer achieves superior predictive performance compared to existing methods on two NLS benchmark datasets. Additionally, NLSExplorer is able to detect various kinds of segments highly correlated with nuclear transport, such as nuclear export signals. We employ NLSExplorer to investigate potential NLSs and other domains that are important for nuclear transport in nucleus-localized proteins within the Swiss-Prot database. Further comprehensive pattern analysis for all these segments uncovers a potential NLS space and internal relationship of important nuclear transport segments for 416 species. This study not only introduces a powerful tool for predicting and exploring NLS space, but also offers a versatile network that detects characteristic domains and motifs of NLSs.
Publisher
Cold Spring Harbor Laboratory
Reference60 articles.
1. Prediction of protein subcellular localization;Proteins: Structure, Function, and Bioinformatics,2006
2. Nuclear localization signals (NLS);Critical reviews in eukaryotic gene expression,1993
3. Types of nuclear localization signals and mechanisms of protein import into the nucleus;Cell communication and signaling,2021
4. Yu, M. et al. Visualizing the disordered nuclear transport machinery in situ. Nature, 1–8 (2023).
5. An Argonaute Transports siRNAs from the Cytoplasm to the Nucleus