A KAN-based hybrid deep neural networks for accurate identification of transcription factor binding sites

Author:

He Guodong¹, Ye Jiahao¹, Hao Huijun¹, Chen Wei¹

Affiliation:

1. Wenzhou Business College

Abstract

Background: Predicting protein-DNA binding sites in vivo is a challenging but pressing task in fields such as drug design and development. Most promoters contain many transcription factor (TF) binding sites, but only a small number of them have been identified by time-consuming biochemical experiments. To address this, numerous computational approaches have been proposed to predict TF binding sites from DNA sequences. However, current deep learning methods often suffer from problems such as vanishing gradients as model depth increases, which leads to suboptimal feature extraction.

Results: We propose a model called CRA-KAN (where C stands for convolutional neural network, R for recurrent neural network, and A for attention mechanism) to predict transcription factor binding sites. This hybrid deep neural network uses a Kolmogorov-Arnold network (KAN) in place of the traditional multi-layer perceptron, combines convolutional neural networks with bidirectional long short-term memory (BiLSTM) networks, and uses an attention mechanism to focus on the DNA sequence regions that contain transcription factor binding motifs. Residual connections are introduced so that layers learn residuals between network layers, which eases optimization. Tests on 50 common ChIP-seq benchmark datasets show that CRA-KAN outperforms other state-of-the-art methods such as DeepBind, DanQ, DeepD2V, and DeepSEA in predicting TF binding sites.

Conclusions: By integrating multiple neural network architectures and mechanisms, the CRA-KAN model significantly improves prediction accuracy for transcription factor binding sites. The approach enhances feature extraction, stabilizes training, and improves generalization. The results across multiple key performance indicators demonstrate the potential of CRA-KAN in bioinformatics applications.
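To make the idea of swapping a multi-layer perceptron for a KAN more concrete, the following is a minimal sketch of a single KAN-style layer in plain Python. It is not the authors' implementation: the abstract does not say how the KAN component is parametrized, so this sketch assumes Gaussian radial basis functions as a simple stand-in for the B-splines used in the original KAN formulation. The names `KANLayer` and `rbf_basis`, the number of basis functions, and the input range are all hypothetical choices made for illustration. The point it shows is structural: an MLP applies a fixed nonlinearity after a learned linear map, whereas a KAN layer places a learnable univariate function on every input-to-output edge and sums them.

```python
import math
import random


def rbf_basis(x, centers, width):
    """Evaluate one Gaussian bump per center at the scalar input x."""
    return [math.exp(-((x - c) / width) ** 2) for c in centers]


class KANLayer:
    """Illustrative KAN-style layer (assumed parametrization, not from the paper).

    Each edge (input i -> output j) has its own univariate function
        phi_ji(x) = sum_k coef[j][i][k] * basis_k(x)
    and output j is the sum of phi_ji(x_i) over all inputs i.
    """

    def __init__(self, n_in, n_out, n_basis=5, x_min=-1.0, x_max=1.0, seed=0):
        rng = random.Random(seed)
        step = (x_max - x_min) / (n_basis - 1)
        # Evenly spaced basis centers over the assumed input range.
        self.centers = [x_min + k * step for k in range(n_basis)]
        self.width = step
        # coef[j][i][k]: weight of basis k on the edge from input i to output j.
        self.coef = [
            [[rng.gauss(0.0, 0.1) for _ in range(n_basis)] for _ in range(n_in)]
            for _ in range(n_out)
        ]

    def forward(self, x):
        # Basis activations depend only on the inputs, so compute them once.
        basis = [rbf_basis(xi, self.centers, self.width) for xi in x]
        outputs = []
        for coef_j in self.coef:
            total = 0.0
            for coef_ji, basis_i in zip(coef_j, basis):
                # Learnable univariate function on the edge i -> j.
                total += sum(c * b for c, b in zip(coef_ji, basis_i))
            outputs.append(total)
        return outputs


if __name__ == "__main__":
    layer = KANLayer(n_in=3, n_out=2)
    print(layer.forward([0.1, -0.5, 0.9]))
```

In a model of the kind the abstract describes, a layer like this would sit where the final fully connected layers normally go, taking the pooled feature vector as input. Training the coefficients would require a framework with automatic differentiation, which this sketch deliberately leaves out.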

Publisher

Springer Science and Business Media LLC
