Protein fold recognition based on multi-view modeling

Author:

Yan Ke (1), Fang Xiaozhao (2), Xu Yong (1), Liu Bin (1, 3)

Affiliation:

1. School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Guangdong, China

2. School of Computer Science and Technology, Guangdong University of Technology, Guangzhou, China

3. School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China

Abstract

Motivation: Protein fold recognition has attracted increasing attention because it is critical for studying the 3D structures of proteins and for drug design. The task has been studied extensively, and several features with high discriminative power have been proposed. However, combining these features efficiently to improve predictive performance remains a challenging problem.

Results: In this study, we proposed two algorithms: MV-fold and MT-fold. MV-fold is a new computational predictor for fold recognition based on a multi-view learning model. Different features of proteins, including evolutionary information, secondary structure information and physicochemical properties, were treated as different views of the proteins. These views constituted the latent space. The ε-dragging technique was employed to enlarge the margins between different protein folds, improving the predictive performance of MV-fold. MV-fold was then combined with two template-based methods, HHblits and HMMER. The resulting ensemble method, MT-fold, incorporates the advantages of both discriminative and template-based methods. Experimental results on five widely used benchmark datasets (DD, RDD, EDD, TG and LE) showed that the proposed methods outperformed some state-of-the-art methods in this field, indicating that MV-fold and MT-fold are useful computational tools for protein fold recognition and protein homology detection, and would be efficient tools for protein sequence analysis. Finally, we constructed an updated and rigorous benchmark dataset based on SCOPe (version 2.07) to fairly evaluate the proposed method, which achieved stable performance on this new dataset. This new benchmark dataset is expected to become widely used for fairly evaluating different fold recognition methods.

Supplementary information: Supplementary data are available at Bioinformatics online.
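The abstract names ε-dragging as the step that enlarges the margins between folds. As an illustration only, the sketch below shows the label-relaxation update as it is commonly formulated in discriminative least squares regression: one-hot targets Y are replaced by Y + B ⊙ M, where B is +1 for the true class and -1 otherwise, and M ≥ 0 is the dragging amount. This is not the MV-fold objective itself, which additionally learns a latent space across views; the function names and the toy scores are hypothetical.

```python
def one_hot(labels, n_classes):
    """Binary target matrix Y: Y[i][j] = 1 if sample i has class j, else 0."""
    return [[1.0 if j == y else 0.0 for j in range(n_classes)] for y in labels]


def dragging_directions(labels, n_classes):
    """Direction matrix B: +1 for the true class, -1 for every other class."""
    return [[1.0 if j == y else -1.0 for j in range(n_classes)] for y in labels]


def update_dragging(scores, labels, n_classes):
    """For fixed regression outputs `scores`, return (M, relaxed targets).

    M[i][j] = max(B[i][j] * (scores[i][j] - Y[i][j]), 0) is the optimal
    non-negative dragging amount, and the relaxed target is Y + B * M
    (element-wise). True-class targets can only move above 1 and
    other-class targets can only move below 0, which widens the margin.
    """
    Y = one_hot(labels, n_classes)
    B = dragging_directions(labels, n_classes)
    M = [[max(b * (s - y), 0.0) for b, s, y in zip(b_row, s_row, y_row)]
         for b_row, s_row, y_row in zip(B, scores, Y)]
    T = [[y + b * m for y, b, m in zip(y_row, b_row, m_row)]
         for y_row, b_row, m_row in zip(Y, B, M)]
    return M, T


# Hypothetical toy input: 2 proteins, 3 candidate folds (assumed values).
labels = [0, 2]
scores = [[1.4, -0.3, 0.2],
          [0.1, 0.5, 0.7]]
M, targets = update_dragging(scores, labels, 3)
```

In a full alternating scheme this update would be interleaved with re-fitting the regression (or, in a multi-view setting, the shared latent representation) against the relaxed targets. In the toy input, the first protein's targets are dragged outward because its scores already exceed the one-hot values in the margin-enlarging direction, while the second protein's targets stay at their one-hot values.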

Funder

National Natural Science Foundation of China

Fok Ying-Tung Education Foundation for Young Teachers in the Higher Education Institutions of China

Scientific Research Foundation in Shenzhen

Guangdong Province High-Level Personnel of Special Support Program

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics, Computational Theory and Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Statistics and Probability

Cited by 73 articles.

1. Tensor-based global block-diagonal structure radiation for incomplete multiview clustering;Expert Systems with Applications;2024-12

2. Simple Multigraph Convolution Networks;Companion Proceedings of the ACM Web Conference 2024;2024-05-13

3. Incomplete multi-view learning: Review, analysis, and prospects;Applied Soft Computing;2024-03

4. Shape-aware contrastive deep supervision for esophageal tumor segmentation from CT scans;2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM);2023-12-05

5. IIFS: An improved incremental feature selection method for protein sequence processing;Computers in Biology and Medicine;2023-12
