AI-Driven Deep Learning Techniques in Protein Structure Prediction
-
Published:2024-08-01
Issue:15
Volume:25
Page:8426
-
ISSN:1422-0067
-
Container-title:International Journal of Molecular Sciences
-
language:en
-
Short-container-title:IJMS
Author:
Chen Lingtao1ORCID, Li Qiaomu1, Nasif Kazi Fahim Ahmad1, Xie Ying1, Deng Bobin1, Niu Shuteng2, Pouriyeh Seyedamin1ORCID, Dai Zhiyu3ORCID, Chen Jiawei4ORCID, Xie Chloe Yixin1ORCID
Affiliation:
1. College of Computing and Software Engineering, Kennesaw State University, Marietta, GA 30060, USA 2. Department of Computer Science, Bowling Green State University, Bowling Green, OH 43403, USA 3. Division of Pulmonary and Critical Care Medicine, John T. Milliken Department of Medicine, Washington University School of Medicine in St. Louis, St. Louis, MO 63110, USA 4. College of Computing, Data Science and Society, University of California, Berkeley, CA 94720, USA
Abstract
Protein structure prediction is important for understanding their function and behavior. This review study presents a comprehensive review of the computational models used in predicting protein structure. It covers the progression from established protein modeling to state-of-the-art artificial intelligence (AI) frameworks. The paper will start with a brief introduction to protein structures, protein modeling, and AI. The section on established protein modeling will discuss homology modeling, ab initio modeling, and threading. The next section is deep learning-based models. It introduces some state-of-the-art AI models, such as AlphaFold (AlphaFold, AlphaFold2, AlphaFold3), RoseTTAFold, ProteinBERT, etc. This section also discusses how AI techniques have been integrated into established frameworks like Swiss-Model, Rosetta, and I-TASSER. The model performance is compared using the rankings of CASP14 (Critical Assessment of Structure Prediction) and CASP15. CASP16 is ongoing, and its results are not included in this review. Continuous Automated Model EvaluatiOn (CAMEO) complements the biennial CASP experiment. Template modeling score (TM-score), global distance test total score (GDT_TS), and Local Distance Difference Test (lDDT) score are discussed too. This paper then acknowledges the ongoing difficulties in predicting protein structure and emphasizes the necessity of additional searches like dynamic protein behavior, conformational changes, and protein–protein interactions. In the application section, this paper introduces some applications in various fields like drug design, industry, education, and novel protein development. In summary, this paper provides a comprehensive overview of the latest advancements in established protein modeling and deep learning-based models for protein structure predictions. It emphasizes the significant advancements achieved by AI and identifies potential areas for further investigation.
Funder
Kennesaw State University
Reference131 articles.
1. Schulz, G.E., and Schirmer, R.H. (2013). Principles of Protein Structure, Springer Science & Business Media. 2. Petsko, G.A., and Ringe, D. (2004). Protein Structure and Function, New Science Press. 3. The development of specialties in science: The case of X-ray protein crystallography;Law;Sci. Stud.,1973 4. x Ray crystallography;Smyth;Mol. Pathol.,2000 5. NMR-based methods for protein analysis;Hu;Anal. Chem.,2021
|
|