Assessing Fairness of AlphaFold2 Prediction of Protein 3D Structures

Published: 2023-05-24

Author:

Abbas Usman, Chen Jin, Shao Qing

Abstract

AlphaFold2 is reshaping biomedical research by enabling the prediction of a protein’s 3D structure solely from its amino acid sequence. This breakthrough reduces reliance on the labor-intensive experimental methods traditionally used to obtain protein structures, thereby accelerating the pace of scientific discovery. Despite this promise, it remains unclear whether AlphaFold2 predicts the wide spectrum of proteins equally well. The fairness of its predictions, and any bias in them, has yet to be systematically investigated. In this paper, we conducted an in-depth analysis of AlphaFold2’s fairness using five million reported protein structures from its open-access repository. Specifically, we assessed the variability in the distribution of pLDDT scores with respect to amino acid type, secondary structure, and sequence length. Our findings reveal a systematic discrepancy in AlphaFold2’s predictive reliability across different types of amino acids and secondary structures. Furthermore, we observed that protein size has a notable impact on the credibility of the 3D structural prediction: AlphaFold2 demonstrates stronger predictive power for medium-sized proteins than for smaller or larger ones. These systematic biases could stem from biases inherent in its training data and model architecture. These factors need to be taken into account when expanding the applicability of AlphaFold2.
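As a rough illustration of the kind of per-residue analysis the abstract describes, the sketch below averages pLDDT scores by amino acid type. It relies on the convention that AlphaFold2 model files store pLDDT in the B-factor column of each ATOM record. The function names `make_atom_line` and `plddt_by_residue` are hypothetical, the sample values are made up for demonstration, and this is not the authors' actual pipeline.

```python
from collections import defaultdict
from statistics import mean


def make_atom_line(serial, atom, res, res_seq, plddt):
    """Build a fixed-column PDB ATOM record (hypothetical helper for the demo).

    Coordinates are set to zero; pLDDT goes in the B-factor field (columns 61-66).
    """
    return (f"ATOM  {serial:>5} {atom:<4} {res:>3} A{res_seq:>4}    "
            f"{0.0:8.3f}{0.0:8.3f}{0.0:8.3f}{1.0:6.2f}{plddt:6.2f}")


def plddt_by_residue(pdb_text):
    """Return {residue name: mean pLDDT} from AlphaFold-style PDB text."""
    scores = defaultdict(list)
    for line in pdb_text.splitlines():
        # Use one C-alpha atom per residue so each residue is counted once.
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            res_name = line[17:20].strip()   # residue name, columns 18-20
            plddt = float(line[60:66])       # B-factor field, columns 61-66
            scores[res_name].append(plddt)
    return {name: mean(vals) for name, vals in scores.items()}


# Made-up sample: two alanines and one glycine, plus a non-CA atom that is skipped.
sample = "\n".join([
    make_atom_line(1, "N", "ALA", 1, 90.0),
    make_atom_line(2, "CA", "ALA", 1, 90.0),
    make_atom_line(3, "CA", "GLY", 2, 50.0),
    make_atom_line(4, "CA", "ALA", 3, 70.0),
])
print(plddt_by_residue(sample))
```

The same grouping idea would extend to secondary structure or sequence-length bins by changing the key used for `scores`, given an assumed source for those labels.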

Publisher

Cold Spring Harbor Laboratory

References: 34 articles.

1. Anfinsen, C.B. Principles that govern the folding of protein chains. Science, 1973. 181.

2. Deep learning techniques have significantly impacted protein structure prediction and protein design

3. AI revolutions in biology: The joys and perils of AlphaFold; EMBO Rep, 2021

4. AlphaFold and the amyloid landscape; J Mol Biol, 2021

5. Extending the New Generation of Structure Predictors to Account for Dynamics and Allostery; J Mol Biol, 2021

Cited by 1 article.

1. Click, Compute, Create: A Review of Web-based Tools for Enzyme Engineering; ChemBioChem; 2024-06-03