Abstract
AbstractLiquid-liquid phase separation (LLPS) of proteins is thought to be a primary driving force for the formation of membraneless organelles, which control a wide range of biological functions from stress response to ribosome biogenesis. LLPS of proteins in cells is primarily, though not exclusively, driven by intrinsically disordered (ID) domains. Accurate identification of ID regions (IDRs) that drive phase separation is important for testing the underlying mechanisms of phase separation, identifying biological processes that rely on phase separation, and designing sequences that modulate phase separation. To identify IDRs that drive phase separation, we first curated datasets of folded, ID, and phase-separating (PS) ID sequences. We then used these sequence sets to examine how broadly existing amino acids scales can be used to distinguish between the three classes of protein regions. We found that there are robust property differences between the classes and, consequently, that numerous combinations of amino acid property scales can be used to make robust predictions of LLPS. This result indicates that multiple, redundant mechanisms contribute to the formation of phase-separated droplets from IDRs. The top-performing scales were used to further optimize our previously developed predictor of PS IDRs, ParSe. We then modified ParSe to account for interactions between amino acids and obtained reasonable predictive power for mutations that have been designed to test the role of amino acid interactions in driving LLPS.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献