The Rapid Evolution of De Novo Proteins in Structure and Complex

Author:

Chen Jianhai (1), Li Qingrong (2,3), Xia Shengqian (1), Arsala Deanna (1), Sosa Dylan (1), Wang Dong (2,3), Long Manyuan (1)

Affiliation:

1. Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA

2. Division of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA

3. Department of Cellular & Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, CA 92093, USA

Abstract

Recent genome-wide studies in rice have established that de novo genes, which evolve from noncoding sequences, enhance protein diversity through a stepwise process. However, the pattern and rate of their evolution in protein structure over time remain unclear. Here, we addressed these issues within a surprisingly short evolutionary timescale (<1 million years for 97% of Oryza de novo genes) by comparing de novo genes with gene duplicates. We found that de novo genes evolve faster than gene duplicates in intrinsically disordered regions (such as random coils), secondary structure elements (such as α helix and β strand), hydrophobicity, and molecular recognition features. Specifically, in de novo proteins we observed an 8% to 14% decay in random coil and intrinsically disordered region lengths and a 2.3% to 6.5% increase in structured elements, hydrophobicity, and molecular recognition features, per million years on average. These patterns of structural evolution also align with changes in amino acid composition over time. We further found that de novo proteins carry higher positive charges but have smaller molecular weights than duplicates. Tertiary structure predictions showed that most de novo proteins, though not typically well folded on their own, readily form low-energy and compact complexes with other proteins, facilitated by extensive residue contacts and conformational flexibility. This suggests a faster-binding scenario in which de novo proteins promote interaction. These analyses illuminate the rapid evolution of protein structure in de novo genes in rice genomes, which originate from noncoding sequences, and highlight their quick transformation into active, protein complex-forming components within a remarkably short evolutionary timeframe.

Publisher

Oxford University Press (OUP)

