Highly accurate discovery of terpene synthases powered by machine learning reveals functional terpene cyclization in Archaea-Reference-Cited by-同舟云学术

Highly accurate discovery of terpene synthases powered by machine learning reveals functional terpene cyclization in Archaea

Published:2024-01-31 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Samusevich Raman^ORCID,Hebra Téo^ORCID,Bushuiev Roman^ORCID,Bushuiev Anton^ORCID,Čalounová Tereza,Smrčková Helena,Chatpatanasiri Ratthachat,Kulhánek Jonáš^ORCID,Perković Milana,Engst Martin,Tajovská Adéla,Sivic Josef^ORCID,Pluskal Tomáš^ORCID

Abstract

AbstractTerpene synthases (TPSs) generate the scaffolds of the largest class of natural products, including several first-line medicines. The amount of available protein sequences is increasing exponentially, and accurate computational characterization of their function remains an unsolved challenge. We assembled a curated dataset of one thousand characterized TPS reactions and developed a method to devise highly accurate machine-learning models for functional annotation in a low-data regime. Our models significantly outperform existing methods for TPS detection and substrate prediction. By applying the models to large protein sequence databases, we discovered seven TPS enzymes previously undetected by state-of-the-art protein signatures and experimentally confirmed their activity, including the first reported TPSs in the major domain of life Archaea. Furthermore, we discovered a new TPS structural domain and distinct subtypes of previously known domains. This work demonstrates the potential of machine learning to speed up the discovery and characterization of novel TPSs.

Publisher

Cold Spring Harbor Laboratory

Reference87 articles.

1. Use of Terpenoids as Natural Flavouring Compounds in Food Industry

2. The discovery of artemisinin and the Nobel Prize in Physiology or Medicine

3. Liu, K. , Zuo, H. , Li, G. , Yu, H. & Hu, Y . Global research on artemisinin and its derivatives: Perspectives from patents. Pharmacol. Res. 159, 105048 (2020).

4. Sesquiterpenoids Lactones: Benefits to Plants and People

5. Euphorbia Diterpenes: Isolation, Structure, Biological Activity, and Synthesis (2008–2012);Chem. Rev,2014