Affiliation:
1. Max Planck Institute for Molecular Genetics Otto Warburg Laboratory Berlin Germany
2. Zhengzhou Tobacco Research Institute of China National Tobacco Corporation Zhengzhou China
3. CAS Key Laboratory of Quantitative Engineering Biology Shenzhen Institute of Synthetic Biology Shenzhen Institute of Advanced Technology Chinese Academy of Sciences Shenzhen China
Abstract
AbstractThe transforming growth factor‐β (TGF‐β) superfamily, including Nodal and Activin, plays a critical role in various cellular processes. Understanding the intricate regulation and gene expression dynamics of TGF‐β signalling is of interest due to its diverse biological roles. A machine learning approach is used to predict gene expression patterns induced by Activin using features, such as histone modifications, RNA polymerase II binding, SMAD2‐binding, and mRNA half‐life. RNA sequencing and ChIP sequencing datasets were analysed and differentially expressed SMAD2‐binding genes were identified. These genes were classified into activated and repressed categories based on their expression patterns. The predictive power of different features and combinations was evaluated using logistic regression models and their performances were assessed. Results showed that RNA polymerase II binding was the most informative feature for predicting the expression patterns of SMAD2‐binding genes. The authors provide insights into the interplay between transcriptional regulation and Activin signalling and offers a computational framework for predicting gene expression patterns in response to cell signalling.
Funder
China Scholarship Council
Publisher
Institution of Engineering and Technology (IET)