Author:
Dwivedi Gaurav, Khandelwal Monika, Rout Ranjeet Kumar, Umer Saiyed, Mallik Saurav, Qin Hong
Abstract
Protein methylation is a vital regulator of many biological processes at the post-translational level, and accurate prediction of protein methylation sites is essential for research and drug discovery. In this paper, we present a new method, RMSxAI, that predicts arginine methylation sites from primary sequences using machine learning algorithms and explains the predictions using explainable artificial intelligence (XAI) techniques. Leveraging experimentally validated methylated and unmethylated protein sequences from diverse organisms, we derived several sequence features, encompassing physicochemical properties, amino acid composition, and evolutionary information. Our results show that RMSxAI predicts protein methylation sites with high accuracy, reaching an F1 score of up to 0.88 and an overall accuracy of up to 88.4%. We use various XAI methods to explain the output results. These include key features, partial occupancy maps, and local variation models, which give insight into the features and interactions that drive the predictions. Overall, our approach is relevant to research and drug discovery, and our results demonstrate the potential of machine learning algorithms and XAI methods to provide accurate and meaningful predictions of arginine methylation sites.
Funder
National Science Foundation
Publisher
Springer Science and Business Media LLC