Large Language Models and Genomics for Summarizing the Role of microRNA in Regulating mRNA Expression-Reference-Cited by-同舟云学术

Large Language Models and Genomics for Summarizing the Role of microRNA in Regulating mRNA Expression

Published:2024-07-10 Issue:7 Volume:12 Page:1535
ISSN:2227-9059
Container-title:Biomedicines
language:en
Short-container-title:Biomedicines

Author:

Bhasuran Balu¹^ORCID,Manoharan Sharanya²^ORCID,Iyyappan Oviya Ramalakshmi³,Murugesan Gurusamy⁴,Prabahar Archana⁵^ORCID,Raja Kalpana⁶^ORCID

Affiliation:

1. School of Information, Florida State University, Tallahassee, FL 32306, USA

2. Department of Bioinformatics, Stella Maris College, Chennai 600086, Tamil Nadu, India

3. Department of Computer Science and Engineering, Amrita School of Computing, Amrita Vishwa Vidyapeetham, Chennai 641112, Tamil Nadu, India

4. Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Green Fields, Guntur District, Vaddeswaram 522302, Andhra Pradesh, India

5. Center for Gene Regulation in Health and Disease, Department of Biological, Geological, and Environmental Sciences (BGES), Cleveland State University, Cleveland, OH 44115, USA

6. Department of Biomedical Informatics and Data Science, School of Medicine, Yale University, New Haven, CT 06510, USA

Abstract

microRNA (miRNA)–messenger RNA (mRNA or gene) interactions are pivotal in various biological processes, including the regulation of gene expression, cellular differentiation, proliferation, apoptosis, and development, as well as the maintenance of cellular homeostasis and pathogenesis of numerous diseases, such as cancer, cardiovascular diseases, neurological disorders, and metabolic conditions. Understanding the mechanisms of miRNA–mRNA interactions can provide insights into disease mechanisms and potential therapeutic targets. However, extracting these interactions efficiently from a huge collection of published articles in PubMed is challenging. In the current study, we annotated a miRNA–mRNA Interaction Corpus (MMIC) and used it for evaluating the performance of a variety of machine learning (ML) models, deep learning-based transformer (DLT) models, and large language models (LLMs) in extracting the miRNA–mRNA interactions mentioned in PubMed. We used the genomics approaches for validating the extracted miRNA–mRNA interactions. Among the ML, DLT, and LLM models, PubMedBERT showed the highest precision, recall, and F-score, with all equal to 0.783. Among the LLM models, the performance of Llama-2 is better when compared to others. Llama 2 achieved 0.56 precision, 0.86 recall, and 0.68 F-score in a zero-shot experiment and 0.56 precision, 0.87 recall, and 0.68 F-score in a three-shot experiment. Our study shows that Llama 2 achieves better recall than ML and DLT models and leaves space for further improvement in terms of precision and F-score.

Publisher

MDPI AG

Link

https://www.mdpi.com/2227-9059/12/7/1535/pdf

Reference37 articles.

1. Mechanisms of miRNA-Mediated Gene Regulation from Common Downregulation to mRNA-Specific Upregulation;Valinezhad;Int. J. Genom. Proteom.,2014

2. O’Brien, J., Hayder, H., Zayed, Y., and Peng, C. (2018). Overview of MicroRNA Biogenesis, Mechanisms of Actions, and Circulation. Front. Endocrinol., 9.

3. Leitão, A.L., and Enguita, F.J. (2022). A Structural View of miRNA Biogenesis and Function. Non-Coding RNA, 8.

4. Dynamic miRNA–mRNA paradigms: New faces of miRNAs;Ni;Biochem. Biophy. Rep.,2015

5. MicroRNA function: Multiple mechanisms for a tiny RNA?;Pillai;RNA,2005