A Comprehensive Evaluation of Large Language Models in Mining Gene Interactions and Pathway Knowledge

Published: 2024-01-24

Author:

Azam Muhammad, Chen Yibo, Arowolo Micheal Olaolu, Liu Haowang, Popescu Mihail, Xu Dong

Abstract

Background: Understanding complex biological pathways, including gene-gene interactions and gene regulatory networks, is critical for exploring disease mechanisms and drug development. Manual literature curation of biological pathways is useful but cannot keep up with the exponential growth of the literature. Large-scale language models (LLMs), notable for their vast parameter sizes and comprehensive training on extensive text corpora, have great potential in automated text mining of biological pathways.

Method: This study assesses the effectiveness of 21 LLMs, including both API-based models and open-source models. The evaluation focused on two key aspects: gene regulatory relations (specifically, ‘activation’, ‘inhibition’, and ‘phosphorylation’) and KEGG pathway component recognition. The performance of these models was analyzed using statistical metrics such as precision, recall, F1 scores, and the Jaccard similarity index.

Results: Our results indicated a significant disparity in model performance. Among the API-based models, ChatGPT-4 and Claude-Pro showed superior performance, with F1 scores of 0.4448 and 0.4386 for the gene regulatory relation prediction, and Jaccard similarity indices of 0.2778 and 0.2657 for the KEGG pathway prediction, respectively. Open-source models lagged behind their API-based counterparts; among them, Falcon-180b-chat and llama1-7b performed best in gene regulatory relations (F1 of 0.2787 and 0.1923, respectively) and KEGG pathway recognition (Jaccard similarity index of 0.2237 and 0.2207, respectively).

Conclusion: LLMs are valuable in biomedical research, especially in gene network analysis and pathway mapping. However, their effectiveness varies, necessitating careful model selection. This work also provides a case study and insight into using LLMs as knowledge graphs.
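The metrics named in the abstract can be illustrated with a minimal Python sketch. It assumes that predictions and reference annotations are compared as exact-match sets; the abstract does not state how the authors matched or aggregated items, so the function names, the triple format, and the placeholder gene names below are hypothetical and serve only as an illustration.

```python
def precision_recall_f1(predicted, reference):
    """Set-based precision, recall, and F1 using exact matching (an assumption)."""
    predicted, reference = set(predicted), set(reference)
    true_positives = len(predicted & reference)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def jaccard_index(predicted, reference):
    """Jaccard similarity: size of the intersection divided by size of the union."""
    predicted, reference = set(predicted), set(reference)
    union = predicted | reference
    return len(predicted & reference) / len(union) if union else 0.0


# Hypothetical regulatory relations as (source, relation, target) triples.
predicted_relations = {
    ("GENE_A", "activation", "GENE_B"),
    ("GENE_C", "inhibition", "GENE_D"),
    ("GENE_E", "phosphorylation", "GENE_F"),
}
reference_relations = {
    ("GENE_A", "activation", "GENE_B"),
    ("GENE_C", "inhibition", "GENE_D"),
    ("GENE_G", "activation", "GENE_H"),
    ("GENE_I", "inhibition", "GENE_J"),
}
precision, recall, f1 = precision_recall_f1(predicted_relations, reference_relations)

# Hypothetical pathway membership: genes a model lists vs. genes in the reference pathway.
predicted_genes = {"GENE_A", "GENE_B", "GENE_C"}
reference_genes = {"GENE_B", "GENE_C", "GENE_D", "GENE_E"}
pathway_jaccard = jaccard_index(predicted_genes, reference_genes)

print(f"precision={precision:.4f} recall={recall:.4f} F1={f1:.4f}")
print(f"Jaccard={pathway_jaccard:.4f}")
```

In this toy case two of three predicted relations are correct and two of four reference relations are recovered, so F1 is the harmonic mean of 2/3 and 1/2; the Jaccard index for the gene sets is 2 shared genes over 5 distinct genes.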

Publisher

Cold Spring Harbor Laboratory

Cited by 2 articles.

1. GeneRAG: Enhancing Large Language Models with Gene-Related Task by Retrieval-Augmented Generation;2024-06-28

2. Prototyping an Ontological Framework for Cellular Senescence Mechanisms: A Homeostasis Imbalance Perspective;Scientific Data;2024-05-10