Author:
Fei Hao,Ren Yafeng,Zhang Yue,Ji Donghong,Liang Xiaohui
Abstract
Abstract
Biomedical information extraction (BioIE) is an important task. The aim is to analyze biomedical texts and extract structured information such as named entities and semantic relations between them. In recent years, pre-trained language models have largely improved the performance of BioIE. However, they neglect to incorporate external structural knowledge, which can provide rich factual information to support the underlying understanding and reasoning for biomedical information extraction. In this paper, we first evaluate current extraction methods, including vanilla neural networks, general language models and pre-trained contextualized language models on biomedical information extraction tasks, including named entity recognition, relation extraction and event extraction. We then propose to enrich a contextualized language model by integrating a large scale of biomedical knowledge graphs (namely, BioKGLM). In order to effectively encode knowledge, we explore a three-stage training procedure and introduce different fusion strategies to facilitate knowledge injection. Experimental results on multiple tasks show that BioKGLM consistently outperforms state-of-the-art extraction models. A further analysis proves that BioKGLM can capture the underlying relations between biomedical knowledge concepts, which are crucial for BioIE.
Funder
National Natural Science Foundation of China
Ministry of Education of China
Key Project of State Language Commission of China
Guangdong Basic and Applied Basic Research Foundation
Publisher
Oxford University Press (OUP)
Subject
Molecular Biology,Information Systems
Reference68 articles.
1. Identifying protein-protein interaction using tree lstm and structured attention;Ahmed,2019
2. Publicly available clinical BERT embeddings;Alsentzer;CoRR,2019
3. Automatic extraction of gene-disease associations from literature using joint ensemble learning;Bhasuran;PloS One,2018
4. Generalizing biomedical event extraction;Björne;Proceedings of the BioNLP Shared Task 2011 Workshop,2011
5. Biomedical event extraction using convolutional neural networks and dependency parsing;Björne;Proceedings of the BioNLP 2018 Workshop,2018
Cited by
59 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献