Author:
Lou Yinxia,Zhu Xun,Tan Kai
Abstract
AbstractBiomedical named entity recognition (BioNER) is an essential task in biomedical information analysis. Recently, deep neural approaches have become widely utilized for BioNER. Biomedical dictionaries, implemented through a masked manner, are frequently employed in these methods to enhance entity recognition. However, their performance remains limited. In this work, we propose a dictionary-based matching graph network for BioNER. This approach utilizes the matching graph method to project all possible dictionary-based entity combinations in the text onto a directional graph. The network is implemented coherently with a bi-directional graph convolutional network (BiGCN) that incorporates the matching graph information. Our proposed approach fully leverages the dictionary-based matching graph instead of a simple masked manner. We have conducted numerous experiments on five typical Bio-NER datasets. The proposed model shows significant improvements in F1 score compared to the state-of-the-art (SOTA) models: 2.8% on BC2GM, 1.3% on BC4CHEMD, 1.1% on BC5CDR, 1.6% on NCBI-disease, and 0.5% on JNLPBA. The results show that our model, which is superior to other models, can effectively recognize natural biomedical named entities.
Funder
National Natural Science Foundation of China
Doctor Scientific Research Fund of Jianghan University
Publisher
Springer Science and Business Media LLC
Reference25 articles.
1. Murugesan, G., Abdulkadhar, S., Bhasuran, B. & Natarajan, J. BCC-NER: Bidirectional, contextual clues named entity tagger for gene/protein mention recognition. EURASIP J. Bioinf. Syst. Biol. 2017, 7 (2017).
2. Lai, T., Ji, H., Zhai, C., & Tran, Q.H. Joint biomedical entity and relation extraction with knowledge-enhanced collective inference. Preprint at arXiv:2105.13456, (2021).
3. Abboud, R., Ceylan, I., Lukasiewicz, T. & Salvatori, T. Boxe: A box embedding model for knowledge base completion. Adv. Neural. Inf. Process. Syst. 33, 9649–9661 (2020).
4. Huang, C. C. & Lu, Z. Community challenges in biomedical text mining over 10 years: Success, failure and the future. Brief. Bioinform. 17(1), 132–144 (2016).
5. Liu, L., et al. Empower sequence labeling with task-aware neural language model. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 32, (2018).
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献