A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning-Reference-Cited by-同舟云学术

A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning

Published:2019-01-01 Issue: Volume:2019 Page:
ISSN:1758-0463
Container-title:Database
language:en
Short-container-title:

Author:

Chen Tao¹,Wu Mingfen¹,Li Hexi¹

Affiliation:

1. Department of Computer Science and Engineering, Faculty of Intelligent Manufacturing, Wuyi University, No.22, Dongcheng village, Pengjiang district, Jiangmen City, Guangdong Province, 529020, China

Abstract

Abstract The automatic extraction of meaningful relations from biomedical literature or clinical records is crucial in various biomedical applications. Most of the current deep learning approaches for medical relation extraction require large-scale training data to prevent overfitting of the training model. We propose using a pre-trained model and a fine-tuning technique to improve these approaches without additional time-consuming human labeling. Firstly, we show the architecture of Bidirectional Encoder Representations from Transformers (BERT), an approach for pre-training a model on large-scale unstructured text. We then combine BERT with a one-dimensional convolutional neural network (1d-CNN) to fine-tune the pre-trained model for relation extraction. Extensive experiments on three datasets, namely the BioCreative V chemical disease relation corpus, traditional Chinese medicine literature corpus and i2b2 2012 temporal relation challenge corpus, show that the proposed approach achieves state-of-the-art results (giving a relative improvement of 22.2, 7.77, and 38.5% in F1 score, respectively, compared with a traditional 1d-CNN classifier). The source code is available at https://github.com/chentao1999/MedicalRelationExtraction.

Funder

Guangdong Provincial Education Department

Guangdong Natural Science Foundation

Graduate Education Innovation

Integration of cloud computing and big data innovation project

Jiangmen foundation and theoretical science research project

Publisher

Oxford University Press (OUP)

Subject

General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology,Information Systems

Link

http://academic.oup.com/database/article-pdf/doi/10.1093/database/baz116/31197246/baz116.pdf