Affiliation:
1. School of Information Science and Technology, Dalian Maritime University , Dalian 116026, China
2. School of Computer Science and Technology, Dalian University of Technology , Dalian 116024, China
Abstract
Abstract
Motivation
Biomedical relation extraction at the document level (Bio-DocRE) involves extracting relation instances from biomedical texts that span multiple sentences, often containing various entity concepts such as genes, diseases, chemicals, variants, etc. Currently, this task is usually implemented based on graphs or transformers. However, most work directly models entity features to relation prediction, ignoring the effectiveness of entity pair information as an intermediate state for relation prediction. In this article, we decouple this task into a three-stage process to capture sufficient information for improving relation prediction.
Results
We propose an innovative framework HTGRS for Bio-DocRE, which constructs a hierarchical tree graph (HTG) to integrate key information sources in the document, achieving relation reasoning based on entity. In addition, inspired by the idea of semantic segmentation, we conceptualize the task as a table-filling problem and develop a relation segmentation (RS) module to enhance relation reasoning based on the entity pair. Extensive experiments on three datasets show that the proposed framework outperforms the state-of-the-art methods and achieves superior performance.
Availability and implementation
Our source code is available at https://github.com/passengeryjy/HTGRS.
Funder
Natural Science Foundation of China
Publisher
Oxford University Press (OUP)