MABERT: Mask-Attention-Based BERT for Chinese Event Extraction-Reference-Cited by-同舟云学术

MABERT: Mask-Attention-Based BERT for Chinese Event Extraction

Published:2023-07-20 Issue:7 Volume:22 Page:1-21
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Ding Ling¹^ORCID,Chen Xiaojun¹^ORCID,Wei Jian²^ORCID,Xiang Yang¹^ORCID

Affiliation:

1. Tongji University

2. Zhejiang University

Abstract

Event extraction is an essential but challenging task in information extraction. This task has considerably benefited from pre-trained language models, such as BERT. However, when it comes to the trigger-word mismatch problem in languages without natural delimiters, existing methods ignore the complement of lexical information to BERT. In addition, the inherent multi-role noise problem could limit the performance of methods when one sentence contains multiple events. In this article, we propose a Mask-Attention-based BERT (MABERT) framework for Chinese event extraction to address the above problems. Firstly, in order to avoid trigger-word mismatch and integrate lexical features into BERT layers directly, a mask-attention-based transformer augmented with two mask matrices is devised to replace the original one in BERT. By the mask-attention-based transformer, the character sequence interacts with external lexical semantics sufficiently and keeps its structure information at the same time. Moreover, against the multi-role noise problem, we make use of event type information from representation and classification, two aspects to enrich entity features, where type markers and event-schema-based mask matrix are proposed. Experimental results on the widely used ACE2005 dataset show the effectiveness of our proposed MABERT on Chinese event extraction task compared with other state-of-the-art methods.

Funder

National Natural Science Foundation of China

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3597455

Reference41 articles.

1. Chen Chen and Vincent Ng. 2012. Joint modeling for Chinese event extraction with rich linguistic features. In Proceedings of COLING 2012. 529–544.

2. Event Extraction via Dynamic Multi-Pooling Convolutional Neural Networks

3. Pre-Training With Whole Word Masking for Chinese BERT

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Causal Knowledge Integrated with Attention for Interpretable Event Detection;2023 9th International Conference on Big Data and Information Analytics (BigDIA);2023-12-15

2. Modeling Character–Word Interaction via a Novel Mesh Transformer for Chinese Event Detection;Neural Processing Letters;2023-09-11