Adversarial Multi-task Learning for Efficient Chinese Named Entity Recognition-Reference-Cited by-同舟云学术

Adversarial Multi-task Learning for Efficient Chinese Named Entity Recognition

Published:2023-07-20 Issue:7 Volume:22 Page:1-19
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Yan Yibo¹^ORCID,Zhu Peng²^ORCID,Cheng Dawei¹^ORCID,Yang Fangzhou³^ORCID,Luo Yifeng²^ORCID

Affiliation:

1. Department of Computer Science and Technology, Tongji University, China

2. School of Data Science and Engineering, East China Normal University, China

3. Group of Artificial Intelligence and Big Data, Seek Data Inc., China

Abstract

Named entity recognition (NER) is a fundamental task for information extraction applications. NER is challenging because of semantic ambiguities in academic literature, especially for non-Latin languages. Besides word semantic information, recognizing Chinese named entities needs to consider word boundary information, as words contained in Chinese texts are not separated with spaces. Leveraging word boundary information could help to determine entity boundaries and thus improve entity recognition performance. In this article, we propose to combine word boundary information and semantic information for named entity recognition based on multi-task adversarial learning. Specifically, we learn commonly shared boundary information of entities from multiple kinds of tasks, including Chinese word segmentation (CWS), part-of-speech (POS) tagging, and entity recognition, with adversarial learning. We learn task-specific semantic information of words from these tasks and combine the learned boundary information with the semantic information to improve entity recognition with multi-task learning. We then propose a compression method based on improved clustering to accelerate the proposed model. We conduct extensive experiments on four public benchmark datasets and two private datasets, compared with state-of-the-art baseline models, and the experimental results demonstrate that our model achieves considerable performance improvements on various evaluation datasets.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3603626

Reference47 articles.

1. Enriching Word Vectors with Subword Information

2. Razvan C. Bunescu and Raymond J. Mooney. 2005. A shortest path dependency kernel for relation extraction. In HLT-EMNLP. 724–731.

3. Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, and Shengping Liu. 2018. Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In EMNLP. 182–192.

4. Wanxiang Che, Mengqiu Wang, Christopher D. Manning, and Ting Liu. 2013. Named entity recognition with bilingual constraints. In NAACL. 52–62.

5. Aitao Chen, Fuchun Peng, Roy Shan, and Gordon Sun. 2006. Chinese named entity recognition with conditional probabilistic models. In SIGHAN. 173–176.