Author:
Jiang Ming,Huang Tao,Zhang Min,Tang Jingfan,Liu Zhiyong
Abstract
Various methods have been proposed recently to solve the problem of weak domain adaptability of Chinese word segmentation (CWS) models based on neural networks. However, although some of these improved models achieve high segmentation accuracy in a specific domain, they need to be retrained when applied to another. After rethinking the domain adaptability, two criteria, including the segmentation accuracy and the universality, are suggested for measuring it. Taking the above two criteria into consideration, an improved neural-based CWS model is proposed, which incorporates the common lexicon and unlabeled data into BERT. To make the most use of lexicon, a new method is proposed to construct the lexicon-based feature vector. In addition, the domain-specific words can be effectively extracted by pre-training a language model on the unlabeled data. Finally, a GRU-like gate structure is used to integrate the lexicon-based feature vector and language model into BERT. Experiments on five different domains reveal that the domain adaptability of this model is very strong.
Subject
Computational Mathematics,Computer Science Applications,General Engineering
Reference26 articles.
1. Deep learning for Chines word segmentation and POS tagging;Zheng;Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing,2013
2. Long short-term memory neural networks for Chinese word segmentation;Chen;Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing,2015
3. Bi-directional LSTM recurrent neural network for Chinese word segmentation;Yao;International Conference on Neural Information Processing. Springer, Cham,2016
4. Unsupervised domain adaptation for joint segmentation and POS-tagging;Liu;Proceedings of COLING 2012: Posters,2012
5. Y. Zhang, J. Xu, G. Miao et al., Addressing domain adaptation for Chinese word segmentation with instances-based transfer learning, Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. Springer, Cham. 2018, pp. 24–36.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献