Affiliation:
1. Wuhan University of Science and Technology, Wuhan, Hubei, China
Abstract
By leveraging self-supervised tasks,
pre-trained language model (PLM)
has made significant progress in the field of
machine reading comprehension (MRC)
. However, in
classical Chinese MRC (CCMRC)
, the passage is typically in classical style, but the question and options are given in modern style. Existing pre-trained methods seldom model the relationship between classical and modern styles, resulting in overall misunderstanding of the passage. In this paper, we propose a contrastive learning method between classical and modern Chinese in order to reach a deep understanding of the two different styles. In particular, a novel pre-training task and an enhanced co-matching network have been defined: (1) The
synonym discrimination (SD)
task is used to identify whether modern meaning corresponds to classical Chinese. (2) The
enhanced dual co-matching (EDCM)
network is employed for a more interactive understanding of the classical passage and the modern options. The experimental results show that our proposed method improves language understanding ability and outperforms existing PLMs on the Haihua, CCLUE, and ChID datasets.
Funder
Major Projects of National Social Science Foundation of China
Guizhou Provincial Science and Technology Projects
Publisher
Association for Computing Machinery (ACM)
Reference46 articles.
1. Kevin Clark, Minh-Thang Luong, Quoc V. Le, and Christopher D. Manning. 2020. ELECTRA: Pre-training text encoders as discriminators rather than generators. In Proceedings of the 8th International Conference on Learning Representations (ICLR’20). https://openreview.net/forum?id=r1xMH1BtvB.
2. Pre-Training With Whole Word Masking for Chinese BERT
3. Interactive Gated Decoder for Machine Reading Comprehension
4. A Span-Extraction Dataset for Chinese Machine Reading Comprehension
5. Hendrik Dahlkamp, Adrian Kaehler, David Stavens, Sebastian Thrun, and Gary R. Bradski. 2006. Self-supervised monocular road detection in desert terrain. In Robotics: Science and Systems, Vol. 38.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献