Affiliation:
1. School of Business, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China
2. School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China
Abstract
Discourse coherence is strongly associated with text quality, making it important to natural language generation and understanding. However, existing coherence models focus on measuring individual aspects of coherence, such as lexical overlap, entity centralization, rhetorical structure, etc., lacking measurement of the semantics of text. In this paper, we propose a discourse coherence analysis method combining sentence embedding and the dimension grid, we obtain sentence-level vector representation by deep learning, and we introduce a coherence model that captures the fine-grained semantic transitions in text. Our work is based on the hypothesis that each dimension in the embedding vector is exactly assigned a stated certainty and specific semantic. We take every dimension as an equal grid and compute its transition probabilities. The document feature vector is also enriched to model the coherence. Finally, the experimental results demonstrate that our method achieves excellent performance on two coherence-related tasks.
Funder
National Natural Science Foundation of China
Subject
Multidisciplinary,General Computer Science
Reference29 articles.
1. Evaluation of text coherence for electronic essay scoring systems
2. Using entity-based features to model coherence in student essays;J. Burstein
3. Expressing an image stream with a sequence of natural sentences;C. Park
4. Globally coherent text generation with neural checklist models;C. Kiddon
5. Modeling local coherence: an entity-based approach;R. Barzilay;Computational Linguistics,2015
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献