Author:
Chen Ping,Wu Fei,Wang Tong,Ding Wei
Abstract
Many Natural Language Processing and Computational Linguistics applications involve the generation of new texts based on some existing texts, such as summarization, text simplification and machine translation. However, there has been a serious problem haunting these applications for decades, that is, how to automatically and accurately assess quality of these applications. In this paper, we will present some preliminary results on one especially useful and challenging problem in NLP system evaluation---how to pinpoint content differences of two text passages (especially for large passages such as articles and books). Our idea is intuitive and very different from existing approaches. We treat one text passage as a small knowledge base, and ask it a large number of questions to exhaustively identify all content points in it. By comparing the correctly answered questions from two text passages, we will be able to compare their content precisely. The experiment using 2007 DUC summarization corpus clearly shows promising results.
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Analysis of Extractive Text Summarisation Techniques for Multilingual Texts;2023 International Conference on Advances in Computing, Communication and Applied Informatics (ACCAI);2023-05-25
2. LowEST: a low resource semantic text summarization method for big data;Innovations in Systems and Software Engineering;2022-12-09
3. Interdisciplinarity in Cognitive Science: A Document Similarity Analysis;Cognitive Science;2022-12
4. WIDAR - Weighted Input Document Augmented ROUGE;Lecture Notes in Computer Science;2022
5. Automatic Text Summarization in Natural Language Processing;2021 IEEE International Conference on Mobile Networks and Wireless Communications (ICMNWC);2021-12-03