Author:
ANDO RIE, BOGURAEV BRANIMIR, BYRD ROY, NEFF MARY
Abstract
This paper describes a novel approach to multi-document summarization, which explicitly addresses the problem of detecting, and retaining for the summary, multiple themes in document collections. We place equal emphasis on the processes of theme identification and theme presentation. For the former, we apply Iterative Residual Rescaling (IRR); for the latter, we argue for graphical display elements. IRR is an algorithm designed to account for correlations between words and to construct a multi-dimensional topical space indicative of relationships among linguistic objects (documents, phrases, and sentences). Summaries are composed of objects with certain properties, derived by exploiting the many-to-many relationships in such a space. Given their inherent complexity, our multi-faceted summaries benefit from a visualization environment. We discuss some essential features of such an environment.
Publisher
Cambridge University Press (CUP)
Subject
Artificial Intelligence, Linguistics and Language, Language and Linguistics, Software
Cited by
5 articles.
1. Protein-Based Hydroxyapatite Materials: Tuning Composition toward Biomedical Applications;ACS Applied Bio Materials;2020-04-13
2. Automatic summarisation: 25 years On;Natural Language Engineering;2019-09-19
3. DClusterE;ACM Transactions on Intelligent Systems and Technology;2012-02
4. Text summarisation in progress: a literature review;Artificial Intelligence Review;2011-04-30
5. iDVS: An Interactive Multi-document Visual Summarization System;Machine Learning and Knowledge Discovery in Databases;2011