MWI-Sum

Published:2015-10 Issue:1 Volume:34 Page:1-35
ISSN:1046-8188
Container-title:ACM Transactions on Information Systems
language:en
Short-container-title:ACM Trans. Inf. Syst.

Author:

Baralis Elena¹, Cagliero Luca¹, Fiori Alessandro², Garza Paolo¹

Affiliation:

1. Politecnico di Torino, Torino (Italy)

2. IRCC: Institute for Cancer Research at Candiolo, Strada Provinciale, Candiolo (Italy)

Abstract

Multidocument summarization addresses the selection of a compact subset of highly informative sentences, i.e., the summary, from a collection of textual documents. To perform sentence selection, two parallel strategies have been proposed: (a) apply general-purpose techniques relying on data mining or information retrieval techniques, and/or (b) perform advanced linguistic analysis relying on semantics-based models (e.g., ontologies) to capture the actual sentence meaning. Since there is an increasing need for processing documents written in different languages, the attention of the research community has recently focused on summarizers based on strategy (a). This article presents a novel multilingual summarizer, namely MWI-Sum (Multilingual Weighted Itemset-based Summarizer), that exploits an itemset-based model to summarize collections of documents ranging over the same topic. Unlike previous approaches, it extracts frequent weighted itemsets tailored to the analyzed collection and uses them to drive the sentence selection process. Weighted itemsets represent correlations among multiple highly relevant terms that are neglected by previous approaches. The proposed approach makes minimal use of language-dependent analyses. Thus, it is easily applicable to document collections written in different languages. Experiments performed on benchmark and real-life collections, English-written and not, demonstrate that the proposed approach performs better than state-of-the-art multilingual document summarizers.
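The idea in the abstract, mining frequent weighted itemsets and using them to drive sentence selection, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and not the authors' algorithm: each sentence is modelled as a dictionary of term weights in [0, 1], the weighted support of an itemset is taken as the average over all sentences of the minimum weight of its terms (zero when a term is missing), and sentences are picked greedily by the support of the itemsets they newly cover. The names `weighted_itemsets` and `select_sentences` are hypothetical.

```python
from itertools import combinations


def weighted_itemsets(sentences, min_support, max_size=2):
    """Return {itemset: weighted support} for itemsets up to max_size terms.

    Assumed definition: a sentence contributes the minimum weight of the
    itemset's terms if it contains all of them, and 0 otherwise; the support
    is the average contribution over all sentences.
    """
    n = len(sentences)
    terms = sorted({term for sentence in sentences for term in sentence})
    result = {}
    for size in range(1, max_size + 1):
        # Brute-force enumeration; fine for a toy example only.
        for itemset in combinations(terms, size):
            total = sum(
                min(sentence[term] for term in itemset)
                for sentence in sentences
                if all(term in sentence for term in itemset)
            )
            support = total / n
            if support >= min_support:
                result[frozenset(itemset)] = support
    return result


def select_sentences(sentences, itemsets, max_sentences):
    """Greedily pick sentence indices covering the most itemset support."""
    uncovered = set(itemsets)
    chosen = []
    while uncovered and len(chosen) < max_sentences:
        best, best_gain, best_cover = None, 0.0, set()
        for index, sentence in enumerate(sentences):
            if index in chosen:
                continue
            cover = {s for s in uncovered if s.issubset(sentence)}
            gain = sum(itemsets[s] for s in cover)
            if gain > best_gain:  # ties keep the earlier sentence
                best, best_gain, best_cover = index, gain, cover
        if best is None:
            break
        chosen.append(best)
        uncovered -= best_cover
    return chosen


# Toy collection: three sentences with made-up term weights.
sentences = [
    {"summarizer": 1.0, "itemset": 0.8},
    {"itemset": 0.6, "language": 0.9},
    {"summarizer": 0.5, "language": 0.2},
]
itemsets = weighted_itemsets(sentences, min_support=0.25)
summary = select_sentences(sentences, itemsets, max_sentences=2)
```

In this toy run, the pair {itemset, summarizer} passes the threshold while {itemset, language} does not, so the first sentence is preferred because it covers a correlated pair of highly weighted terms rather than isolated terms. Tokenization is the only language-dependent step the sketch would need.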

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications; General Business, Management and Accounting; Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/2809786

References: 69 articles.

1. Rhetorics-based multi-document summarization

2. Generation and Evaluation of Summaries of Academic Teaching Materials

3. Multi-document summarization exploiting frequent itemsets

4. Multi-document summarization based on the Yago ontology

5. GraphSum: Discovering correlations among multiple terms for graph-based summarization

Cited by 37 articles.

1. Accelerating large-scale weighted similarity queries based on external storage;Information Systems;2023-07

2. Enhancing metaheuristic based extractive text summarization with fuzzy logic;Neural Computing and Applications;2023-02-02

3. Extractive single-document summarization using adaptive binary constrained multi-objective differential evaluation;Innovations in Systems and Software Engineering;2022-08-09

4. Frequent item-set mining and clustering based ranked biomedical text summarization;The Journal of Supercomputing;2022-07-04

5. A Review of the Trends and Challenges in Adopting Natural Language Processing Methods for Education Feedback Analysis;IEEE Access;2022