Multi-Document Summarization by Extended Graph Text Representation and Importance Refinement-Reference-Cited by-同舟云学术

Multi-Document Summarization by Extended Graph Text Representation and Importance Refinement

Published:2014 Issue: Volume: Page:28-53
ISSN:2327-1981
Container-title:Advances in Data Mining and Database Management
language:
Short-container-title:

Author:

Mirchev Uri¹,Last Mark¹

Affiliation:

1. Ben Gurion University of the Negev, Israel

Abstract

Automatic multi-document summarization is aimed at recognizing important text content in a collection of topic-related documents and representing it in the form of a short abstract or extract. This chapter presents a novel approach to the multi-document summarization problem, focusing on the generic summarization task. The proposed SentRel (Sentence Relations) multi-document summarization algorithm assigns importance scores to documents and sentences in a collection based on two aspects: static and dynamic. In the static aspect, the significance score is recursively inferred from a novel, tripartite graph representation of the text corpus. In the dynamic aspect, the significance score is continuously refined with respect to the current summary content. The resulting summary is generated in the form of complete sentences exactly as they appear in the summarized documents, ensuring the summary's grammatical correctness. The proposed algorithm is evaluated on the TAC 2011 dataset using DUC 2001 for training and DUC 2004 for parameter tuning. The SentRel ROUGE-1 and ROUGE-2 scores are comparable to state-of-the-art summarization systems, which require a different set of textual entities.

Publisher

IGI Global

Reference17 articles.

1. LexRank: Graph-based lexical centrality as salience in text summarization.;G.Erkan;Journal of Artificial Intelligence Research,2004

2. Giannakopoulos, G., El-Haj, M., Favre, B., Litvak, M., Steinberger, J., & Varma, V. (2011). TAC 2011 multiling pilot overview. In Proceedings of Text Analysis Conference (TAC-2011). National Institute of Standards and Technology.

3. Kleinberg, J. (1998). Authoritative sources in a hyperlinked environment. In Proceedings of the Ninth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '98). ACM.

4. Lin, C. (2004). ROUGE: A package for automatic evaluation of summaries. In Proceedings of the ACL-04 Workshop (pp. 74-81). Association for Computational Linguistics.

5. Lin, H., & Bilmes, J. (2010). Multi-document summarization via budgeted maximization of submodular functions. In Proceedings of the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (HLT '10) (pp. 912-920). Stroudsburg, PA: Association for Computational Linguistics.

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An abstractive summary generation system for customer reviews and news article using deep learning;Journal of Ambient Intelligence and Humanized Computing;2020-08-03

2. Multi-document extractive summarization using semantic graph;PROCES LENG NAT;2019

3. Social Data Sentiment Analysis of a Multilingual Dataset: A Case Study with Malayalam and English;Communications in Computer and Information Science;2019

4. Integration of Different Analytical Concepts on Multimedia Contents in Service of Intelligent Knowledge Extraction;Intelligent Analysis of Multimedia Information;2017

5. Erratum to: Multilingual Sentiment Analysis: State of the Art and Independent Comparison of Techniques;Cognitive Computation;2016-07-30