Affiliation:
1. Computer Science and Engineering Department, Jadavpur University, Kolkata, India
Abstract
With the rapid growth of the World Wide Web, information overload is becoming a problem for an increasingly large number of people. Since summarization helps human to digest the main contents of a text document very rapidly, there is a need for an effective and powerful tool that can automatically summarize text. In this paper, we present a keyphrase based approach to single document summarization that extracts first a set of keyphrases from a document, use the extracted keyphrases to choose sentences from the document and finally form an extractive summary with the chosen sentences. We view keyphrases (single or multi-word) as the important concepts and we assume that an extractive summary of a document is an elaboration of the important concepts contained in the document to some permissible extent and it is controlled by the given summary length. We have tested our proposed keyphrase-based summarization approach on two different datasets: one for English and another for Bengali. The experimental results show that the performance of the proposed system is comparable to some state-of-the art summarization systems.
Cited by
15 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Analyzing the Effects of Transcription Errors on Summary Generation of Bengali Spoken Documents;ACM Transactions on Asian and Low-Resource Language Information Processing;2024-08-16
2. Bangla Text Summarization Analysis Using Machine Learning: An Extractive Approach;Lecture Notes in Mechanical Engineering;2024
3. Intelligent Combination of Approaches Towards Improved Bangla Text Summarization;2023 International Conference on Applied Intelligence and Sustainable Computing (ICAISC);2023-06-16
4. Enhanced Model for Automatic Tamil Text Summarization;2023 International Conference on Sustainable Computing and Data Communication Systems (ICSCDS);2023-03-23
5. Predicting Word Importance Using a Support Vector Regression Model for Multi-document Text Summarization;Advances in Data-Driven Computing and Intelligent Systems;2023