A Comparative Analysis on Hindi and English Extractive Text Summarization

Published:2019-07-24 Issue:3 Volume:18 Page:1-39
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Verma Pradeepika¹, Pal Sukomal², Om Hari¹

Affiliation:

1. Indian Institute of Technology (Indian School of Mines) Dhanbad, Dhanbad, India

2. Indian Institute of Technology (Banaras Hindu University) Varanasi, Varanasi, India

Abstract

Text summarization is the process of condensing a large body of document information into a clear and concise form. In this article, we present a detailed comparative study of various extractive methods for automatic text summarization on Hindi and English text datasets of news articles. We consider 13 different summarization techniques, namely TextRank, LexRank, Luhn, LSA, Edmundson, ChunkRank, TGraph, UniRank, NN-ED, NN-SE, FE-SE, SummaRuNNer, and MMR-SE, and we evaluate their performance using metrics such as precision, recall, F1, cohesion, non-redundancy, readability, and significance. A thorough analysis is carried out in eight parts that exhibit the strengths and limitations of these methods, the effect of summary length on performance, the impact of the language of a document, and other factors. A standard summary evaluation tool (ROUGE) and extensive programmatic evaluation using Python 3.5 in an Anaconda environment are used to evaluate their outcomes.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3308754

Reference: 49 articles.

1. The Automatic Creation of Literature Abstracts

2. A survey on automatic text summarization;Das Dipanjan;Lit. Survey Lang. Stat.,2007

3. Text summarization with harmony search algorithm-based sentence extraction

4. Generating Coherent Summaries of Scientific Articles Using Coherence Patterns

Cited by 38 articles.

1. Multimodal sentiment analysis of english and hinglish memes;Multimedia Tools and Applications;2024-06-20

2. Exploring Text Summarization Techniques: A Review of Current Challenges and Future Directions;2024 2nd International Conference on Disruptive Technologies (ICDT);2024-03-15

3. Emotional and Mental Nuances and Technological Approaches: Optimising Fact-Check Dissemination through Cognitive Reinforcement Technique;Electronics;2024-01-04

4. Analysis and Performance of Text Summarization Tools Applied on Indian Languages;Lecture Notes in Electrical Engineering;2024

5. Text Summarization Techniques for the Bengali Language: Survey;Lecture Notes in Electrical Engineering;2024