Affiliation:
1. SRM Institute of Science and Technology
Abstract
With the explosion of unstructured textual data circulating the digital space in present times, there has been an increase in the necessity of developing tools that can perform automatic text summarization to allow people to get insights from them easily and extract significant and essential data using Automatic Text Summarizers. The readability of documents can be improved and the time spent on researching for information can be improved by the implementation of text summarization tools. In this project, extractive summarization will be performed on text recognized from scanned documents via Optical Character Recognition (OCR), using the TextRank algorithm which is an unsupervised text summarization technique for performing extractive text summarization.
Publisher
Trans Tech Publications Ltd
Reference10 articles.
1. ÁNGEL HERNANDEZ-CASTANEDA, RENE ARNULFO GARCIA-HERNANDEZ, YULIA LEDENEVA, CHRISTIAN EDUARDO MILLAN- HERNANDEZ, Language-independent extractive automatic text summarization based on automatic keyword extraction, In Computer Speech & Language, Volume 71, January 2022, 101267, Elsevier.
2. ANSHUL ARORA, RAJAT SINGH, ASHIQ EQBAL, ANKIT MANGAL, PROF. S. U. SOUJI, Extraction and Detection of Text From Images,, In International Journal of Research in Engineering and Technology Vol. 8, August (2021).
3. MINGXI ZHANG, XUEMIN LI, SHUIBO YUE, AND LIUQIAN YANG, An Empirical Study of TextRank for Keyword Extraction,, In IEEE Access(2020).
4. M. F. MRIDHA, AKLIMA AKTER LIMA, KAMRUDDIN NUR, SUJOY CHANDRA DAS, MAHMUD HASAN, AND MUHAMMAD MOHSIN KABIR, A survey of Automatic Text Summarization: Progress, Process and Challenges,, In IEEE Access November 22, (2021).
5. JINYUAN ZHAO, YANNA WANG, BAIHUA XIAO, CUNZHAO SHI, FUXI JIA, AND CHUNHENG WANG, DetectGAN: GAN-based text detector for camera-captured document Images, In International Journal on Document Analysis and Recognition (IJDAR), Springer (2020).