Affiliation:
1. University College of Engineering and Technology, Bikaner, India
2. Netaji Subhas University of Technology, New Delhi, India
Abstract
Tables are commonly used for effective and compact representation of relational information across the data in diverse document classes like scientific papers, financial statements, newspaper articles, invoices, or product descriptions. However, table structure detection is a relatively simple process for humans, but recognizing precise table structure is still a computer vision challenge. Further, innumerable possible table layouts increase the risk of automatic topic modeling and understanding the capability of each table from the generic document. This paper develops the framework to recognize the table structure from the Compound Document Image(CDI). Initially, the bilateral filter is designed for image transformation, enhancing CDI quality. An improved binarization-Sauvola algorithm (IBSA) is proposed to degrade the tables with uneven illumination, low contrast, and uniform background. The morphological Thinning method extracts the line from the table. The masking approach extracts the row and column from the table. Finally, the ResNet Attention model optimized over Black Widow optimization-based mutual exclusion (BWME) is developed to recognize the table structure from the document images. The UNLV, TableBank, and ICDAR-2013 table competition datasets are used to evaluate the proposed framework’s performance. Precision and accuracy are the metrics considered for evaluating the proposed framework performance. From the experimental results, the proposed framework achieved a precision value of 96.62 and the accuracy value of 94.34, which shows the effectiveness of the proposed approach’s performance.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference12 articles.
1. Table detection using deep learning;Gilani;IEEE,2017
2. Robust table recognition for printed document images;Liang;Mathematical Biosciences and Engineering,2020
3. Quantum machine learning;Biamonte;Nature,2017
4. Split, embed and merge: An accurate table structure recognizer;Zhang;Pattern Recognition,2022
5. Image driven machine learning methods for microstructure recognition;Chowdhury;Computational Materials Science,2016