CasTabDetectoRS: Cascade Network for Table Detection in Document Images with Recursive Feature Pyramid and Switchable Atrous Convolution-Reference-Cited by-同舟云学术

CasTabDetectoRS: Cascade Network for Table Detection in Document Images with Recursive Feature Pyramid and Switchable Atrous Convolution

Published:2021-10-16 Issue:10 Volume:7 Page:214
ISSN:2313-433X
Container-title:Journal of Imaging
language:en
Short-container-title:J. Imaging

Author:

Hashmi Khurram Azeem^ORCID,Pagani Alain,Liwicki Marcus^ORCID,Stricker Didier,Afzal Muhammad Zeshan^ORCID

Abstract

Table detection is a preliminary step in extracting reliable information from tables in scanned document images. We present CasTabDetectoRS, a novel end-to-end trainable table detection framework that operates on Cascade Mask R-CNN, including Recursive Feature Pyramid network and Switchable Atrous Convolution in the existing backbone architecture. By utilizing a comparativelyightweight backbone of ResNet-50, this paper demonstrates that superior results are attainable without relying on pre- and post-processing methods, heavier backbone networks (ResNet-101, ResNeXt-152), and memory-intensive deformable convolutions. We evaluate the proposed approach on five different publicly available table detection datasets. Our CasTabDetectoRS outperforms the previous state-of-the-art results on four datasets (ICDAR-19, TableBank, UNLV, and Marmot) and accomplishes comparable results on ICDAR-17 POD. Upon comparing with previous state-of-the-art results, we obtain a significant relative error reduction of 56.36%, 20%, 4.5%, and 3.5% on the datasets of ICDAR-19, TableBank, UNLV, and Marmot, respectively. Furthermore, this paper sets a new benchmark by performing exhaustive cross-datasets evaluations to exhibit the generalization capabilities of the proposed method.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition,Radiology, Nuclear Medicine and imaging

Link

https://www.mdpi.com/2313-433X/7/10/214/pdf

Reference74 articles.

1. A Survey of Graphical Page Object Detection with Deep Neural Networks

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Robust page object detection network for heterogeneous document images;International Journal on Document Analysis and Recognition (IJDAR);2024-08-16

2. End-to-end semi-supervised approach with modulated object queries for table detection in documents;International Journal on Document Analysis and Recognition (IJDAR);2024-07-10

3. From Detection to Application: Recent Advances in Understanding Scientific Tables and Figures;ACM Computing Surveys;2024-06-22

4. Improving table detection for document images using boundary;Complex & Intelligent Systems;2023-09-30

5. WEATHERGOV+;Proceedings of the ACM Symposium on Document Engineering 2023;2023-08-22