Toward Semi-Supervised Graphical Object Detection in Document Images

Published:2022-06-08 Issue:6 Volume:14 Page:176
ISSN:1999-5903
Container-title:Future Internet
language:en
Short-container-title:Future Internet

Author:

Kallempudi Goutham, Hashmi Khurram Azeem, Pagani Alain, Liwicki Marcus, Stricker Didier, Afzal Muhammad Zeshan

Abstract

Graphical page object detection classifies and localizes objects such as Tables and Figures in a document. As deep learning techniques for object detection become increasingly successful, many supervised deep neural network-based methods have been introduced to recognize graphical objects in documents. However, these models require a substantial amount of labeled data for training. To address this limitation, this paper presents an end-to-end semi-supervised framework for graphical object detection in scanned document images. Our method is based on the recently proposed Soft Teacher mechanism and examines the effect of small percentages of labeled data on the classification and localization of graphical objects. On both the PubLayNet and the IIIT-AR-13K datasets, the proposed approach outperforms the supervised models by a significant margin at all labeling ratios (1%, 5%, and 10%). Furthermore, the 10% PubLayNet Soft Teacher model improves the average precision of Table, Figure, and List by +5.4, +1.2, and +3.2 points, respectively, while achieving a total mAP similar to that of the Faster-RCNN baseline. Moreover, our model trained on 10% of the IIIT-AR-13K labeled data beats the previous fully supervised method by +4.5 points.

Publisher

MDPI AG

Subject

Computer Networks and Communications

Link

https://www.mdpi.com/1999-5903/14/6/176/pdf

Cited by 5 articles.

1. Robust page object detection network for heterogeneous document images;International Journal on Document Analysis and Recognition (IJDAR);2024-08-16

2. Towards End-to-End Semi-supervised Table Detection with Semantic Aligned Matching Transformer;Lecture Notes in Computer Science;2024

3. A Hybrid Approach for Document Layout Analysis in Document Images;Lecture Notes in Computer Science;2024

4. The YOLO model that still excels in document layout analysis;Signal, Image and Video Processing;2023-11-19

5. Towards End-to-End Semi-Supervised Table Detection with Deformable Transformer;Lecture Notes in Computer Science;2023