Keyword Detection Based on RetinaNet and Transfer Learning for Personal Information Protection in Document Images

Published:2021-10-13 Issue:20 Volume:11 Page:9528
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Lin Guo-Shiang, Tu Jia-Cheng, Lin Jen-Yung

Abstract

In this paper, a keyword detection scheme based on deep convolutional neural networks is proposed for personal information protection in document images. The scheme consists of two parts: key character detection and lexicon analysis. The first part, key character detection, is built on RetinaNet and transfer learning. RetinaNet, which is composed of convolutional layers, a feature pyramid network, and two subnets, is used to detect key characters within the region of interest in a document image. The second part, lexicon analysis, analyzes and combines the detected key characters to find the keywords. To train the RetinaNet model, synthetic image generation and data augmentation are used to produce a large image dataset. To evaluate the proposed scheme, many document images are selected for testing, and two performance measures are used: IoU (Intersection over Union) and mAP (mean Average Precision). Experimental results show that the mAP of the proposed scheme is 85.1% for key character detection and 85.84% for keyword detection. Furthermore, the proposed scheme outperforms the Tesseract OCR (Optical Character Recognition) software at detecting key characters in document images. The experimental results demonstrate that the proposed method can effectively localize and recognize keywords in noisy document images containing Mandarin Chinese words.
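As a rough illustration of two ideas named in the abstract, the Python sketch below computes IoU for two axis-aligned boxes and joins adjacent detected characters into lexicon keywords. It is not the authors' implementation: the box format `(x_min, y_min, x_max, y_max)`, the function names `iou` and `find_keywords`, the `max_gap` threshold, and the sample keyword 姓名 ("name") are all assumptions made for illustration, and the grouping step assumes the characters lie on a single horizontal text line.

```python
from typing import List, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over Union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def find_keywords(
    chars: List[Tuple[str, Box]],
    lexicon: List[str],
    max_gap: float = 0.5,
) -> List[Tuple[str, Box]]:
    """Hypothetical lexicon analysis for one horizontal text line.

    Consecutive detected characters that spell a lexicon word, and whose
    horizontal gaps are at most max_gap times the width of the preceding
    character, are merged into one keyword box.
    """
    ordered = sorted(chars, key=lambda c: c[1][0])  # left to right
    found = []
    for word in lexicon:
        n = len(word)
        for i in range(len(ordered) - n + 1):
            window = ordered[i:i + n]
            if "".join(ch for ch, _ in window) != word:
                continue
            close_enough = all(
                nxt[1][0] - cur[1][2] <= max_gap * (cur[1][2] - cur[1][0])
                for cur, nxt in zip(window, window[1:])
            )
            if close_enough:
                boxes = [box for _, box in window]
                merged = (
                    min(b[0] for b in boxes),
                    min(b[1] for b in boxes),
                    max(b[2] for b in boxes),
                    max(b[3] for b in boxes),
                )
                found.append((word, merged))
    return found


if __name__ == "__main__":
    detections = [
        ("姓", (0, 0, 10, 10)),
        ("名", (11, 0, 21, 10)),
        ("電", (100, 0, 110, 10)),
    ]
    print(iou((0, 0, 2, 2), (1, 1, 3, 3)))
    print(find_keywords(detections, ["姓名"]))
```

In an evaluation of the kind the abstract describes, a detection is typically counted as correct when its IoU with a ground-truth box exceeds a chosen threshold; the threshold used in the paper is not stated here.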

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/11/20/9528/pdf

Reference 38 articles.

1. A Document Image Retrieval System

2. Detecting region of interest for cadastral images in Taiwan

3. Vision-based patient identification recognition based on image content analysis and support vector machine for medical information system

Cited by 6 articles.

1. SemiDocSeg: harnessing semi-supervised learning for document layout analysis;International Journal on Document Analysis and Recognition (IJDAR);2024-06-04

2. GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation;Lecture Notes in Computer Science;2024

3. Deep Convolutional Neural Network with a Stochastic Gradient Descent Optimizer (PDCNN-SGD) model for Telugu character recognition;i-manager’s Journal on Image Processing;2023

4. SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation;Lecture Notes in Computer Science;2023

5. Assessment of Different Object Detectors for the Maturity Level Classification of Broccoli Crops Using UAV Imagery;Remote Sensing;2022-02-04