PCBSNet: A Pure Convolutional Bilateral Segmentation Network for Real-Time Natural Scene Text Detection

Published: 2023-07-12 Volume: 12 Issue: 14 Page: 3055
ISSN: 2079-9292
Container-title: Electronics
Language: en
Short-container-title: Electronics

Author:

Lian Zhe¹, Yin Yanjun¹, Zhi Min¹, Xu Qiaozhi¹

Affiliation:

1. College of Computer Science and Technology, Inner Mongolia Normal University, Hohhot 010022, China

Abstract

Scene text detection is a fundamental task in image processing with broad application value. Segmentation-based methods offer excellent post-processing algorithms, but their feature processing is time-consuming. Real-time semantic segmentation methods use lightweight backbone networks for feature extraction and aggregation but lack effective post-processing. Pure convolutional networks improve model performance by redesigning key components. Combining the advantages of these three types of methods, we propose a Pure Convolutional Bilateral Segmentation Network (PCBSNet) for real-time natural scene text detection. First, we construct a bilateral feature extraction backbone network to significantly improve detection speed: the low-level detail extraction branch captures spatial information, while the efficient semantic extraction branch accurately captures semantic features through a series of micro designs. Second, we build an efficient attention aggregation module to guide the adaptive aggregation of features from the two branches; the fused feature map then undergoes feature enhancement to obtain a more accurate and reliable representation. Finally, we use differentiable binarization post-processing to construct text instance boundaries. To evaluate the proposed model, we compared it with mainstream lightweight models on three datasets: ICDAR2015, MSRA-TD500, and CTW1500. The F-measures were 82.9%, 82.8%, and 78.9%, and the inference speeds were 59.1, 94.3, and 75.5 frames per second (FPS), respectively. We also conducted extensive ablation experiments on ICDAR2015 to validate the proposed improvements. The results indicate that the model significantly improves inference speed while enhancing accuracy and is competitive with other advanced detection methods. However, its detection performance on curved text still needs improvement.
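
The abstract names differentiable binarization but does not spell it out. The sketch below illustrates the idea, assuming the standard formulation from the DB text detector, B = 1 / (1 + exp(-k (P - T))), where P is a probability map, T is a learned threshold map, and k is an amplification factor (50 in the original DB formulation). The function name, the toy maps, and the 0.5 cut-off are hypothetical choices for illustration. Whether PCBSNet uses exactly these settings is not stated here.

```python
import math

def differentiable_binarization(prob_map, thresh_map, k=50.0):
    """Approximate binary map: B = 1 / (1 + exp(-k * (P - T))).

    prob_map and thresh_map are equally sized 2D lists of floats in [0, 1].
    k is the amplification factor (assumed 50, as in the DB formulation).
    """
    return [
        [1.0 / (1.0 + math.exp(-k * (p - t))) for p, t in zip(p_row, t_row)]
        for p_row, t_row in zip(prob_map, thresh_map)
    ]

# Toy 2x3 maps with made-up values (not taken from the paper).
prob = [[0.9, 0.5, 0.1], [0.8, 0.35, 0.2]]
thresh = [[0.3, 0.5, 0.3], [0.3, 0.3, 0.3]]

approx = differentiable_binarization(prob, thresh)

# Hard mask obtained by cutting the soft map at 0.5 (illustrative cut-off).
hard = [[1 if v >= 0.5 else 0 for v in row] for row in approx]
print(hard)
```

Because the steep sigmoid replaces a hard step function, the threshold map can be learned jointly with the probability map during training. Text instance boundaries are then typically traced from the connected regions of the resulting mask.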

Funder

Natural Science Foundation of Inner Mongolia

Research Science Institute of Colleges and Universities in Inner Mongolia

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/14/3055/pdf

Cited by 4 articles.

1. Differential Analysis of Modern Text Spotting Methods: A Systematic Review;International Journal of Scientific Research in Computer Science, Engineering and Information Technology;2024-09-05

2. (HTBNet) Arbitrary Shape Scene Text Detection with Binarization of Hyperbolic Tangent and Cross-Entropy;Entropy;2024-06-29

3. Refinement Correction Network for Scene Text Detection;Lecture Notes in Computer Science;2024

4. A Survey: Feature Fusion Method for Object Detection Field;Lecture Notes in Computer Science;2024