A real-time arbitrary-shape text detector-Reference-Cited by-同舟云学术

A real-time arbitrary-shape text detector

Published:2024-04-16 Issue:4 Volume:19 Page:e0302234
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Lu Manhuai,Li Langlang,Chen Chin-Ling^ORCID

Abstract

It is challenging to detect arbitrary-shape text accurately and effectively in natural scenes. While many methods have been implemented for arbitrary-shape text detection, most cannot achieve real-time detection or meet practical needs. In this work, we propose a YOLOv6-based detector that can effectively implement arbitrary-shape text detection and achieve real-time detection. We include two additional branches in the neck part of the YOLOv6 network to adapt the network to text detection, and the output side uses the pixel aggregation (PA) algorithm to decouple the PA output to use it as the detection head of the model. Experiments on benchmark Total-Text, CTW1500, ICDAR2015, and MSRA-TD500 showed that the proposed method outperformed competing methods in terms of detection accuracy and running time. Specifically, our method achieved an F-measure of 84.1% at 291.8 FPS for 640 × 640 Total-Text images and an F-measure of 81.5% at 199.6 FPS for 896 × 896 ICDAR2015 incidental text images.

Funder

Major Program of National Fund of Philosophy and Social Science of China

Publisher

Public Library of Science (PLoS)

Reference63 articles.

1. Kang C, Kim G, Yoo SI. Detection and recognition of text embedded in online images via neural context models. In: Thirty-First AAAI Conference on Artificial Intelligence;.

2. Xiong B, Grauman K. Text detection in stores using a repetition prior. In: 2016 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE;. p. 1–9.

3. Scene text recognition in mobile applications by character descriptor and structure configuration;C Yi;IEEE Trans Image Process,2014

4. Text detection and recognition in imagery: A survey;Q Ye;IEEE transactions on pattern analysis and machine intelligence,2014

5. Aster: An attentional scene text recognizer with flexible rectification;B Shi;IEEE transactions on pattern analysis and machine intelligence,2018