SEE: Towards Semi-Supervised End-to-End Scene Text Recognition-Reference-Cited by-同舟云学术

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition

Published:2018-04-27 Issue:1 Volume:32 Page:
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Bartz Christian,Yang Haojin,Meinel Christoph

Abstract

Detecting and recognizing text in natural scene images is a challenging, yet not completely solved task. In recent years several new systems that try to solve at least one of the two sub-tasks (text detection and text recognition) have been proposed. In this paper we present SEE, a step towards semi-supervised neural networks for scene text detection and recognition, that can be optimized end-to-end. Most existing works consist of multiple deep neural networks and several pre-processing steps. In contrast to this, we propose to use a single deep neural network, that learns to detect and recognize text from natural images, in a semi-supervised way. SEE is a network that integrates and jointly learns a spatial transformer network, which can learn to detect text regions in an image, and a text recognition network that takes the identified text regions and recognizes their textual content. We introduce the idea behind our novel approach and show its feasibility, by performing a range of experiments on standard benchmark datasets, where we achieve competitive results.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A novel approach for improving open scene text translation with modified GAN;The Visual Computer;2024-04-13

2. A person re‐identification method for sports event scenes incorporating textual information mining;IET Image Processing;2024-03-11

3. Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03

4. An Attention-Based Convolutional Recurrent Neural Networks for Scene Text Recognition;IEEE Access;2024

5. Synthetic Data Generation for Text Spotting on Printed Circuit Board Component Images;IEEE Access;2024