Author:
Bian Xuewei,Wang Chaoqun,Quan Weize,Ye Juntao,Zhang Xiaopeng,Yan Dong-Ming
Abstract
AbstractRecent learning-based approaches show promising performance improvement for the scene text removal task but usually leave several remnants of text and provide visually unpleasant results. In this work, a novel end-to-end framework is proposed based on accurate text stroke detection. Specifically, the text removal problem is decoupled into text stroke detection and stroke removal; we design separate networks to solve these two subproblems, the latter being a generative network. These two networks are combined as a processing unit, which is cascaded to obtain our final model for text removal. Experimental results demonstrate that the proposed method substantially outperforms the state-of-the-art for locating and erasing scene text. A new large-scale real-world dataset with 12,120 images has been constructed and is being made available to facilitate research, as current publicly available datasets are mainly synthetic so cannot properly measure the performance of different methods.
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition
Reference39 articles.
1. Wu, L.; Zhang, C. Q.; Liu, J. M.; Han, J. Y.; Liu, J. T.; Ding, E. R.; Bai, X. Editing text in the wild. In: Proceedings of the 27th ACM International Conference on Multimedia, 1500–1508, 2019.
2. Khodadadi, M.; Behrad, A. Text localization, extraction and inpainting in color images. In: Proceedings of the 20th Iranian Conference on Electrical Engineering, 1035–1040, 2012.
3. Modha, U.; Dave, P. Image inpainting-automatic detection and removal of text from images. International Journal of Engineering Research and Applications Vol. 2, No. 2, 930–932, 2012.
4. Wagh, P. D.; Patil, D. R. Text detection and removal from image using inpainting with smoothing. In: Proceedings of the International Conference on Pervasive Computing, 1–4, 2015.
5. Lecture Notes in Computer Science;J Johnson,2016
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Deep Learning-Based Image and Video Inpainting: A Survey;International Journal of Computer Vision;2024-01-19
2. On Manipulating Scene Text in the Wild with Diffusion Models;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03
3. Scene Text Segmentation with Text-Focused Transformers;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26
4. Data Recognition for Multi-Source Heterogeneous Experimental Detection in Cloud Edge Collaboratives;International Journal of Information Technologies and Systems Approach;2023-09-26
5. FETNet: Feature erasing and transferring network for scene text removal;Pattern Recognition;2023-08