Author:
Suh Seungah,Lee Ghang,Gil Daeyoung,Kim Yonghan
Abstract
AbstractAutomated text recognition techniques have made significant advancements; however, certain tasks still present challenges. This study is motivated by the need to automatically recognize hand-marked text on construction defect tags among millions of photographs. To address this challenge, we investigated three methods for automating hand-marked semantic text recognition (HMSTR)—a modified scene text recognition-based (STR) approach, a two-step HMSTR approach, and a lumped approach. The STR approach involves locating marked text using an object detection model and recognizing it using a competition-winning STR model. Similarly, the two-step HMSTR approach first localizes the marked text and then recognizes the semantic text using an image classification model. By contrast, the lumped approach performs both localization and identification of marked semantic text in a single step using object detection. Among these approaches, the two-step HMSTR approach achieved the highest F1 score (0.92) for recognizing circled text, followed by the STR approach (0.87) and the lumped approach (0.78). To validate the generalizability of the two-step HMSTR approach, subsequent experiments were conducted using check-marked text, resulting in an F1 score of 0.88. Although the proposed methods have been tested specifically with tags, they can be extended to recognize marked text in reports or books.
Funder
National Research Foundation of Korea
Publisher
Springer Science and Business Media LLC
Reference46 articles.
1. Van Phan, T., Cong Nguyen, K. & Nakagawa, M. A Nom historical document recognition system for digital archiving. Int. J. Doc. Anal. Recognit. 19, 49–64 (2016).
2. Shi, B., Bai, X. & Yao, C. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans. Pattern Anal. Mach. Intell. 39, 2298–2304 (2017).
3. Shi, B., Wang, X., Lyu, P., Yao, C. & Bai, X. Robust scene text recognition with automatic rectification. Proc. IEEE Comput. Vis. Pattern Recognit. 2016, 4168–4176 (2016).
4. Plamondon, R. & Srihari, S. N. Online and off-line handwriting recognition: A comprehensive survey. IEEE Trans. Pattern Anal. Mach. Intell. 22, 63–84 (2000).
5. Schäfer, B., van Aa, H., Leopold, H. & Stuckenschmidt, H. Sketch2BPMN: Automatic recognition of hand-drawn BPMN models. In Advanced Information System Engineering Vol. 12751 (eds LaRosa, M. et al.) 344–360 (Springer, 2021).