Affiliation:
1. Department of Information Technology, Dharmsinh Desai University, Nadiad-387001, Gujarat, India
2. Department of Computer Science, DAIICT, Gandhinagar, Gujarat, India
Abstract
Textual information is the most common type of way by which we can determine what text/texts we are looking for. In order to retrieve text from images the first and foremost step is text detection from the image. Text detection has a wide range of applications such as translation, smart car driving system, information retrieval, indexing of multimedia archives, sign board reading, and countless. Multilingual text detection from images adds an extra complication to a computer vision problem. As India is a multilingual country and therefore multi-script texts can be found almost everywhere. A multi-script text differs in terms of formats, strokes, width, and height. Also, universal features for such an environment are unknown and difficult to determine as well. Therefore, detecting multi-script text from images is an important yet unsolved problem. In this work, we proposed a faster RCNN-based method for detecting English, Hindi, and Gujarati text from Images. Faster RCNN is the state-of-the-art approach for object detection. As it works for objects which are of large size and texts are of smaller size, the parameters are tuned to meet the objective of multi-script text detection. The dataset is created by collecting images as there is no standard dataset available that includes English, Gujarati, and Hindi texts in the public domain.
Publisher
World Scientific Pub Co Pte Ltd
Subject
General Earth and Planetary Sciences,General Engineering,General Environmental Science
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献