Development of a Two-Stage Segmentation-Based Word Searching Method for Handwritten Document Images

Published:2018-07-04 Issue:1 Volume:29 Page:719-735
ISSN:2191-026X
Container-title:Journal of Intelligent Systems

Author:

Malakar Samir¹,Ghosh Manosij²,Sarkar Ram²,Nasipuri Mita²

Affiliation:

1. Department of Computer Science, Asutosh College, Kolkata, India

2. Department of Computer Science and Engineering, Jadavpur University, Kolkata, India

Abstract

Word searching, or keyword spotting, is an important research problem in document image processing, and it is more challenging for handwritten documents than for printed ones. This work introduces a two-stage word searching scheme. In the first stage, all words irrelevant to the search word are filtered out of the document page image using a zonal feature vector, called the pre-selection feature vector, together with a rule-based binary classification method. In the second stage, a holistic word recognition paradigm confirms whether a pre-selected word is the search word; for this, a modified histogram of oriented gradients-based feature descriptor is combined with a topological feature vector. The method is evaluated on the QUWI English database, which is freely available through the International Conference on Document Analysis and Recognition 2015 competition entitled “Writer Identification and Gender Classification.” The technique provides good retrieval performance in terms of recall, precision, and F-measure scores and outperforms some state-of-the-art methods.
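The filter-then-confirm structure described in the abstract, and the three retrieval scores it reports, can be illustrated with a minimal sketch. This is not the authors' implementation: the `Word` class, the `coarse` and `fine` feature tuples, the thresholds `tol` and `max_dist`, and the sample data are all hypothetical stand-ins. The real pre-selection feature vector, classification rules, and modified HOG plus topological descriptor are not specified on this page.

```python
from dataclasses import dataclass
from math import dist


@dataclass
class Word:
    label: str     # ground-truth transcription, used only for scoring
    coarse: tuple  # hypothetical cheap features (stand-in for the pre-selection vector)
    fine: tuple    # hypothetical richer descriptor (stand-in for HOG + topology)


def preselect(query, candidates, tol):
    """Stage 1: rule-based binary filter on the cheap features."""
    return [w for w in candidates
            if all(abs(a - b) <= tol for a, b in zip(w.coarse, query.coarse))]


def confirm(query, candidates, max_dist):
    """Stage 2: keep pre-selected words whose richer descriptor is near the query's."""
    return [w for w in candidates if dist(w.fine, query.fine) <= max_dist]


def two_stage_search(query, page, tol=0.2, max_dist=0.5):
    # Assumed thresholds; only words surviving stage 1 reach the costlier stage 2.
    return confirm(query, preselect(query, page, tol), max_dist)


def precision_recall_f(retrieved, relevant_total, query_label):
    """Standard retrieval scores: precision, recall, and their harmonic mean."""
    tp = sum(1 for w in retrieved if w.label == query_label)
    precision = tp / len(retrieved) if retrieved else 0.0
    recall = tp / relevant_total if relevant_total else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# Made-up example page with two true instances of the query word "the".
query = Word("the", (0.5, 0.5), (1.0, 0.0))
page = [
    Word("the", (0.55, 0.45), (1.1, 0.1)),  # passes both stages (true positive)
    Word("the", (0.9, 0.5), (1.0, 0.0)),    # rejected in stage 1 (missed instance)
    Word("then", (0.6, 0.5), (1.2, 0.2)),   # passes both stages (false positive)
    Word("cat", (0.5, 0.6), (3.0, 2.0)),    # passes stage 1, rejected in stage 2
    Word("dog", (0.1, 0.1), (0.0, 0.0)),    # rejected in stage 1
]

hits = two_stage_search(query, page)
relevant = sum(1 for w in page if w.label == query.label)
scores = precision_recall_f(hits, relevant, query.label)
print([w.label for w in hits], scores)
```

On this toy data the search returns one correct and one incorrect word out of two relevant instances, so precision, recall, and F-measure are each 0.5. The example also shows the trade-off of a pre-selection stage: it reduces the work done in the second stage, but any relevant word it rejects cannot be recovered later.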

Publisher

Walter de Gruyter GmbH

Subject

Artificial Intelligence,Information Systems,Software

Link

https://www.degruyter.com/document/doi/10.1515/jisys-2017-0384/pdf

Reference: 76 articles.

1. Keyword spotting for self-training of BLSTM NN-based handwriting recognition systems;Pattern Recogn.,2014

2. A hierarchical approach to recognition of handwritten Bangla characters;Pattern Recogn.,2009

3. Learning-based word spotting system for Arabic handwritten documents;Pattern Recogn.,2014

4. Handwritten word-spotting using hidden Markov models and universal vocabularies;Pattern Recogn.,2009

Cited by 9 articles.

1. OMRNet: A lightweight deep learning model for optical mark recognition;Multimedia Tools and Applications;2023-07-12

2. Z-Transform-Based Profile Matching to Develop a Learning-Free Keyword Spotting Method for Handwritten Document Images;International Journal of Computational Intelligence Systems;2022-11-02

3. Handwritten Arabic and Roman word recognition using holistic approach;The Visual Computer;2022-05-19

4. Handwritten English word recognition using a deep learning based object detection architecture;Multimedia Tools and Applications;2021-09-20

5. Hough Transform-Based Angular Features for Learning-Free Handwritten Keyword Spotting;Sensors;2021-07-07