SEDIQA: Sound Emitting Document Image Quality Assessment in a Reading Aid for the Visually Impaired-Reference-Cited by-同舟云学术

SEDIQA: Sound Emitting Document Image Quality Assessment in a Reading Aid for the Visually Impaired

Published:2021-08-30 Issue:9 Volume:7 Page:168
ISSN:2313-433X
Container-title:Journal of Imaging
language:en
Short-container-title:J. Imaging

Author:

Courtney Jane^ORCID

Abstract

For visually impaired people (VIPs), the ability to convert text to sound can mean a new level of independence or the simple joy of a good book. With significant advances in optical character recognition (OCR) in recent years, a number of reading aids are appearing on the market. These reading aids convert images captured by a camera to text which can then be read aloud. However, all of these reading aids suffer from a key issue—the user must be able to visually target the text and capture an image of sufficient quality for the OCR algorithm to function—no small task for VIPs. In this work, a sound-emitting document image quality assessment metric (SEDIQA) is proposed which allows the user to hear the quality of the text image and automatically captures the best image for OCR accuracy. This work also includes testing of OCR performance against image degradations, to identify the most significant contributors to accuracy reduction. The proposed no-reference image quality assessor (NR-IQA) is validated alongside established NR-IQAs and this work includes insights into the performance of these NR-IQAs on document images. SEDIQA is found to consistently select the best image for OCR accuracy. The full system includes a document image enhancement technique which introduces improvements in OCR accuracy with an average increase of 22% and a maximum increase of 68%.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition,Radiology, Nuclear Medicine and imaging

Link

https://www.mdpi.com/2313-433X/7/9/168/pdf

Reference52 articles.

1. The Evaluation of Mobile Applications as Low Vision Aids: The Patient Perspective;Dockery;Invest. Ophthalmol. Vis. Sci.,2020

2. Commentary: An app a day keeps the eye doctor busy

3. A Systematic Review of Urban Navigation Systems for Visually Impaired People

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Document Image Quality Assessment: A Survey;ACM Computing Surveys;2023-09-14

2. Detection of Antibiotic Constituent in Aspergillus flavus Using Quantum Convolutional Neural Network;International Journal of E-Health and Medical Communications;2023-04-14

3. Restoring severely out-of-focus blurred text images with Deep Image Prior;Inverse Problems and Imaging;2022