Helping the Visually Impaired See via Image Multi-labeling Based on SqueezeNet CNN-Reference-Cited by-同舟云学术

Helping the Visually Impaired See via Image Multi-labeling Based on SqueezeNet CNN

Published:2019-11-01 Issue:21 Volume:9 Page:4656
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Alhichri Haikel^ORCID,Bazi Yakoub^ORCID,Alajlan Naif,Bin Jdira Bilel

Abstract

This work presents a deep learning method for scene description. (1) Background: This method is part of a larger system, called BlindSys, that assists the visually impaired in an indoor environment. The method detects the presence of certain objects, regardless of their position in the scene. This problem is also known as image multi-labeling. (2) Methods: Our proposed deep learning solution is based on a light-weight pre-trained CNN called SqueezeNet. We improved the SqueezeNet architecture by resetting the last convolutional layer to free weights, replacing its activation function from a rectified linear unit (ReLU) to a LeakyReLU, and adding a BatchNormalization layer thereafter. We also replaced the activation functions at the output layer from softmax to linear functions. These adjustments make up the main contributions in this work. (3) Results: The proposed solution is tested on four image multi-labeling datasets representing different indoor environments. It has achieved results better than state-of-the-art solutions both in terms of accuracy and processing time. (4) Conclusions: The proposed deep CNN is an effective solution for predicting the presence of objects in a scene and can be successfully used as a module within BlindSys.

Funder

National Plan for Science, Technology and Innovation

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/9/21/4656/pdf

Reference40 articles.

1. Vision Impairment and Blindness https://www.who.int/news-room/fact-sheets/detail/blindness-and-visual-impairment

2. US Patent Application for Refreshable Braille Display Patent Application (Application #20130203022 issued 8 August 2013)—Justia Patents Search https://patents.justia.com/patent/20130203022

3. Screen Reader/2—Programmed access to the GUI;Thatcher,1994

4. IBM SCREEN READER/2 www-01.ibm.com/common/ssi/cgi-bin/ssialias

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A comprehensive framework for advanced protein classification and function prediction using synergistic approaches: Integrating bispectral analysis, machine learning, and deep learning;PLOS ONE;2023-12-14

2. Pepper leaf disease recognition based on enhanced lightweight convolutional neural networks;Frontiers in Plant Science;2023-08-09

3. Compressing convolutional neural networks with cheap convolutions and online distillation;Displays;2023-07

4. Towards Web-Based Automation: A Comparative Analysis of Feature Extraction Approaches and Applications for Quality Control;2023 9th International Conference on Web Research (ICWR);2023-05-03

5. A one-stage deep learning method for fully automated mesiodens localization on panoramic radiographs;Biomedical Signal Processing and Control;2023-02