Multiscale Convolutional Neural Networks for Hand Detection

Author:

Yan Shiyang (1, 2), Xia Yizhang (1), Smith Jeremy S. (2), Lu Wenjin (1), Zhang Bailing (1)

Affiliation:

1. Department of Computer Science and Software Engineering, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China

2. Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, UK

Abstract

Unconstrained hand detection in still images plays an important role in many hand-related vision problems, for example, hand tracking, gesture analysis, human action recognition, human-machine interaction, and sign language recognition. Although hand detection has been studied extensively for decades, it remains a challenging task. Contributing factors include heavy occlusion, low resolution, varying illumination conditions, different hand gestures, and the complex interactions between hands and objects or other hands. In this paper, we propose a multiscale deep learning model for unconstrained hand detection in still images. Deep learning models, and deep convolutional neural networks (CNNs) in particular, have achieved state-of-the-art performance on many vision benchmarks. Building on the region-based CNN (R-CNN) model, we propose a hand detection scheme based on candidate regions generated by a generic region proposal algorithm, followed by multiscale information fusion from the popular VGG16 model. Two benchmark datasets were used to validate the proposed method, namely, the Oxford Hand Detection Dataset and the VIVA Hand Detection Challenge. We achieved state-of-the-art results on the Oxford Hand Detection Dataset and satisfactory performance in the VIVA Hand Detection Challenge.
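
The multiscale fusion step mentioned in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the abstract does not say which layers are fused or how, so the single-channel feature maps, the max pooling, the L2 normalization, and the concatenation below are all assumptions, and the names `roi_max_pool`, `l2_normalize`, and `fuse_multiscale` are hypothetical. The idea shown is only that one candidate region is pooled from feature maps of different strides and the results are combined into one descriptor.

```python
import math


def roi_max_pool(feature_map, box, stride, out_size=2):
    """Max-pool the region of a 2D feature map covered by `box`.

    feature_map: list of rows (H x W floats); one channel, for simplicity.
    box: (x1, y1, x2, y2) in image pixels.
    stride: image pixels per feature-map cell (assumed known per layer).
    Returns a flat list of out_size * out_size values.
    """
    h, w = len(feature_map), len(feature_map[0])
    x1, y1, x2, y2 = box
    # Project the box onto the feature map and clamp to its bounds.
    fx1 = min(max(int(x1 // stride), 0), w - 1)
    fy1 = min(max(int(y1 // stride), 0), h - 1)
    fx2 = min(max(math.ceil(x2 / stride), fx1 + 1), w)
    fy2 = min(max(math.ceil(y2 / stride), fy1 + 1), h)
    rh, rw = fy2 - fy1, fx2 - fx1

    pooled = []
    for gy in range(out_size):
        # Bin bounds: floor for the start, ceil for the end, so no bin is empty.
        ys = fy1 + (gy * rh) // out_size
        ye = fy1 + -(-((gy + 1) * rh) // out_size)
        for gx in range(out_size):
            xs = fx1 + (gx * rw) // out_size
            xe = fx1 + -(-((gx + 1) * rw) // out_size)
            pooled.append(
                max(feature_map[y][x] for y in range(ys, ye) for x in range(xs, xe))
            )
    return pooled


def l2_normalize(vec):
    """Scale a vector to unit L2 norm so that layers with different
    activation magnitudes contribute comparably (an assumed design choice)."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else list(vec)


def fuse_multiscale(feature_maps, strides, box, out_size=2):
    """Pool one candidate box from each feature map and concatenate the results."""
    fused = []
    for fmap, stride in zip(feature_maps, strides):
        fused.extend(l2_normalize(roi_max_pool(fmap, box, stride, out_size)))
    return fused


# Toy example: a 32x32 image seen through a fine (stride 4) and a coarse (stride 8) map.
fine = [[float(r * 8 + c) for c in range(8)] for r in range(8)]
coarse = [[float(r * 4 + c) for c in range(4)] for r in range(4)]
descriptor = fuse_multiscale([fine, coarse], [4, 8], box=(0, 0, 16, 16))
```

In a real detector the feature maps would come from several convolutional stages of a network such as VGG16 and would have many channels; the fused descriptor would then feed a classifier that scores the region as hand or background.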

Publisher

Hindawi Limited

Subject

Artificial Intelligence, Computer Networks and Communications, Computer Science Applications, Civil and Structural Engineering, Computational Mechanics

Cited by 24 articles.

1. Multimodal Machine Learning for Sign Language Prediction;IFMBE Proceedings;2023-09-14

2. Hand gesture recognition with focus on leap motion: An overview, real world challenges and future directions;Expert Systems with Applications;2023-09

3. Study on HGR by Using Machine Learning;2023 3rd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE);2023-05-12

4. Analysis of Machine Learning for Recognizing Hand Gestures;2023 3rd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE);2023-05-12

5. Cybersecurity Assessment Construction of Artificial Intelligence;Advances on Intelligent Computing and Data Science;2023
