Ultimate pose estimation: A comparative study

Published: 2024-03-31 Issue: 9 Volume: 41
ISSN:0266-4720
Container-title:Expert Systems
language:en
Short-container-title:Expert Systems

Author:

Hassan Esraa¹, Hossain M. Shamim², Elmuogy Samir³, Ghoneim Ahmed², AlMutib Khalid², Saber Abeer⁴

Affiliation:

1. Department of Machine Learning and Information Retrieval, Faculty of Artificial Intelligence, Kafrelsheikh University, Kafr El Sheikh, Egypt

2. Department of Software Engineering, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia

3. Department of Computer Science, Faculty of Computers and Information, Mansoura University, Mansoura, Egypt

4. Information Technology Department, Faculty of Computers and Artificial Intelligence, Damietta University, Egypt

Abstract

Pose estimation is a computer vision task that detects and estimates the pose of a person or an object in images or videos. Some of its challenges can be addressed by leveraging advances in computer vision research, while others still require efficient solutions. In this paper, we provide a preliminary review of the state‐of‐the‐art in pose estimation, including both traditional and deep learning approaches. We also implement Hand Pose Estimation (HandPE), which uses the PoseNet architecture for hand sign problems, on an ASL dataset and compare its performance under different optimizers, using 10 common evaluation metrics across different datasets. In addition, we discuss related future research directions in the field of pose estimation and explore new architectures for the different types of pose estimation. With the PoseNet model, the experimental results showed accuracies of 99.9%, 89%, 97%, 79%, and 99% on the ASL alphabet, HARPET, Yoga, Animal, and Head datasets, respectively, compared across common optimizers and evaluation metrics.

Funder

King Saud University

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/exsy.13586

References: 66 articles.

1. A Data-Driven Approach to Improve 3D Head-Pose Estimation

2. Albiero V., Chen X., Yin X., Pang G., & Hassner T. (2020). img2pose: Face Alignment and Detection via 6DoF Face Pose Estimation. http://arxiv.org/abs/2012.07791

3. Deep Learning for EEG motor imagery classification based on multi-layer CNNs feature fusion

4. Artacho B., & Savakis A. (2021). BAPose: Bottom‐up pose estimation with disentangled waterfall representations. http://arxiv.org/abs/2112.10716

5. ASL dataset: ASL Datasets: Image data set for alphabets in the American Sign Language. https://www.kaggle.com/datasets/grassknoted/asl-alphabet