A Review of Machine Learning and Deep Learning for Object Detection, Semantic Segmentation, and Human Action Recognition in Machine and Robotic Vision-Reference-Cited by-同舟云学术

A Review of Machine Learning and Deep Learning for Object Detection, Semantic Segmentation, and Human Action Recognition in Machine and Robotic Vision

Published:2024-01-23 Issue:2 Volume:12 Page:15
ISSN:2227-7080
Container-title:Technologies
language:en
Short-container-title:Technologies

Author:

Manakitsa Nikoleta¹^ORCID,Maraslidis George S.¹^ORCID,Moysis Lazaros²³^ORCID,Fragulis George F.¹^ORCID

Affiliation:

1. Department of Electrical and Computer Engineering, University of Western Macedonia, 50100 Kozani, Greece

2. Laboratory of Nonlinear Systems-Circuits and Complexity, Physics Department, Aristotle University of Thessaloniki, 54624 Thessaloniki, Greece

3. Department of Mechanical Engineering, University of Western Macedonia, ZEP Campus, 50100 Kozani, Greece

Abstract

Machine vision, an interdisciplinary field that aims to replicate human visual perception in computers, has experienced rapid progress and significant contributions. This paper traces the origins of machine vision, from early image processing algorithms to its convergence with computer science, mathematics, and robotics, resulting in a distinct branch of artificial intelligence. The integration of machine learning techniques, particularly deep learning, has driven its growth and adoption in everyday devices. This study focuses on the objectives of computer vision systems: replicating human visual capabilities including recognition, comprehension, and interpretation. Notably, image classification, object detection, and image segmentation are crucial tasks requiring robust mathematical foundations. Despite the advancements, challenges persist, such as clarifying terminology related to artificial intelligence, machine learning, and deep learning. Precise definitions and interpretations are vital for establishing a solid research foundation. The evolution of machine vision reflects an ambitious journey to emulate human visual perception. Interdisciplinary collaboration and the integration of deep learning techniques have propelled remarkable advancements in emulating human behavior and perception. Through this research, the field of machine vision continues to shape the future of computer systems and artificial intelligence applications.

Publisher

MDPI AG

Link

https://www.mdpi.com/2227-7080/12/2/15/pdf

Reference188 articles.

1. A survey on deep multimodal learning for computer vision: Advances, trends, applications, and datasets;Bayoudh;Vis. Comput.,2021

2. Robotic Vision for Human-Robot Interaction and Collaboration: A Survey and Systematic Review;Robinson;ACM Trans. Hum.-Robot. Interact.,2023

3. Computer Vision for Supporting Visually Impaired People: A Systematic Review;Anthony;Eng. Math. Comput. Sci. (Emacs) J.,2021

4. Deep learning for computer vision: A brief review;Voulodimos;Comput. Intell. Neurosci.,2018

5. Deep Learning for Object Detection and Scene Perception in Self-Driving Cars: Survey, Challenges, and Open Issues;Gupta;Array,2021

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Computer vision for enhanced quantification of FEA of ballistic impact;International Journal of Mechanical Sciences;2024-12

2. Transfer of Periodic Phenomena in Multiphase Capillary Flows to a Quasi-Stationary Observation Using U-Net;Computers;2024-09-13

3. AI-powered trustable and explainable fall detection system using transfer learning;Image and Vision Computing;2024-09

4. Oncologic Applications of Artificial Intelligence and Deep Learning Methods in CT Spine Imaging—A Systematic Review;Cancers;2024-08-28

5. Using ArcFace Loss Function and Softmax with Temperature Activation Function for Improvement in X-ray Baggage Image Classification Quality;Mathematics;2024-08-18