A Manufacturing-Oriented Intelligent Vision System Based on Deep Neural Network for Object Recognition and 6D Pose Estimation-Reference-Cited by-同舟云学术

A Manufacturing-Oriented Intelligent Vision System Based on Deep Neural Network for Object Recognition and 6D Pose Estimation

Published:2021-01-07 Issue: Volume:14 Page:
ISSN:1662-5218
Container-title:Frontiers in Neurorobotics
language:
Short-container-title:Front. Neurorobot.

Author:

Liang Guoyuan,Chen Fan,Liang Yu,Feng Yachun,Wang Can,Wu Xinyu

Abstract

Nowadays, intelligent robots are widely applied in the manufacturing industry, in various working places or assembly lines. In most manufacturing tasks, determining the category and pose of parts is important, yet challenging, due to complex environments. This paper presents a new two-stage intelligent vision system based on a deep neural network with RGB-D image inputs for object recognition and 6D pose estimation. A dense-connected network fusing multi-scale features is first built to segment the objects from the background. The 2D pixels and 3D points in cropped object regions are then fed into a pose estimation network to make object pose predictions based on fusion of color and geometry features. By introducing the channel and position attention modules, the pose estimation network presents an effective feature extraction method, by stressing important features whilst suppressing unnecessary ones. Comparative experiments with several state-of-the-art networks conducted on two well-known benchmark datasets, YCB-Video and LineMOD, verified the effectiveness and superior performance of the proposed method. Moreover, we built a vision-guided robotic grasping system based on the proposed method using a Kinova Jaco2 manipulator with an RGB-D camera installed. Grasping experiments proved that the robot system can effectively implement common operations such as picking up and moving objects, thereby demonstrating its potential to be applied in all kinds of real-time manufacturing applications.

Publisher

Frontiers Media SA

Subject

Artificial Intelligence,Biomedical Engineering

Reference40 articles.

1. Segnet: a deep convolutional encoder-decoder architecture for scene segmentation;Badrinarayanan;IEEE Trans. Pattern Anal. Mach. Intell,2017

2. Learning 6D object pose estimation using 3D object coordinates;Brachmann,2014

3. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs;Chen;IEEE Trans. Pattern Anal. Mach. Intell

4. Rethinking atrous convolution for semantic image segmentation;Chen

5. The importance of skip connections in biomedical image segmentation;Drozdzal,2016

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. DON6D: a decoupled one-stage network for 6D pose estimation;Scientific Reports;2024-04-10

2. Real-Time Assessment of Rodent Engagement Using ArUco Markers: A Scalable and Accessible Approach for Scoring Behavior in a Nose-Poking Go/No-Go Task;eneuro;2024-02-13

3. DCSPose: A Dual-Channel Siamese Framework for Unseen Textureless Object Pose Estimation;Applied Sciences;2024-01-15

4. Recent Developments in Robotic Grasping Detection;Lecture Notes in Networks and Systems;2024

5. Applications of Uncalibrated Image Based Visual Servoing in Micro- and Macroscale Robotics;2023 IEEE 19th International Conference on Automation Science and Engineering (CASE);2023-08-26