PVFAN: Point-view fusion attention network for 3D shape recognition-Reference-Cited by-同舟云学术

PVFAN: Point-view fusion attention network for 3D shape recognition

Published:2023-11-04 Issue:5 Volume:45 Page:8119-8133
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Cao Jiangzhong¹,Liao Siyi¹

Affiliation:

1. School of Information Engineering, Guangdong University of Technology, Guangzhou, China

Abstract

3D shape recognition is a critical research topic in the field of computer vision, attracting substantial attention. Existing approaches mainly focus on extracting distinctive 3D shape features; however, they often neglect the model’s robustness and lack refinement in deep features. To address these limitations, we propose the point-view fusion attention network that aims to extract a concise, informative, and robust 3D shape descriptor. Initially, our approach combines multi-view features with point cloud features to obtain accurate and distinguishable fusion features. To effectively handle these fusion features, we design a dual-attention convolutional network which consists of a channel attention module and a spatial attention module. This dual-attention mechanism greatly enhances the generalization ability and robustness of 3D recognition models. Notably, we introduce a strip-pooling layer in the channel attention module to refine the features, resulting in improved fusion features that are more compact. Finally, a classification process is performed on the refined features to assign appropriate 3D shape labels. Our extensive experiments on the ModelNet10 and ModelNet40 datasets for 3D shape recognition and retrieval demonstrate the remarkable accuracy and robustness of the proposed method.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference65 articles.

1. Covariance-Based Descriptors for Efficient 3D Shape Matching, Retrieval, and Classification;Tabia;IEEE Transactions on Multimedia,2015

2. Comprehensive and Practical Vision System for Self-Driving Vehicle Lane-Level Localization;Du;IEEE Transactions on Image Processing,2016

3. Estimating Heart Rate and Rhythm via 3D Motion Tracking in Depth Video;Yang;IEEE Transactions on Multimedia,2017

4. 3D-shape recognition and size measurement of irregular rough particles using multi-views interferometric out-of-focus imaging;Ouldarbi;Applied Optics,2016

5. 3D Object Recognition in Cluttered Scenes with Local Surface Features: A Survey;Guo;IEEE Transactions on Pattern Analysis and Machine Intelligence,2014