MixFormer: A Self-Attentive Convolutional Network for 3D Mesh Object Recognition-Reference-Cited by-同舟云学术

MixFormer: A Self-Attentive Convolutional Network for 3D Mesh Object Recognition

Published:2023-03-21 Issue:3 Volume:16 Page:171
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Huang Lingfeng¹,Zhao Jieyu¹,Chen Yu¹

Affiliation:

1. Mobile Network Application Technology Laboratory, School of Information Science and Engineering, Ningbo University, 818 Fenghua Road, Ningbo 315211, China

Abstract

3D mesh as a complex data structure can provide effective shape representation for 3D objects, but due to the irregularity and disorder of the mesh data, it is difficult for convolutional neural networks to be directly applied to 3D mesh data processing. At the same time, the extensive use of convolutional kernels and pooling layers focusing on local features can cause the loss of spatial information and dependencies of low-level features. In this paper, we propose a self-attentive convolutional network MixFormer applied to 3D mesh models. By defining 3D convolutional kernels and vector self-attention mechanisms applicable to 3D mesh models, our neural network is able to learn 3D mesh model features. Combining the features of convolutional networks and transformer networks, the network can focus on both local detail features and long-range dependencies between features, thus achieving good learning results without stacking multiple layers and saving arithmetic overhead compared to pure transformer architectures. We conduct classification and semantic segmentation experiments on SHREC15, SCAPE, FAUST, MIT, and Adobe Fuse datasets. Experimental results show that the network can achieve 96.7% classification and better segmentation results by using fewer parameters and network layers.

Funder

National Natural Science Foundation of China

National Natural Science Foundation of Zhejiang Province

Publisher

MDPI AG

Subject

Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/16/3/171/pdf

Reference49 articles.

1. Hazard, C., Bhagat, A., Buddharaju, B.R., Liu, Z., Shao, Y., Lu, L., Omari, S., and Cui, H. (2022, January 19–20). Importance Is in Your Attention: Agent Importance Prediction for Autonomous Driving. Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, USA.

2. Klingner, M., Muller, K., Mirzaie, M., Breitenstein, J., Termohlen, J.-A., and Fingscheidt, T. (2022, January 19–20). On the Choice of Data for Efficient Training and Validation of End-to-End Driving Models. Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, USA.

3. Wang, J., Li, X., Sullivan, A., Abbott, L., and Chen, S. (2022, January 19–20). PointMotionNet: Point-Wise Motion Learning for Large-Scale LiDAR Point Clouds Sequences. Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, USA.

4. Grishchenko, I., Ablavatski, A., Kartynnik, Y., Raveendran, K., and Grundmann, M. (2020). Attention Mesh: High-Fidelity Face Mesh Prediction in Real-Time. arXiv.

5. Cohn, B.A., Maselli, A., Ofek, E., and Gonzalez-Franco, M. (2020, January 14–18). SnapMove: Movement Projection Mapping in Virtual Reality. Proceedings of the 2020 IEEE International Conference on Artificial Intelligence and Virtual Reality (AIVR), Utrecht, The Netherlands.