Temporal-Aware Neural Network for Dense Non-Rigid Structure from Motion-Reference-Cited by-同舟云学术

Temporal-Aware Neural Network for Dense Non-Rigid Structure from Motion

Published:2023-09-19 Issue:18 Volume:12 Page:3942
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Wang Yaming¹²,Xu Dawei¹,Huang Wenqing¹^ORCID,Ye Xiaoping²,Jiang Mingfeng¹

Affiliation:

1. Pattern Recognition and Computer Vision Lab, Zhejiang Sci-Tech University, Hangzhou 310018, China

2. Key Laboratory of Digital Design and Intelligent Manufacture in Culture & Creativity Product of Zhejiang Province, Lishui University, Lishui 323000, China

Abstract

Modern neural networks addressing dense Non-Rigid Structure from Motion (NRSFM) dilemmas often grapple with intricate a priori constraints, deterring scalability, or overlook the imperative of consistent application of a priori knowledge throughout the entire input sequence. In this paper, an innovative neural network architecture is introduced. Initially, the complete 2D sequence image undergoes embedding into a low-dimensional space. Subsequently, multiple self-attention layers are employed to extract inter-frame features, with the objective of deriving a more continuous and temporally smooth low-dimensional structure closely resembling real data’s intrinsic structure. Moreover, it has been demonstrated by others that gradient descent during the training of multilayer linear networks yields minimum rank solutions, implicitly providing regularization that is equally applicable to this task. Benefiting from the excellence of the proposed network architecture, no additional a priori knowledge is mandated, barring the constraint of temporal smoothness. Extensive experimentation confirms the method’s exceptional performance in addressing dense NRSFM challenges, outperforming recent results across various dense benchmark datasets.

Funder

the Natural Science Foundation of Zhejiang Province

the National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/18/3942/pdf

Reference47 articles.

1. Wang, C., and Lucey, S. (2021, January 19–25). Paul: Procrustean autoencoder for unsupervised lifting. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Online.

2. Russell, C., Fayad, J., and Agapito, L. (2012, January 3–5). Dense non-rigid structure from motion. Proceedings of the 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission, Li’ege, Belgium.

3. Golyanik, V., and Stricker, D. (2017, January 24–31). Dense batch non-rigid structure from motion in a second. Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, CA, USA.

4. Kumar, S., and Van Gool, L. (2022, January 23–27). Organic priors in non-rigid structure from motion. Proceedings of the European Conference on Computer Vision, Tel Aviv, Israel.

5. A closed-form uncertainty propagation in non-rigid structure from motion;Song;IEEE Robot. Autom. Lett.,2022