Affiliation:
1. NYU Multimedia and Visual Computing Lab
2. Dept. of ECE, NYU Abu Dhabi, UAE
3. Dept. of CSE, NYU Tandon School of Engineering, USA
4. Dept. of ECE, NYU Tandon School of Engineering, USA
Abstract
Learning a 3D shape representation from a collection of its rendered 2D images has been extensively studied. However, existing view-based techniques have not yet fully exploited the information shared among all the projected views. In this paper, by employing a recurrent neural network to efficiently capture features across different views, we propose a siamese CNN-BiLSTM network for 3D shape representation learning. The proposed method minimizes a discriminative loss function to learn a deep nonlinear transformation that maps 3D shapes from the original space into a nonlinear feature space. In the transformed space, the distance between 3D shapes with the same label is minimized, while the distance between shapes with different labels is pushed beyond a large margin. Specifically, each 3D shape is first projected into a group of 2D images from different views. A convolutional neural network (CNN) then extracts features from each view image, and a bidirectional long short-term memory (BiLSTM) network aggregates information across the views. Finally, the whole CNN-BiLSTM network is embedded in a siamese structure trained with a contrastive loss function. The proposed method is evaluated on two benchmarks, ModelNet40 and SHREC 2014, demonstrating superiority over state-of-the-art methods.
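
The following is a minimal PyTorch sketch of the pipeline described in the abstract. The per-view CNN backbone, layer sizes, number of rendered views, and contrastive margin are illustrative assumptions rather than the authors' configuration; only the overall structure (a shared CNN-BiLSTM encoder used in a siamese setup with a contrastive loss) follows the abstract.

import torch
import torch.nn as nn
import torch.nn.functional as F

class CNNBiLSTMEncoder(nn.Module):
    """Encodes a 3D shape given as a sequence of rendered 2D views."""

    def __init__(self, feat_dim=256, hidden_dim=128):
        super().__init__()
        # Small per-view CNN (an assumption; the paper may use a different backbone).
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, feat_dim), nn.ReLU(),
        )
        # Bidirectional LSTM aggregates per-view features across the view sequence.
        self.bilstm = nn.LSTM(feat_dim, hidden_dim, batch_first=True,
                              bidirectional=True)

    def forward(self, views):                  # views: (B, V, 3, H, W)
        b, v = views.shape[:2]
        per_view = self.cnn(views.flatten(0, 1)).view(b, v, -1)
        out, _ = self.bilstm(per_view)         # (B, V, 2 * hidden_dim)
        return out.mean(dim=1)                 # pool over views -> shape descriptor

def contrastive_loss(z1, z2, same_label, margin=1.0):
    """Pull same-class pairs together; push different-class pairs beyond a margin."""
    d = F.pairwise_distance(z1, z2)
    return (same_label * d.pow(2) +
            (1 - same_label) * F.relu(margin - d).pow(2)).mean()

if __name__ == "__main__":
    enc = CNNBiLSTMEncoder()                   # shared weights = siamese structure
    a = torch.randn(4, 12, 3, 64, 64)          # 12 rendered views per shape (assumed)
    b = torch.randn(4, 12, 3, 64, 64)
    same = torch.tensor([1., 0., 1., 0.])      # 1 if the pair shares a class label
    loss = contrastive_loss(enc(a), enc(b), same)
    loss.backward()

Both branches of the pair are passed through the same encoder instance, which is what makes the structure siamese; the margin value and view count above are placeholders for the quantities reported in the paper.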
Publisher
International Joint Conferences on Artificial Intelligence Organization
Cited by
11 articles.