Multi-Modal 3D Shape Clustering with Dual Contrastive Learning

Published: 2022-07-22 Issue: 15 Volume: 12 Page: 7384
ISSN: 2076-3417
Container-title: Applied Sciences
Language: en
Short-container-title: Applied Sciences

Author:

Lin Guoting, Zheng Zexun, Chen Lin, Qin Tianyi, Song Jiahui

Abstract

3D shape clustering is becoming an important research subject given the wide application of 3D shapes in computer vision and multimedia. Since 3D shapes generally take on various modalities, how to comprehensively exploit their multi-modal properties to boost clustering performance has become a key issue for the 3D shape clustering task. Taking into account the advantages of multiple views and point clouds, this paper proposes the first multi-modal 3D shape clustering method, named the dual contrastive learning network (DCL-Net), to discover the clustering partitions of unlabeled 3D shapes. First, by simultaneously performing cross-view contrastive learning within the multi-view modality and cross-modal contrastive learning between the point cloud and multi-view modalities in the representation space, a representation-level dual contrastive learning module is developed, which aims to capture discriminative 3D shape features for clustering. Meanwhile, an assignment-level dual contrastive learning module is designed by further ensuring the consistency of clustering assignments within the multi-view modality, as well as between the point cloud and multi-view modalities, thus obtaining more compact clustering partitions. Experiments on two commonly used 3D shape benchmarks demonstrate the effectiveness of the proposed DCL-Net.

Funder

National Natural Science Foundation of China

China Postdoctoral Science Foundation

Tianjin Research Innovation Project for Postgraduate Students

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/12/15/7384/pdf

References: 57 articles.

1. Learning Multi-View Representation With LSTM for 3-D Shape Recognition and Retrieval

3. 3D shape recognition and retrieval based on multi-modality deep learning

4. Geometric Back-projection Network for Point Cloud Classification

5. 3D2SeqViews: Aggregating Sequential Views for 3D Global Feature Learning by CNN With Hierarchical Attention Aggregation

Cited by 4 articles.

1. Fast Dynamic Multi-view Clustering with semantic-consistency inheritance;Knowledge-Based Systems;2024-09

2. Contrastive Multi-View Learning for 3D Shape Clustering;IEEE Transactions on Multimedia;2024

3. Efficient estimation of the number of clusters for high-dimension data;The Journal of Defense Modeling and Simulation: Applications, Methodology, Technology;2023-12-06

4. Multi-Modal Learning for Predicting the Genotype of Glioma;IEEE Transactions on Medical Imaging;2023-11