Abstract
AbstractDeep neural networks (DNNs) in 3D scenes show a strong capability of extracting high-level semantic features and significantly promote research in the 3D field. 3D shapes and scenes often exhibit complicated transformation symmetries, where rotation is a challenging and necessary subject. To this end, many rotation invariant and equivariant methods have been proposed. In this survey, we systematically organize and comprehensively overview all methods. First, we rewrite the previous definition of rotation invariance and equivariance by classifying them into weak and strong categories. Second, we provide a unified theoretical framework to analyze these methods, especially weak rotation invariant and equivariant ones that are seldom analyzed theoretically. We then divide existing methods into two main categories, i.e., rotation invariant ones and rotation equivariant ones, which are further subclassified in terms of manipulating input ways and basic equivariant block structures, respectively. In each subcategory, their common essence is highlighted, a couple of representative methods are analyzed, and insightful comments on their pros and cons are given. Furthermore, we deliver a general overview of relevant applications and datasets for two popular tasks of 3D semantic understanding and molecule-related. Finally, we provide several open problems and future research directions based on challenges and difficulties in ongoing research.
Funder
National Science Foundation of China
Publisher
Springer Science and Business Media LLC
Reference343 articles.
1. Almakady Y, Mahmoodi S, Conway J et al (2020) Rotation invariant features based on three dimensional Gaussian Markov random fields for volumetric texture classification. Comput Vis Image Underst 194(102):931. https://doi.org/10.1016/j.cviu.2020.102931
2. Anderson B, Hy TS, Kondor R (2019) Cormorant: covariant molecular neural networks. In: Advances in neural information processing systems (NeurIPS), vol 32. Curran Associates, Inc
3. Andrearczyk V, Depeursinge A (2018) Rotational 3d texture classification using group equivariant cnns. arXiv preprint arXiv:1810.06889
4. Andrearczyk V, Fageot J, Oreiller V, et al (2019) Exploring local rotation invariance in 3d cnns with steerable filters. In: Proceedings of The 2nd international conference on medical imaging with deep learning, proceedings of machine learning research, vol 102. PMLR, pp 15–26
5. Andrearczyk V, Fageot J, Oreiller V et al (2020) Local rotation invariance in 3d cnns. Med Image Anal 65(101):756. https://doi.org/10.1016/j.media.2020.101756