Author:
Molnár Szilárd,Tamás Levente
Abstract
AbstractVariational autoencoders (VAEs) play an important role in high-dimensional data generation based on their ability to fuse the stochastic data representation with the power of recent deep learning techniques. The main advantages of these types of generators lie in their ability to encode the information with the possibility to decode and generalize new samples. This capability was heavily explored for 2D image processing; however, only limited research focuses on VAEs for 3D data processing. In this article, we provide a thorough review of the latest achievements in 3D data processing using VAEs. These 3D data types are mostly point clouds, meshes, and voxel grids, which are the focus of a wide range of applications, especially in robotics. First, we shortly present the basic autoencoder with the extensions towards the VAE with further subcategories relevant to discrete point cloud processing. Then, the 3D data specific VAEs are presented according to how they operate on spatial data. Finally, a few comprehensive table summarizing the methods, codes, and datasets as well as a citation map is presented for a better understanding of the VAEs applied to 3D data. The structure of the analyzed papers follows a taxonomy, which differentiates the algorithms according to their primary data types and application domains.
Funder
Unitatea Executiva pentru Finantarea Invatamantului Superior, a Cercetarii, Dezvoltarii si Inovarii
Hungarian Science Foundation
H2020 European Research Council
Publisher
Springer Science and Business Media LLC
Reference265 articles.
1. Aberman K, Li P, Lischinski D et al (2020) Skeleton-aware networks for deep motion retargeting. ACM Trans Graph 39(4):62:1-62:14
2. Achlioptas P, Diamanti O, Mitliagkas I et al (2018) Learning Representations and Generative Models for 3D Point Clouds. In: Dy JG, Krause A (eds) Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018, Proceedings of Machine Learning Research, vol 80. Proceedings of Machine Learning Research, pp 40–49
3. Akcay S, Atapour-Abarghouei A, Breckon TP (2019) GANomaly: Semi-supervised Anomaly Detection via Adversarial Training. In: Computer Vision—ACCV 2018—14th Asian Conference on Computer Vision, Perth, Australia, December 2–6, 2018, Revised Selected Papers, Part III, vol 11363. Springer, pp 622–637
4. Algazi V, Duda R, Thompson D et al (2001) The CIPIC HRTF Database. In: Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics. IEEE, New Paltz, NY, USA, pp 99–102
5. Ali S, van Kaick O (2021) Evaluation of Latent Space Learning With Procedurally-Generated Datasets of Shapes. In: IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021, Montreal, BC, Canada, October 11-17, 2021. IEEE, online, pp 2086–2094, https://github.com/SharjeelAliCS/3D-latent-space-eval
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Maschinelles Lernen als modernes Werkzeug für Additive Fertigung;Zeitschrift für wirtschaftlichen Fabrikbetrieb;2024-08-16
2. Traversability Map Estimation for Agricultural Robots;2024 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR);2024-05-16