PCGen: A Fully Parallelizable Point Cloud Generative Model
Author:
Vercheval Nicolas12ORCID, Royen Remco3ORCID, Munteanu Adrian3ORCID, Pižurica Aleksandra1ORCID
Affiliation:
1. Research Group for Artificial Intelligence and Sparse Modelling (GAIM), Department of Telecommunications and Information Processing, Faculty of Engineering and Architecture, Ghent University, 9000 Ghent, Belgium 2. Clifford Research Group, Department of Electronics and Information Systems, Faculty of Engineering and Architecture, Ghent University, 9000 Ghent, Belgium 3. Department of Electronics and Informatics (ETRO), Faculty of Engineering, Vrije Universiteit Brussel, 1050 Brussel, Belgium
Abstract
Generative models have the potential to revolutionize 3D extended reality. A primary obstacle is that augmented and virtual reality need real-time computing. Current state-of-the-art point cloud random generation methods are not fast enough for these applications. We introduce a vector-quantized variational autoencoder model (VQVAE) that can synthesize high-quality point clouds in milliseconds. Unlike previous work in VQVAEs, our model offers a compact sample representation suitable for conditional generation and data exploration with potential applications in rapid prototyping. We achieve this result by combining architectural improvements with an innovative approach for probabilistic random generation. First, we rethink current parallel point cloud autoencoder structures, and we propose several solutions to improve robustness, efficiency and reconstruction quality. Notable contributions in the decoder architecture include an innovative computation layer to process the shape semantic information, an attention mechanism that helps the model focus on different areas and a filter to cover possible sampling errors. Secondly, we introduce a parallel sampling strategy for VQVAE models consisting of a double encoding system, where a variational autoencoder learns how to generate the complex discrete distribution of the VQVAE, not only allowing quick inference but also describing the shape with a few global variables. We compare the proposed decoder and our VQVAE model with established and concurrent work, and we prove, one by one, the validity of the single contributions.
Funder
Fonds Wetenschappelijk Onderzoek (FWO) projects Flanders AI Research Programme
Reference45 articles.
1. Wallach, H., Larochelle, H., Beygelzimer, A., d’Alché-Buc, F., Fox, E., and Garnett, R. (2019). Advances in Neural Information Processing Systems, Curran Associates, Inc. 2. Karras, T., Laine, S., and Aila, T. (2019, January 15–20). A Style-Based Generator Architecture for Generative Adversarial Networks. Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA. 3. Shi, Z., Peng, S., Xu, Y., Geiger, A., Liao, Y., and Shen, Y. (2022). Deep generative models on 3d representations: A survey. arXiv. 4. Parra Pennefather, P. (2022). Creative Prototyping with Generative AI, Apress. Design Thinking. 5. Awiszus, M., Schubert, F., and Rosenhahn, B. (2021, January 17–20). World-GAN: A Generative Model for Minecraft Worlds. Proceedings of the 2021 IEEE Conference on Games (CoG), Copenhagen, Denmark.
|
|