Abstract
AbstractThe emergence of the metaverse has led to the rapidly increasing demand for the generation of extensive 3D worlds. We consider that an engaging world is built upon a rational layout of multiple land-use areas (e.g., forest, meadow, and farmland). To this end, we propose a generative model of land-use distribution that learns from geographic data. The model is based on a transformer architecture that generates a 2D map of the land-use layout, which can be conditioned on spatial and semantic controls, depending on whether either one or both are provided. This model enables diverse layout generation with user control and layout expansion by extending borders with partial inputs. To generate high-quality and satisfactory layouts, we devise a geometric objective function that supervises the model to perceive layout shapes and regularize generations using geometric priors. Additionally, we devise a planning objective function that supervises the model to perceive progressive composition demands and suppress generations deviating from controls. To evaluate the spatial distribution of the generations, we train an autoencoder to embed land-use layouts into vectors to enable comparison between the real and generated data using the Wasserstein metric, which is inspired by the Fréchet inception distance.
Publisher
Springer Science and Business Media LLC
Reference45 articles.
1. Dionisio, J. D. N.; Burns III, W. G.; Gilbert, R. 3D virtual worlds and the metaverse: Current status and possibilities. ACM Computing Surveys Vol. 45, No. 3, Article No. 34, 2013.
2. Li, J. N.; Yang, J. M.; Hertzmann, A.; Zhang, J. M.; Xu, T. F. LayoutGAN: Generating graphic layouts with wireframe discriminators. arXiv preprint arXiv:1901.06767, 2019.
3. Jyothi, A. A.; Durand, T.; He, J. W.; Sigal, L.; Mori, G. LayoutVAE: Stochastic scene layout generation from a label set. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, 9894–9903, 2019.
4. Gupta, K.; Lazarow, J.; Achille, A.; Davis, L.; Mahadevan, V.; Shrivastava, A. LayoutTransformer: Layout generation and completion with self-attention. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, 984–994, 2021.
5. Arroyo, D. M.; Postels, J.; Tombari, F. Variational transformer networks for layout generation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 13637–13647, 2021.