Scene Graph Masked Variational Autoencoders for 3D Scene Generation-Reference-Cited by-同舟云学术

Scene Graph Masked Variational Autoencoders for 3D Scene Generation

Published:2023-10-26 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 31st ACM International Conference on Multimedia
language:
Short-container-title:

Author:

Xu Rui¹^ORCID,Hui Le¹^ORCID,Han Yuehui¹^ORCID,Qian Jianjun¹^ORCID,Xie Jin¹^ORCID

Affiliation:

1. Nanjing University of Science and Techonology, Nanjing, China

Funder

National Natural Science Foundation of China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3581783.3612262

Reference39 articles.

1. Sara Atito , Muhammad Awais , and Josef Kittler . 2021 . Sit: Self-supervised vision transformer. arXiv preprint arXiv:2104.03602 (2021). Sara Atito, Muhammad Awais, and Josef Kittler. 2021. Sit: Self-supervised vision transformer. arXiv preprint arXiv:2104.03602 (2021).

2. Learning Spatial Knowledge for Text to 3D Scene Generation

3. Angel X Chang , Mihail Eric , Manolis Savva , and Christopher D Manning . 2017. SceneSeer: 3D scene design with natural language. arXiv preprint arXiv:1703.00050 ( 2017 ). Angel X Chang, Mihail Eric, Manolis Savva, and Christopher D Manning. 2017. SceneSeer: 3D scene design with natural language. arXiv preprint arXiv:1703.00050 (2017).

4. Mark Chen , Alec Radford , Rewon Child , Jeffrey Wu , Heewoo Jun , David Luan , and Ilya Sutskever . 2020 . Generative pretraining from pixels . In International conference on machine learning. 1691--1703 . Mark Chen, Alec Radford, Rewon Child, Jeffrey Wu, Heewoo Jun, David Luan, and Ilya Sutskever. 2020. Generative pretraining from pixels. In International conference on machine learning. 1691--1703.

5. Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).