Advances in Data‐Driven Analysis and Synthesis of 3D Indoor Scenes-Reference-Cited by-同舟云学术

Advances in Data‐Driven Analysis and Synthesis of 3D Indoor Scenes

Published:2023-09-11 Issue: Volume: Page:
ISSN:0167-7055
Container-title:Computer Graphics Forum
language:en
Short-container-title:Computer Graphics Forum

Author:

Patil Akshay Gadi¹,Patil Supriya Gadi¹,Li Manyi²^ORCID,Fisher Matthew³,Savva Manolis¹,Zhang Hao¹

Affiliation:

1. Simon Fraser University Burnaby Canada

2. Shandong University Jinan China

3. Adobe Research San Francisco USA

Abstract

AbstractThis report surveys advances in deep learning‐based modelling techniques that address four different 3D indoor scene analysis tasks, as well as synthesis of 3D indoor scenes. We describe different kinds of representations for indoor scenes, various indoor scene datasets available for research in the aforementioned areas, and discuss notable works employing machine learning models for such scene modelling tasks based on these representations. Specifically, we focus on the analysis and synthesis of 3D indoor scenes. With respect to analysis, we focus on four basic scene understanding tasks – 3D object detection, 3D scene segmentation, 3D scene reconstruction and 3D scene similarity. And for synthesis, we mainly discuss neural scene synthesis works, though also highlighting model‐driven methods that allow for human‐centric, progressive scene synthesis. We identify the challenges involved in modelling scenes for these tasks and the kind of machinery that needs to be developed to adapt to the data representation, and the task setting in general. For each of these tasks, we provide a comprehensive summary of the state‐of‐the‐art works across different axes such as the choice of data representation, backbone, evaluation metric, input, output and so on, providing an organized review of the literature. Towards the end, we discuss some interesting research directions that have the potential to make a direct impact on the way users interact and engage with these virtual scene models, making them an integral part of the metaverse.

Funder

Natural Sciences and Engineering Research Council of Canada

Publisher

Wiley

Subject

Computer Graphics and Computer-Aided Design

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/cgf.14927

Reference159 articles.

1. [ADD*19] AvetisyanA. DahnertM. DaiA. SavvaM. ChangA. X. NießnerM.:Scan2cad: Learning cad model alignment in rgb‐d scans. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition(2019) pp.2614–2623.

2. [ADN19] AvetisyanA. DaiA. NießnerM.:End‐to‐end cad model retrieval and 9dof alignment in 3d scans. InProceedings of the IEEE/CVF International Conference on computer vision(2019) pp.2551–2560.

3. [AGSK20] AggarwalM. GuptaH. SarkarM. KrishnamurthyB.:Form2seq: A framework for higher‐order form structure extraction. InProceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)(2020) pp.3830–3840.

4. [AKC*20] AvetisyanA. KhanovaT. ChoyC. DashD. DaiA. NießnerM.:Scenecad: Predicting object alignments and layouts in rgb‐d scans. InEuropean Conference on Computer Vision(2020) Springer pp.596–612.

5. [AW18] AlhashimI. WonkaP.:High quality monocular depth estimation via transfer learning.arXiv preprint arXiv:1812.11941(2018).

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Bird’s-Eye-View Scene Graph for Vision-Language Navigation;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01