Affiliation:
1. Google Inc., Mountain View, USA
2. University of Pittsburgh, Pittsburgh, USA
Abstract
Three-dimensional (3D) urban models have gained interest because of their many applications, such as disaster management, energy management, and solar potential analysis. However, generating these 3D representations of buildings requires lidar data, which are usually expensive to collect. Consequently, lidar data are not frequently updated and are not widely available for many regions in the US. As such, 3D models based on these lidar data are either outdated or limited to those locations where the data are available. In contrast, satellite images are freely available and frequently updated. We propose sat2Map, a novel deep learning-based approach that predicts building roof geometries and heights directly from a single 2D satellite image. Our method first uses sat2pc to predict the point cloud by integrating two distinct loss functions, Chamfer Distance and Earth Mover’s Distance, resulting in a 3D point cloud output that balances overall structure and finer details. Additionally, we introduce sat2height, a height estimation model that estimates the height of the predicted point cloud to generate the final 3D building structure for a given location. We extensively evaluate our model on a building roof dataset and conduct ablation studies to analyze its performance. Our results demonstrate that sat2Map consistently outperforms existing baseline methods by at least 18.6%. We also show that our refinement module significantly improves the overall performance, yielding more accurate and fine-grained 3D outputs. Finally, our sat2height model predicts height parameters with high accuracy: our evaluation results show that we can estimate building heights with a median mean absolute error of less than 30 cm while still preserving the overall structure of the building.
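The two point cloud losses named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the equal default weights `cd_weight` and `emd_weight`, and the use of unsquared Euclidean distance are assumptions made for illustration, and the Earth Mover's Distance here is computed by brute force over all one-to-one matchings, which is only feasible for a handful of points.

```python
import itertools
import math


def chamfer_distance(a, b):
    """Symmetric Chamfer Distance between two point sets.

    Each point is matched to its nearest neighbour in the other set, and the
    mean nearest-neighbour distances in both directions are summed.
    (Assumption: unsquared Euclidean distance; some variants square it.)
    """
    def one_way(src, dst):
        return sum(min(math.dist(p, q) for q in dst) for p in src) / len(src)

    return one_way(a, b) + one_way(b, a)


def earth_movers_distance(a, b):
    """Earth Mover's Distance for two equal-size point sets.

    Finds the one-to-one matching with the smallest total distance and
    returns the mean matched distance. Brute force over all permutations,
    so only suitable for very small illustrative inputs.
    """
    if len(a) != len(b):
        raise ValueError("this sketch requires point sets of equal size")
    n = len(a)
    best_total = min(
        sum(math.dist(a[i], b[j]) for i, j in enumerate(perm))
        for perm in itertools.permutations(range(n))
    )
    return best_total / n


def combined_loss(pred, target, cd_weight=1.0, emd_weight=1.0):
    """Weighted sum of the two losses (the weights are hypothetical)."""
    return (cd_weight * chamfer_distance(pred, target)
            + emd_weight * earth_movers_distance(pred, target))


# Two tiny "roofs": the prediction sits one unit below the target.
pred = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
target = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
print(chamfer_distance(pred, target))
print(earth_movers_distance(pred, target))
print(combined_loss(pred, target))
```

Chamfer Distance lets several predicted points share the same nearest target point, so it mainly rewards covering the overall shape, whereas the one-to-one matching in Earth Mover's Distance penalizes uneven point density. That difference is one plausible reading of why combining them balances overall structure and finer details.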
Publisher
Association for Computing Machinery (ACM)