Building Better Models: Benchmarking Feature Extraction and Matching for Structure from Motion at Construction Sites
-
Published:2024-08-14
Issue:16
Volume:16
Page:2974
-
ISSN:2072-4292
-
Container-title:Remote Sensing
-
language:en
-
Short-container-title:Remote Sensing
Author:
Cueto Zumaya Carlos Roberto1ORCID, Catalano Iacopo1ORCID, Queralta Jorge Peña12ORCID
Affiliation:
1. Department of Computing, Faculty of Technology, University of Turku, 20014 Turku, Finland 2. Institute of Robotics and Intelligent Systems, ETH Zurich, 8092 Zurich, Switzerland
Abstract
The popularity of Structure from Motion (SfM) techniques has significantly advanced 3D reconstruction in various domains, including construction site mapping. Central to SfM, is the feature extraction and matching process, which identifies and correlates keypoints across images. Previous benchmarks have assessed traditional and learning-based methods for these tasks but have not specifically focused on construction sites, often evaluating isolated components of the SfM pipeline. This study provides a comprehensive evaluation of traditional methods (e.g., SIFT, AKAZE, ORB) and learning-based methods (e.g., D2-Net, DISK, R2D2, SuperPoint, SOSNet) within the SfM pipeline for construction site mapping. It also compares matching techniques, including SuperGlue and LightGlue, against traditional approaches such as nearest neighbor. Our findings demonstrate that deep learning-based methods such as DISK with LightGlue and SuperPoint with various matchers consistently outperform traditional methods like SIFT in both reconstruction quality and computational efficiency. Overall, the deep learning methods exhibited better adaptability to complex construction environments, leveraging modern hardware effectively, highlighting their potential for large-scale and real-time applications in construction site mapping. This benchmark aims to assist researchers in selecting the optimal combination of feature extraction and matching methods for SfM applications at construction sites.
Reference56 articles.
1. Building rome in a day;Agarwal;Commun. ACM,2011 2. Frahm, J.M., Fite-Georgel, P., Gallup, D., Johnson, T., Raguram, R., Wu, C., Jen, Y.H., Dunn, E., Clipp, B., and Lazebnik, S. (2010, January 5–11). Building rome on a cloudless day. Proceedings of the Computer Vision–ECCV 2010: 11th European Conference on Computer Vision, Heraklion, Crete, Greece. Proceedings, Part IV 11. 3. Heinly, J., Schonberger, J.L., Dunn, E., and Frahm, J.M. (2015, January 7–12). Reconstructing the world* in six days*(as captured by the yahoo 100 million image dataset). Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA. 4. Schonberger, J.L., and Frahm, J.M. (2016, January 27–30). Structure-from-motion revisited. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA. 5. Applications of structure from motion: A survey;Wei;J. Zhejiang Univ. Sci. C,2013
|
|