Neural Radiance Fields for High-Resolution Remote Sensing Novel View Synthesis
-
Published:2023-08-08
Issue:16
Volume:15
Page:3920
-
ISSN:2072-4292
-
Container-title:Remote Sensing
-
language:en
-
Short-container-title:Remote Sensing
Author:
Lv Junwei123, Guo Jiayi123, Zhang Yueting123ORCID, Zhao Xin123ORCID, Lei Bin12
Affiliation:
1. Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China 2. Key Laboratory of Technology in Geo-Spatial Information Processing and Application Systems, Chinese Academy of Sciences, Beijing 100190, China 3. School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 101408, China
Abstract
Remote sensing images play a crucial role in remote sensing target detection and 3D remote sensing modeling, and the enhancement of resolution holds significant application implications. The task of remote sensing target detection requires a substantial amount of high-resolution remote sensing images, while 3D reconstruction tasks generate denser models from diverse view perspectives. However, high-resolution remote sensing images are often limited due to their high acquisition costs, a scarcity of acquisition views, and restricted view perspective variations, which pose challenges for remote sensing tasks. In this paper, we propose an advanced method for a high-resolution remote sensing novel view synthesis by integrating attention mechanisms with neural radiance fields to address the scarcity of high-resolution remote sensing images. To enhance the relationships between sampled points and rays and to improve the 3D implicit model representation capability of the network, we introduce a point attention module and batch attention module into the proposed framework. Additionally, a frequency-weighted position encoding strategy is proposed to determine the significance of each frequency for position encoding. The proposed method is evaluated on the LEVIR-NVS dataset and demonstrates superior performance in quality assessment metrics and visual effects compared to baseline NeRF (Neural Radiance Fields) and ImMPI (Implicit Multi-plane Images). Overall, this work presents a promising approach for a remote sensing novel view synthesis by leveraging attention mechanisms and frequency-weighted position encoding.
Funder
The National Natural Science Foundation of China Key Research and Development Program of Aerospace Information Research Institute Chinese Academy of Sciences
Subject
General Earth and Planetary Sciences
Reference43 articles.
1. Heritage Recording and 3D Modeling with Photogrammetry and 3D Scanning;Remondino;Remote Sens.,2011 2. Schonberger, J.L., and Frahm, J.M. (2016, January 27–30). Structure-from-motion revisited. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA. 3. Kanazawa, A., Tulsiani, S., Efros, A.A., and Malik, J. (2018, January 8–14). Learning category-specific mesh reconstruction from image collections. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany. 4. Wang, N., Zhang, Y., Li, Z., Fu, Y., Liu, W., and Jiang, Y.G. (2018, January 8–14). Pixel2mesh: Generating 3D mesh models from single rgb images. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany. 5. Groueix, T., Fisher, M., Kim, V.G., Russell, B.C., and Aubry, M. (2018, January 18–23). A papier-mâché approach to learning 3D surface generation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|