Transformers for Remote Sensing: A Systematic Review and Analysis
Author:
Wang Ruikun12ORCID, Ma Lei3ORCID, He Guangjun12, Johnson Brian4ORCID, Yan Ziyun3, Chang Ming12, Liang Ying12
Affiliation:
1. Beijing Institute of Satellite Information Engineering, Beijing 100095, China 2. State Key Laboratory of Space-Ground Integrated Information Technology, Space Star Technology Co., Ltd., Beijing 100095, China 3. Jiangsu Provincial Key Laboratory of Geographic Information Science and Technology, Key Laboratory for Land Satellite Remote Sensing Applications of Ministry of Natural Resources, School of Geography and Ocean Science, Nanjing University, Nanjing 210023, China 4. Natural Resources and Ecosystem Services, Institute for Global Environmental Strategies, 2108-11, Kamiyamaguchi, Hayama, Kanagawa 240-0115, Japan
Abstract
Research on transformers in remote sensing (RS), which started to increase after 2021, is facing the problem of a relative lack of review. To understand the trends of transformers in RS, we undertook a quantitative analysis of the major research on transformers over the past two years by dividing the application of transformers into eight domains: land use/land cover (LULC) classification, segmentation, fusion, change detection, object detection, object recognition, registration, and others. Quantitative results show that transformers achieve a higher accuracy in LULC classification and fusion, with more stable performance in segmentation and object detection. Combining the analysis results on LULC classification and segmentation, we have found that transformers need more parameters than convolutional neural networks (CNNs). Additionally, further research is also needed regarding inference speed to improve transformers’ performance. It was determined that the most common application scenes for transformers in our database are urban, farmland, and water bodies. We also found that transformers are employed in the natural sciences such as agriculture and environmental protection rather than the humanities or economics. Finally, this work summarizes the analysis results of transformers in remote sensing obtained during the research process and provides a perspective on future directions of development.
Funder
National Natural Science Foundation of China Key Laboratory of Land Satellite Remote Sensing Application, Ministry of Natural Resources of the People’s Republic of China Beijing Nova Program Open Research Fund of the State Key Laboratory of Space–Earth Integrated Information Technology
Reference94 articles.
1. Deep Learning in Remote Sensing Applications: A Meta-Analysis and Review;Ma;ISPRS J. Photogramm.,2019 2. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2021, January 4). An Image Is Worth 16 × 16 Words: Transformers for Image Recognition at Scale. Proceedings of the International Conference on Learning Representations, Vienna, Austria. 3. Zhu, X., Su, W., Lu, L., Li, B., Wang, X., and Dai, J. (2021, January 4). Deformable DETR: Deformable Transformers for End-to-End Object Detection. Proceedings of the International Conference on Learning Representations, Vienna, Austria. 4. Zheng, S., Lu, J., Zhao, H., Zhu, X., Luo, Z., Wang, Y., Fu, Y., Feng, J., Xiang, T., and Torr, P.H.S. (2021, January 19–25). Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers. Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual. 5. HSI-BERT: Hyperspectral Image Classification Using the Bidirectional Encoder Representation From Transformers;He;IEEE Trans. Geosci. Remote Sens.,2020
|
|