FEATURE FUSION FOR CROSS-MODAL SCENE CLASSIFICATION OF REMOTE SENSING IMAGE-Reference-Cited by-同舟云学术

FEATURE FUSION FOR CROSS-MODAL SCENE CLASSIFICATION OF REMOTE SENSING IMAGE

Published:2021-08-10 Issue: Volume:XLIV-M-3-2021 Page:63-66
ISSN:2194-9034
Container-title:The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
language:en
Short-container-title:Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci.

Author:

Geng W.,Zhou W.,Jin S.^ORCID

Abstract

Abstract. Scene classification plays an important role in remote sensing field. Traditional approaches use high-resolution remote sensing images as data source to extract powerful features. Although these kind of methods are common, the model performance is severely affected by the image quality of the dataset, and the single modal (source) of images tend to cause the mission of some scene semantic information, which eventually degrade the classification accuracy. Nowadays, multi-modal remote sensing data become easy to obtain since the development of remote sensing technology. How to carry out scene classification of cross-modal data has become an interesting topic in the field. To solve the above problems, this paper proposes using feature fusion for cross-modal scene classification of remote sensing image, i.e., aerial and ground street view images, expecting to use the advantages of aerial images and ground street view data to complement each other. Our cross- modal model is based on Siamese Network. Specifically, we first train the cross-modal model by pairing different sources of data with aerial image and ground data. Then, the trained model is used to extract the deep features of the aerial and ground image pair, and the features of the two perspectives are fused to train a SVM classifier for scene classification. Our approach has been demonstrated using two public benchmark datasets, AiRound and CV-BrCT. The preliminary results show that the proposed method achieves state-of-the-art performance compared with the traditional methods, indicating that the information from ground data can contribute to aerial image classification.

Publisher

Copernicus GmbH

Link

https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLIV-M-3-2021/63/2021/isprs-archives-XLIV-M-3-2021-63-2021.pdf

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Resolution invariant urban scene classification using Multiview learning paradigm;Digital Signal Processing;2023-07

2. Multi-View Urban Scene Classification with a Complementary-Information Learning Model;Photogrammetric Engineering & Remote Sensing;2022-01-01