Contrasting Dual Transformer Architectures for Multi-Modal Remote Sensing Image Retrieval

Published:2022-12-26 Issue:1 Volume:13 Page:282
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Rahhal Mohamad M. Al, Bencherif Mohamed Abdelkader, Bazi Yakoub, Alharbi Abdullah, Mekhalfi Mohamed Lamine

Abstract

Remote sensing technology has advanced rapidly in recent years. Because of the deployment of quantitative and qualitative sensors, as well as the evolution of powerful hardware and software platforms, it powers a wide range of civilian and military applications. This in turn leads to the availability of large data volumes suitable for a broad range of applications such as monitoring climate change. Yet processing, retrieving, and mining such large volumes of data is challenging. Usually, content-based remote sensing (RS) image retrieval approaches rely on a query image to retrieve relevant images from the dataset. To increase the flexibility of the retrieval experience, cross-modal representations based on text–image pairs are gaining popularity. Indeed, combining text and image domains is regarded as one of the next frontiers in RS image retrieval. However, aligning text to the content of RS images is particularly challenging due to the visual-semantic discrepancy between the language and vision worlds. In this work, we propose different architectures based on vision and language transformers for text-to-image and image-to-text retrieval. Extensive experimental results on four different datasets, namely TextRS, Merced, Sydney, and RSICD, are reported and discussed.

Funder

King Saud University

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/1/282/pdf

Reference: 34 articles.

1. Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities;Cheng;IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens.,2020

2. Toward Remote Sensing Image Retrieval Under a Deep Image Captioning Perspective;Hoxha;IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens.,2020

3. Mapping Crop Types in Complex Farming Areas Using SAR Imagery with Dynamic Time Warping;Gella;ISPRS J. Photogramm. Remote Sens.,2021

4. A Decision-Level Fusion Approach to Tree Species Classification from Multi-Source Remotely Sensed Data;Hu;ISPRS Open J. Photogramm. Remote Sens.,2021

5. M3C2-EP: Pushing the Limits of 3D Topographic Point Cloud Change Detection by Error Propagation;Winiwarter;ISPRS J. Photogramm. Remote Sens.,2021

Cited by 2 articles.

1. An Intra-Class Ranking Metric for Remote Sensing Image Retrieval;Remote Sensing;2023-08-09

2. Exploring Uni-Modal Feature Learning on Entities and Relations for Remote Sensing Cross-Modal Text-Image Retrieval;IEEE Transactions on Geoscience and Remote Sensing;2023