Learning to Interpret Satellite Images using Wikipedia-Reference-Cited by-同舟云学术

Learning to Interpret Satellite Images using Wikipedia

Published:2019-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Uzkent Burak¹,Sheehan Evan¹,Meng Chenlin¹,Tang Zhongyi²,Burke Marshall²,Lobell David²,Ermon Stefano¹

Affiliation:

1. Department of Computer Science, Stanford University

2. Department of Earth Systems Science, Stanford University

Abstract

Despite recent progress in computer vision, fine-grained interpretation of satellite images remains challenging because of a lack of labeled training data. To overcome this limitation, we construct a novel dataset called WikiSatNet by pairing geo-referenced Wikipedia articles with satellite imagery of their corresponding locations. We then propose two strategies to learn representations of satellite images by predicting properties of the corresponding articles from the images. Leveraging this new multi-modal dataset, we can drastically reduce the quantity of human-annotated labels and time required for downstream tasks. On the recently released fMoW dataset, our pre-training strategies can boost the performance of a model pre-trained on ImageNet by up to 4.5% in F1 score.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enriching satellite image annotations of forests with keyphrases from a specialized corpus;Multimedia Tools and Applications;2024-08-13

2. UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web;Proceedings of the ACM Web Conference 2024;2024-05-13

3. Terrain-Informed Self-Supervised Learning: Enhancing Building Footprint Extraction From LiDAR Data With Limited Annotations;IEEE Transactions on Geoscience and Remote Sensing;2024

4. Domain adaptation in segmenting historical maps: A weakly supervised approach through spatial co-occurrence;ISPRS Journal of Photogrammetry and Remote Sensing;2023-03

5. MM-Locate-News: Multimodal Focus Location Estimation in News;MultiMedia Modeling;2023