Multimodal Few-Shot Target Detection Based on Uncertainty Analysis in Time-Series Images

Author:

Khoshboresh-Masouleh MehdiORCID,Shah-Hosseini RezaORCID

Abstract

The ability to interpret multimodal data, and map the targets and anomalies within, is important for an automatic recognition system. Due to the expensive and time-consuming nature of multimodal time-series data annotation in the training stage, multimodal time-series image understanding, from drone and quadruped mobile robot platforms, is a challenging task for remote sensing and photogrammetry. In this regard, robust methods must be computationally low-cost, due to the limited data on aerial and ground-based platforms, yet accurate enough to meet certainty measures. In this study, a few-shot learning architecture, based on a squeeze-and-attention structure, is proposed for multimodal target detection, using time-series images from the drone and quadruped robot platforms with a small training dataset. To build robust algorithms in target detection, a squeeze-and-attention structure has been developed from multimodal time-series images from limited training data as an optimized method. The proposed architecture was validated on three datasets with multiple modalities (e.g., red-green-blue, color-infrared, and thermal), achieving competitive results.

Publisher

MDPI AG

Subject

Artificial Intelligence,Computer Science Applications,Aerospace Engineering,Information Systems,Control and Systems Engineering

Reference45 articles.

1. Multiscale Anti-Deformation Network for Target Tracking in UAV Aerial Videos;Bi;JARS,2022

2. Vehicle Detection Method for Satellite Videos Based on Enhanced Vehicle Features;Lv;JARS,2022

3. Ghosh, U., Maleh, Y., Alazab, M., and Pathan, A.-S.K. (2021). Machine Intelligence and Data Analytics for Sustainable Future Smart Cities, Springer International Publishing. Studies in Computational Intelligence.

4. Performance of a Modified YOLOv3 Object Detector on Remotely Piloted Aircraft System Acquired Full Motion Video;Faraj;JARS,2022

5. Han, G., Ma, J., Huang, S., Chen, L., Chellappa, R., and Chang, S.-F. (2022). Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting. arXiv.

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3