Efficient knowledge distillation for hybrid models: A vision transformer‐convolutional neural network to convolutional neural network approach for classifying remote sensing images-Reference-Cited by-同舟云学术

Efficient knowledge distillation for hybrid models: A vision transformer‐convolutional neural network to convolutional neural network approach for classifying remote sensing images

Published:2024-07-10 Issue:3 Volume:6 Page:
ISSN:2631-6315
Container-title:IET Cyber-Systems and Robotics
language:en
Short-container-title:IET Cyber-Syst and Robotics

Author:

Song Huaxiang¹^ORCID,Yuan Yuxuan¹,Ouyang Zhiwei¹,Yang Yu¹,Xiang Hui¹

Affiliation:

1. School of Geography Science and Tourism Hunan University of Arts and Science Changde Hunan China

Abstract

AbstractIn various fields, knowledge distillation (KD) techniques that combine vision transformers (ViTs) and convolutional neural networks (CNNs) as a hybrid teacher have shown remarkable results in classification. However, in the realm of remote sensing images (RSIs), existing KD research studies are not only scarce but also lack competitiveness. This issue significantly impedes the deployment of the notable advantages of ViTs and CNNs. To tackle this, the authors introduce a novel hybrid‐model KD approach named HMKD‐Net, which comprises a CNN‐ViT ensemble teacher and a CNN student. Contrary to popular opinion, the authors posit that the sparsity in RSI data distribution limits the effectiveness and efficiency of hybrid‐model knowledge transfer. As a solution, a simple yet innovative method to handle variances during the KD phase is suggested, leading to substantial enhancements in the effectiveness and efficiency of hybrid knowledge transfer. The authors assessed the performance of HMKD‐Net on three RSI datasets. The findings indicate that HMKD‐Net significantly outperforms other cutting‐edge methods while maintaining a significantly smaller size. Specifically, HMKD‐Net exceeds other KD‐based methods with a maximum accuracy improvement of 22.8% across various datasets. As ablation experiments indicated, HMKD‐Net has cut down on time expenses by about 80% in the KD process. This research study validates that the hybrid‐model KD technique can be more effective and efficient if the data distribution sparsity in RSIs is well handled.

Publisher

Institution of Engineering and Technology (IET)

Reference76 articles.

1. Remote sensing in forestry: current challenges, considerations and directions

2. Wetland identification through remote sensing: Insights into wetness, greenness, turbidity, temperature, and changing landscapes

3. Transfer learning in environmental remote sensing

4. Target Detection Model Distillation Using Feature Transition and Label Registration for Remote Sensing Imagery

5. Remote Sensing Object Detection Meets Deep Learning: A metareview of challenges and advances