Image Detection Network Based on Enhanced Small Target Recognition Details and Its Application in Fine Granularity-Reference-Cited by-同舟云学术

Image Detection Network Based on Enhanced Small Target Recognition Details and Its Application in Fine Granularity

Published:2024-06-04 Issue:11 Volume:14 Page:4857
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Fu Qiang¹,Tao Xiaoping¹,Deng Weijie¹,Liu Hongliang¹^ORCID

Affiliation:

1. Key Laboratory of Advanced Manufacturing for Optical Systems (Chinese Academy of Sciences), Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Science, Changchun 130033, China

Abstract

Image detection technology is of paramount importance across various fields. This significance is not only seen in general images with everyday scenes but also holds substantial research value in the field of remote sensing. Remote sensing images involve capturing images from aircraft or satellites. These images typically feature diverse scenes, large image formats, and varying imaging heights, thus leading to numerous small-sized targets in the captured images. Accurately identifying these small targets, which may occupy only a few pixels, is a challenging and active research area. Current methods mainly fall into two categories: enhancing small target features by improving resolution and increasing the number of small targets to bolster training datasets. However, these approaches often fail to address the core distinguishing features of small targets in the original images, thus resulting in suboptimal performance in fine-grained classification tasks. To address this situation, we propose a new network structure DDU (Downsample Difference Upsample), which is based on differential and resolution changing methods in the Neck layer of deep learning networks to enhance the recognition features of small targets, thus further improving the feature richness of recognition and effectively solving the problem of low accuracy in small target object recognition. At the same time, in order to take into account the recognition effect of targets of other sizes in the image, a new attention mechanism called PNOC (protecting the number of channels) is proposed, which integrates small target features and universal object features without losing the number of channels, thereby increasing the accuracy of recognition. And experimental verification was conducted on the PASCAL-VOC dataset. At the same time, it was applied to the testing of the fine-grained MAR20 dataset and found that the performance was better than other classic algorithms. At the same time, because the proposed framework belongs to a one-stage detection method, it has good engineering applicability and scalability, and universality in scientific research applications are good. Through comparative experiments, it was found that our algorithm improved the performance of the mAP by 0.7% compared to the original YOLOv8 algorithm.

Funder

National Natural Science Foundation of China

Jilin Province Innovation and Entrepreneurial Talent Project

Youth innovation promotion association cas

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/11/4857/pdf

Reference39 articles.

1. Xiao, J., Zhao, T., Yao, Y., Yu, Q., and Chen, Y. (2023, June 23). Context Augmentation and Feature Refinement Network for Tiny Object Detection. Available online: https://paperswithcode.com/paper/context-augmentation-and-feature-refinement.

2. Bochkovskiy, A., Wang, C.Y., and Liao, H.Y.M. (2020). Yolov4: Optimal speed and accuracy of object detection. arXiv.

3. Stitcher: Feedback-driven data provider for object detection;Chen;arXiv,2020

4. PCB electronic component defect detection method based on improved YOLOv4 algorithm;Xin;J. Phys. Conf. Ser.,2021

5. (2023, July 12). Microsoft COCO: Common Objects in Context. Available online: https://github.com/pjreddie/darknet/tree/master/scripts/get_coco_dataset.sh.