Affiliation:
1. School of Electronic and Information Engineering Nanjing University of Information Science and Technology Nanjing 210044 China
2. College of Automation Nanjing University of Information Science and Technology Nanjing 210044 China
3. School of Electronic and Information Engineering Wuxi University Wuxi 214105 China
Abstract
AbstractA double branch fusion network is proposed based on unmanned aerial vehicle (UAV) inspection images to increase the detection accuracy of vital components and defects in transmission lines. The backbone feature extraction network comprises a combination of a convolutional neural network (CNN) and a Transformer network. To be specific, the CNN should extract local information, and the Transformer network is responsible for the extraction of global information. Besides, global information and local information have semantic differences, while resulting in feature aliasing after fusion. To solve this problem, a multiscale convolution module and a multiscale pooling module are proposed to solve semantic differences and feature aliasing through the interaction between two types of information. In general, the enhanced feature extraction network comprises a residual‐like convolution module, which can reduce the loss of detailed information (e.g., edge contours) and further extract high‐level semantic information from the deep network. Besides, it performs feature fusion in multiple regions in the enhanced feature extraction network, such that the multi‐scale adaptability of the neural network is effectively enhanced. Last, the fused feature information at different scales is decoded, and the final detection results are yielded.
Subject
Multidisciplinary,Modeling and Simulation,Numerical Analysis,Statistics and Probability