On the design of deep learning-based control algorithms for visually guided UAVs engaged in power tower inspection tasks-Reference-Cited by-同舟云学术

On the design of deep learning-based control algorithms for visually guided UAVs engaged in power tower inspection tasks

Published:2024-04-26 Issue: Volume:11 Page:
ISSN:2296-9144
Container-title:Frontiers in Robotics and AI
language:
Short-container-title:Front. Robot. AI

Author:

Maitre Guillaume,Martinot Dimitri,Tuci Elio

Abstract

This paper focuses on the design of Convolution Neural Networks to visually guide an autonomous Unmanned Aerial Vehicle required to inspect power towers. The network is required to precisely segment images taken by a camera mounted on a UAV in order to allow a motion module to generate collision-free and inspection-relevant manoeuvres of the UAV along different types of towers. The images segmentation process is particularly challenging not only because of the different structures of the towers but also because of the enormous variability of the background, which can vary from the uniform blue of the sky to the multi-colour complexity of a rural, forest, or urban area. To be able to train networks that are robust enough to deal with the task variability, without incurring into a labour-intensive and costly annotation process of physical-world images, we have carried out a comparative study in which we evaluate the performances of networks trained either with synthetic images (i.e., the synthetic dataset), physical-world images (i.e., the physical-world dataset), or a combination of these two types of images (i.e., the hybrid dataset). The network used is an attention-based U-NET. The synthetic images are created using photogrammetry, to accurately model power towers, and simulated environments modelling a UAV during inspection of different power towers in different settings. Our findings reveal that the network trained on the hybrid dataset outperforms the networks trained with the synthetic and the physical-world image datasets. Most notably, the networks trained with the hybrid dataset demonstrates a superior performance on multiples evaluation metrics related to the image-segmentation task. This suggests that, the combination of synthetic and physical-world images represents the best trade-off to minimise the costs related to capturing and annotating physical-world images, and to maximise the task performances. Moreover, the results of our study demonstrate the potential of photogrammetry in creating effective training datasets to design networks to automate the precise movement of visually-guided UAVs.

Publisher

Frontiers Media SA

Reference47 articles.

1. Path planning techniques for unmanned aerial vehicles: a review, solutions, and challenges;Aggarwal;Comput. Commun.,2020

2. Field coverage for weed mapping: toward experiments with a uav swarm;Albani,2019

3. Segnet: a deep convolutional encoder-decoder architecture for image segmentation;Badrinarayanan;IEEE Trans. pattern analysis Mach. Intell.,2017

4. Unmanned aircraft system path planning for visually inspecting electric transmission towers;Baik;J. Intelligent Robotic Syst.,2019

5. Attention augmented convolutional networks;Bello,2019

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Autonomous UAV navigation using deep learning-based computer vision frameworks: A systematic literature review;Array;2024-09