An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots

Published: 2024-07-08
ISSN: 1868-6478
Container-title: Evolving Systems
Language: en
Short-container-title: Evolving Systems

Author:

Cabrera Juan José, Céspedes Orlando José, Cebollada Sergio, Reinoso Oscar, Payá Luis

Abstract

This work evaluates CNN models and data augmentation techniques for the hierarchical localization of a mobile robot using omnidirectional images. It presents an ablation study of different state-of-the-art CNN models used as the backbone and proposes a variety of data augmentation visual effects for the robot's visual localization. The proposed method is based on the adaptation and re-training of a CNN with a dual purpose: (1) to perform a rough localization step, in which the model predicts the room from which an image was captured, and (2) to address the fine localization step, which consists of retrieving the most similar image of the visual map among those contained in the previously predicted room by means of a pairwise comparison between descriptors obtained from an intermediate layer of the CNN. We evaluate the impact of different state-of-the-art CNN models, such as ConvNeXt, on the proposed localization. Finally, each data augmentation visual effect is employed separately to train the model, and its impact is assessed. The performance of the resulting CNNs is evaluated under real operating conditions, including changes in lighting. Our code is publicly available on the project website https://github.com/juanjo-cabrera/IndoorLocalizationSingleCNN.git.
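
The two-step scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (see the linked repository for that). It assumes the CNN has already produced per-room scores and an intermediate-layer descriptor for the query image, and it assumes Euclidean distance for the pairwise descriptor comparison, which the abstract does not specify. The names `hierarchical_localize`, `room_scores`, and `visual_map` are hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple

Descriptor = Sequence[float]
# Hypothetical visual map layout: room index -> list of (image id, descriptor).
VisualMap = Dict[int, List[Tuple[str, Descriptor]]]


def euclidean(a: Descriptor, b: Descriptor) -> float:
    """Euclidean distance between two descriptors (an assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def hierarchical_localize(
    room_scores: Sequence[float],
    query_descriptor: Descriptor,
    visual_map: VisualMap,
) -> Tuple[int, str, float]:
    """Return (predicted room, closest map image id, descriptor distance)."""
    # Rough step: pick the room with the highest classifier score.
    room = max(range(len(room_scores)), key=lambda i: room_scores[i])

    # Fine step: compare the query only against images of the predicted room.
    candidates = visual_map.get(room, [])
    if not candidates:
        raise ValueError(f"no map images stored for room {room}")
    image_id, descriptor = min(
        candidates, key=lambda item: euclidean(query_descriptor, item[1])
    )
    return room, image_id, euclidean(query_descriptor, descriptor)


if __name__ == "__main__":
    # Toy data: two rooms, two-dimensional descriptors.
    toy_map: VisualMap = {
        0: [("r0_a", [0.0, 0.0]), ("r0_b", [1.0, 1.0])],
        1: [("r1_a", [5.0, 5.0]), ("r1_b", [6.0, 5.0])],
    }
    print(hierarchical_localize([0.1, 0.9], [5.8, 5.1], toy_map))
```

Restricting the fine search to the predicted room keeps the number of descriptor comparisons small, at the cost of an unrecoverable error whenever the rough step picks the wrong room.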

Funder

Ministerio de Universidades

Conselleria de Cultura, Educación y Ciencia, Generalitat Valenciana

Agencia Estatal de Investigación

Universidad Miguel Hernández

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s12530-024-09604-6.pdf

References (39 articles)

1. Aguilar WG, Luna MA, Moya JF, Abad V, Parra H, Ruiz H (2017) Pedestrian detection for UAVs using cascade classifiers with meanshift. In: 2017 IEEE 11th international conference on semantic computing (ICSC). IEEE, pp 509–514

2. Alfaro M, Cabrera JJ, Jiménez LM, Reinoso Ó, Payá L (2024) Hierarchical localization with panoramic views and triplet loss functions. arXiv preprint. arXiv:2404.14117

3. Bai D, Wang C, Zhang B, Yi X, Yang X (2018) CNN feature boosted SeqSLAM for real-time loop closure detection. Chin J Electron 27(3):488–499

4. Ballesta M, Payá L, Cebollada S, Reinoso O, Murcia F (2021) A CNN regression approach to mobile robot localization using omnidirectional images. Appl Sci 11(16):7521

5. Cabrera JJ, Cebollada S, Flores M, Reinoso Ó, Payá L (2022) Training, optimization and validation of a CNN for room retrieval and description of omnidirectional images. SN Comput Sci 3(4):1–13