Towards the portability of knowledge in reinforcement learning-based systems for automatic drone navigation

Author:

Barreiro José M.1,Lara Juan A.2,Manrique Daniel1,Smith Peter3

Affiliation:

1. Departamento de Inteligencia Artificial, Universidad Politécnica de Madrid, Madrid, Spain

2. Department of Computer Science and Numerical Analysis, Universidad de Córdoba, Córdoba, Spain

3. University of Sunderland, Sunderland, United Kingdom

Abstract

In the field of artificial intelligence (AI) one of the main challenges today is to make the knowledge acquired when performing a certain task in a given scenario applicable to similar yet different tasks to be performed with a certain degree of precision in other environments. This idea of knowledge portability is of great use in Cyber-Physical Systems (CPS) that face important challenges in terms of reliability and autonomy. This article presents a CPS where unmanned vehicles (drones) are equipped with a reinforcement learning system so they may automatically learn to perform various navigation tasks in environments with physical obstacles. The implemented system is capable of isolating the agents’ knowledge and transferring it to other agents that do not have prior knowledge of their environment so they may successfully navigate environments with obstacles. A complete study has been performed to ascertain the degree to which the knowledge obtained by an agent in a scenario may be successfully transferred to other agents in order to perform tasks in other scenarios without prior knowledge of the same, obtaining positive results in terms of the success rate and learning time required to complete the task set in each case. In particular, those two indicators showed better results (higher success rate and lower learning time) with our proposal compared to the baseline in 47 out of the 60 tests conducted (78.3%).

Publisher

PeerJ

Subject

General Computer Science

Reference34 articles.

1. Autonomous navigation via deep reinforcement learning for resource constraint edge nodes using transfer learning;Anwar;IEEE Access,2020

2. Uncertainty handling in cyber–physical systems: state-of-the-art approaches, tools, causes, and future directions;Asmat;Journal of Software: Evolution and Process,2022

3. Neuronlike elements that can solve difficult learning control problems;Barto;IEEE Transactions on Systems, Man, and Cybernetics,1983

4. A deep reinforcement learning approach for solving the traveling salesman problem with drone;Bogyrbayeva;Transportation Research Part C: Emerging Technologies,2023

5. Deep reinforcement learning and its neuroscientific implications;Botvinick;Neuron,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3