Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran—A Polyabolo-Inspired Self-Reconfigurable Tiling Robot

Published: 2021-04-07 Issue: 8 Volume: 21 Page: 2577
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Le Anh Vu, Veerajagadheswar Prabakaran, Thiha Kyaw Phone, Elara Mohan Rajesh, Nhan Nguyen Huu Khanh

Abstract

A critical challenge in deploying cleaning robots is achieving complete coverage of the target area. Current tiling robots for area coverage have fixed forms and can clean only certain areas. A reconfigurable system addresses this coverage problem: the tiling robot aims to cover the entire area by reconfiguring into different shapes according to the area’s needs. When sequencing navigation, the robot needs a structure that extends its coverage range while saving energy, so that it covers larger areas entirely with the fewest required actions. This paper presents a complete coverage path planning (CPP) method for hTetran, a polyabolo-based tiling robot, built on TSP-based reinforcement learning optimization. The method simultaneously produces robot shapes and sequential trajectories while maximizing the reward of the trained reinforcement learning (RL) model within the predefined polyabolo-based tileset. To this end, a reinforcement learning-based traveling salesman problem (TSP) solver with the proximal policy optimization (PPO) algorithm was trained using the complementary learning computation of the TSP sequencing. The results of the proposed RL-TSP-based CPP for hTetran were compared, in terms of energy and time spent, with conventional tiling-based models in which the TSP is solved through an evolutionary ant colony optimization (ACO) approach. The CPP demonstrates an ability to generate a Pareto-optimal trajectory that enhances the robot’s navigation in a real environment, with the least energy and time spent compared with conventional techniques.
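
The abstract names a baseline in which the TSP over tile waypoints is solved with ant colony optimization. As a rough illustration of that kind of baseline, and not the authors' implementation, the Python sketch below runs a basic ant colony search over a small grid of waypoints. The waypoint layout, the parameter values, and the function names `tour_length` and `aco_tsp` are all assumptions made for this example; Euclidean tour length stands in for the energy and time cost discussed in the abstract.

```python
import math
import random


def tour_length(tour, pts):
    """Length of the closed tour that visits pts in the given index order."""
    n = len(tour)
    return sum(math.dist(pts[tour[k]], pts[tour[(k + 1) % n]]) for k in range(n))


def aco_tsp(pts, n_ants=20, n_iters=50, alpha=1.0, beta=3.0, rho=0.5, seed=0):
    """Basic ant colony optimization for a symmetric TSP (illustrative only).

    Assumes all points are distinct, so every pairwise distance is non-zero.
    Returns (best_tour, best_length).
    """
    rng = random.Random(seed)
    n = len(pts)
    dist = [[math.dist(a, b) for b in pts] for a in pts]
    tau = [[1.0] * n for _ in range(n)]  # pheromone level on each edge
    best_tour, best_len = None, float("inf")

    for _ in range(n_iters):
        tours = []
        for _ in range(n_ants):
            tour = [rng.randrange(n)]
            unvisited = set(range(n)) - {tour[0]}
            while unvisited:
                i = tour[-1]
                cand = sorted(unvisited)
                # Favour edges with more pheromone and shorter length.
                weights = [tau[i][j] ** alpha * (1.0 / dist[i][j]) ** beta
                           for j in cand]
                nxt = rng.choices(cand, weights=weights)[0]
                tour.append(nxt)
                unvisited.remove(nxt)
            length = tour_length(tour, pts)
            tours.append((tour, length))
            if length < best_len:
                best_tour, best_len = tour, length

        # Evaporate pheromone, keeping a small floor so weights stay positive.
        for i in range(n):
            for j in range(n):
                tau[i][j] = max(tau[i][j] * (1.0 - rho), 1e-9)
        # Each ant deposits pheromone in inverse proportion to its tour length.
        for tour, length in tours:
            for k in range(n):
                a, b = tour[k], tour[(k + 1) % n]
                tau[a][b] += 1.0 / length
                tau[b][a] += 1.0 / length

    return best_tour, best_len


if __name__ == "__main__":
    # Hypothetical waypoints: centres of a 2 x 3 grid of unit tiles.
    waypoints = [(0, 0), (1, 0), (2, 0), (0, 1), (1, 1), (2, 1)]
    order, length = aco_tsp(waypoints)
    print("visit order:", order, "tour length:", round(length, 3))
```

The returned order is a visiting sequence for the waypoints. A planner of the kind the abstract describes would additionally choose a robot shape for each tile, which this sketch does not model.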

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/21/8/2577/pdf

References: 41 articles.

1. Mobile Robot Navigation and Obstacle Avoidance Techniques: A Review

2. Designing Autonomous Mobile Robots: Inside the Mind of an Intelligent Machine; Holland, 2004

3. Self-reconfigurable robots

4. Event-Triggered Decentralized Tracking Control of Modular Reconfigurable Robots Through Adaptive Dynamic Programming

5. Optimization Complete Area Coverage by Reconfigurable hTrihex Tiling Robot

Cited by 25 articles.

1. A Multi-Area Task Path-Planning Algorithm for Agricultural Drones Based on Improved Double Deep Q-Learning Net; Agriculture; 2024-08-05

2. Combining reinforcement learning algorithm and genetic algorithm to solve the traveling salesman problem; The Journal of Engineering; 2024-05-29

3. Complete coverage path planning for reconfigurable omni-directional mobile robots with varying width using GBNN(n); Expert Systems with Applications; 2023-10

4. A Deep Reinforcement Learning Approach to Optimal Morphologies Generation in Reconfigurable Tiling Robots; Mathematics; 2023-09-13

5. Path planning for obstacle avoidance of unmanned vehicles based on Frenet coordinates and B-spline curves; Eighth International Conference on Electromechanical Control Technology and Transportation (ICECTT 2023); 2023-09-07