Hybrid fuzzy AHP–TOPSIS approach to prioritizing solutions for inverse reinforcement learning

Published:2022-07-20 Issue:1 Volume:9 Page:493-513
ISSN:2199-4536
Container-title:Complex & Intelligent Systems
language:en
Short-container-title:Complex Intell. Syst.

Author:

Kukreja Vinay

Abstract

Reinforcement learning (RL) techniques support building solutions for sequential decision-making problems under uncertainty and ambiguity. In RL, an agent guided by a reward function interacts with a dynamic environment to find an optimal policy. RL has several limitations: the reward function must be specified in advance, design is difficult, and large, complex problems are hard to handle. These limitations led to the development of inverse reinforcement learning (IRL). IRL has its own practical problems, such as the lack of robust reward functions and ill-posedness, and solutions such as maximum entropy and support for multiple rewards and non-linear reward functions have been proposed to address them. Eight major problems are associated with IRL, and eight solutions have been proposed to address them. This paper proposes a hybrid fuzzy AHP–TOPSIS approach to prioritize these solutions when implementing IRL. The fuzzy Analytical Hierarchical Process (FAHP) is used to obtain the weights of the identified problems. The relative accuracy and root-mean-squared error using FAHP are 97.74 and 0.0349, respectively. The fuzzy Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) then uses these FAHP weights to prioritize the solutions. The most significant problem in IRL implementation is the ‘lack of robust reward functions’, with a weight of 0.180, whereas the most significant solution is ‘Supports optimal policy and rewards functions along with stochastic transition models’, with a closeness coefficient (CofC) value of 0.967156846.
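
To make the ranking step in the abstract concrete, the following is a minimal sketch of how TOPSIS turns criterion weights into a closeness coefficient per alternative. It is a simplification under stated assumptions, not the paper's procedure: it uses crisp numbers rather than fuzzy ones, treats every criterion as a benefit criterion, and takes the weights as given instead of deriving them with FAHP. The function name `topsis_closeness` and the toy scores and weights are hypothetical and are not taken from the paper.

```python
import math


def topsis_closeness(scores, weights):
    """Return one closeness coefficient per alternative (row of `scores`).

    Assumptions (illustrative, not from the paper): crisp scores, all
    criteria are benefit criteria (larger is better), no all-zero column,
    and the alternatives are not all identical.
    """
    n_criteria = len(weights)
    # Vector-normalize each criterion column, then apply its weight.
    norms = [math.sqrt(sum(row[j] ** 2 for row in scores))
             for j in range(n_criteria)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n_criteria)]
                for row in scores]
    # Ideal point (best value per criterion) and anti-ideal point (worst).
    best = [max(row[j] for row in weighted) for j in range(n_criteria)]
    worst = [min(row[j] for row in weighted) for j in range(n_criteria)]
    closeness = []
    for row in weighted:
        d_best = math.dist(row, best)    # distance to the ideal point
        d_worst = math.dist(row, worst)  # distance to the anti-ideal point
        closeness.append(d_worst / (d_best + d_worst))
    return closeness


# Hypothetical toy data: 3 alternatives scored on 3 criteria.
toy_scores = [[7, 9, 8], [5, 6, 4], [8, 7, 9]]
toy_weights = [0.5, 0.3, 0.2]  # stand-in for weights a method like FAHP would supply
cc = topsis_closeness(toy_scores, toy_weights)
# Indices of alternatives from highest to lowest closeness coefficient.
ranking = sorted(range(len(cc)), key=lambda i: cc[i], reverse=True)
```

A closeness coefficient near 1 means the alternative lies close to the ideal point and far from the anti-ideal point, which is why the solution with the highest value is ranked first.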

Publisher

Springer Science and Business Media LLC

Subject

Computational Mathematics,Engineering (miscellaneous),Information Systems,Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s40747-022-00807-5.pdf

Cited by 47 articles.

1. Efficacy of CNN-LSTM Structures in the Identification of Subtle Motion Variations in Tennis Serve;2024 2nd International Conference on Sustainable Computing and Smart Systems (ICSCSS);2024-07-10

2. Synergizing CNN and Random Forest for Accurate Cattle Disease Identification;2024 IEEE International Conference on Information Technology, Electronics and Intelligent Communication Systems (ICITEICS);2024-06-28

3. Fruitful Fusion: CNN-Random Forest Synergy in Banana Ripeness Detection;2024 IEEE International Conference on Information Technology, Electronics and Intelligent Communication Systems (ICITEICS);2024-06-28

4. Navigating Water Scarcity with IoT: A Smart Management System Approach;2023 4th International Conference on Intelligent Technologies (CONIT);2024-06-21

5. Harmonic Harmony: Advancing Musical Instrument Classification through CNN-SVM;2023 4th International Conference on Intelligent Technologies (CONIT);2024-06-21