Opposition-Based Reinforcement Learning

Published: 2006-07-20 Issue: 4 Volume: 10 Page: 578-585
ISSN: 1883-8014
Container-title: Journal of Advanced Computational Intelligence and Intelligent Informatics
Language: en
Short-container-title: JACIII

Author:

Tizhoosh Hamid R.

Abstract

Reinforcement learning is a machine intelligence scheme for learning in highly dynamic, probabilistic environments. By interacting with the environment, reinforcement agents learn optimal control policies, especially in the absence of a priori knowledge and/or a sufficiently large amount of training data. Despite these advantages, reinforcement learning suffers from a major drawback: high computational cost, because convergence to an optimal solution usually requires that all states be visited frequently to ensure that the policy is reliable. This is not always possible, given the complex, high-dimensional state spaces of many applications. This paper introduces opposition-based reinforcement learning, inspired by opposition-based learning, to speed up convergence. Considering opposite actions simultaneously enables individual states to be updated more than once, shortening exploration and expediting convergence. Three versions of the Q-learning algorithm are given as examples. Experimental results for grid world problems of different sizes demonstrate the superior performance of the proposed approach.
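
The abstract does not spell out the update rules, so the following is only a minimal sketch of the general idea: each visit to a state updates the Q-value of the action taken and also that of its opposite. Everything in it is an assumption made for illustration: the names `step`, `q_update`, `opposition_step`, and `train`, the reward values, the definition of "opposite" as the reverse move, and the use of a known grid model to simulate the opposite action's outcome. The three variants in the paper may define the opposite update differently.

```python
import random

# Moves on a square grid; OPPOSITE pairs each action with its reverse.
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}
OPPOSITE = {"up": "down", "down": "up", "left": "right", "right": "left"}


def step(state, action, size, goal):
    """Deterministic move; bumping into a wall leaves the agent in place."""
    dr, dc = ACTIONS[action]
    nxt = (min(max(state[0] + dr, 0), size - 1),
           min(max(state[1] + dc, 0), size - 1))
    reward = 1.0 if nxt == goal else -0.01  # hypothetical reward scheme
    return nxt, reward


def q_update(Q, s, a, reward, s_next, alpha, gamma):
    """Standard one-step Q-learning update."""
    best_next = max(Q[(s_next, b)] for b in ACTIONS)
    Q[(s, a)] += alpha * (reward + gamma * best_next - Q[(s, a)])


def opposition_step(Q, s, a, size, goal, alpha=0.5, gamma=0.9):
    """Update Q for the action taken and for its opposite, then move."""
    s_next, reward = step(s, a, size, goal)
    q_update(Q, s, a, reward, s_next, alpha, gamma)
    # Assumption: the grid model is known, so the opposite action's outcome
    # can be simulated from the same state without actually taking it.
    opp = OPPOSITE[a]
    opp_next, opp_reward = step(s, opp, size, goal)
    q_update(Q, s, opp, opp_reward, opp_next, alpha, gamma)
    return s_next


def train(size=5, episodes=200, epsilon=0.2, seed=0):
    """Epsilon-greedy training loop using two Q-updates per move."""
    rng = random.Random(seed)
    goal = (size - 1, size - 1)
    Q = {((r, c), a): 0.0
         for r in range(size) for c in range(size) for a in ACTIONS}
    for _ in range(episodes):
        s = (0, 0)
        for _ in range(4 * size * size):  # cap on episode length
            if rng.random() < epsilon:
                a = rng.choice(list(ACTIONS))
            else:
                a = max(ACTIONS, key=lambda b: Q[(s, b)])
            s = opposition_step(Q, s, a, size, goal)
            if s == goal:
                break
    return Q
```

In this sketch every move produces two table updates instead of one, which is the sense in which a state is "updated more than once" per visit. Whether that shortens exploration in practice depends on how the opposite action and its reward are defined.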

Publisher

Fuji Technology Press Ltd.

Subject

Artificial Intelligence, Computer Vision and Pattern Recognition, Human-Computer Interaction

References: 14 articles.

1. A. G. Barto, R. S. Sutton, and P. S. Brouwer, “Associative search network: A reinforcement learning associative memory,” Biological Cybernetics, Vol.40, No.3, pp. 201-211, May, 1981.

2. A. W. Beggs, “On the convergence of reinforcement learning,” Journal of Economic Theory, Vol.122, Issue 1, pp. 1-36, May, 2005.

3. K. Driessens, J. Ramon, and H. Blockeel, “Speeding Up Relational Reinforcement Learning through the Use of an Incremental First Order Decision Tree Learner,” Proc. 12th European Conference on Machine Learning, Freiburg, Germany, September, 2001.

4. C. Drummond, “Composing functions to speed up reinforcement learning in a changing world,” Proc. 10th European Conference on Machine Learning, Springer-Verlag, 1998.

5. S. Dzeroski, L. De Raedt, and K. Driessens, “Relational Reinforcement Learning,” Machine Learning, Vol.43, Issue 1-2, pp. 7-52, April-May, 2001.

Cited by 129 articles.

1. A hybrid optimization algorithm for multi-agent dynamic planning with guaranteed convergence in probability;Neurocomputing;2024-08

2. Machine Learning Applications for Online Partial Discharge Detection, Classification, and Localization in Power Transformers: A Review;2024 4th International Conference on Smart Grid and Renewable Energy (SGRE);2024-01-08

3. Opposition-Based Crossover Operation for Differential Evolution Algorithm;2023 IEEE Symposium Series on Computational Intelligence (SSCI);2023-12-05

4. A hybrid grey wolf optimizer using opposition-based learning, sine cosine algorithm and reinforcement learning for reliable scheduling and resource allocation;Journal of Systems and Software;2023-11

5. Day-ahead scheduling of isolated microgrid integrated demand side management;Soft Computing;2023-09-29