Abstract
Creating an intelligent system that can generalize and reach human-level or above-human performance across a variety of tasks will be part of the crowning achievement of Artificial General Intelligence. However, even though many steps have been taken in this direction, they have critical shortcomings that prevent the research community from drawing a clear path towards that goal, such as limited model learning capacity, sample inefficiency, or low overall performance. In this paper, we propose GENEREIT, a meta-Reinforcement Learning model in which a single Deep Reinforcement Learning agent (the meta-learner) produces high-performance agents (inner-learners) that solve different environments within a single training session, in a sample-efficient way, as shown by preliminary results on a set of toy-like environments. This is partially due to the fixed-subset selection strategy, which allows the meta-learner to focus on tuning specific traits of the generated agents rather than tuning them completely. Combined with our system's modular design for introducing higher levels of the meta-learning hierarchy, this can also make the system potentially immune to catastrophic forgetting and provide ample learning capacity.
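The fixed-subset idea described above can be illustrated with a minimal sketch. Everything here is an assumption for illustration (the paper does not specify parameter counts, index choices, or names such as `generate_inner_agent`): the meta-learner writes only a fixed, pre-selected subset of an inner-learner's parameters, while the remaining shared parameters are left untouched across environments.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical inner-learner: a flat parameter vector standing in for a policy.
INNER_DIM = 16
base_params = rng.normal(size=INNER_DIM)  # shared backbone, never overwritten

# Fixed subset selection (assumed: indices chosen once, before training).
# The meta-learner only ever writes these positions.
TUNED_IDX = np.array([1, 4, 7, 12])

def generate_inner_agent(meta_output: np.ndarray) -> np.ndarray:
    """Produce an inner-learner by patching only the fixed parameter subset."""
    assert meta_output.shape == TUNED_IDX.shape
    params = base_params.copy()
    params[TUNED_IDX] = meta_output  # tune specific traits, not the whole agent
    return params

# Two environments -> two meta outputs -> two distinct inner agents that
# differ only at the tuned positions, so the shared part cannot be forgotten.
agent_a = generate_inner_agent(rng.normal(size=4))
agent_b = generate_inner_agent(rng.normal(size=4))
untouched = np.setdiff1d(np.arange(INNER_DIM), TUNED_IDX)
```

Because per-environment differences are confined to `TUNED_IDX`, the shared positions of `agent_a` and `agent_b` remain identical to `base_params`, which is one intuition for why such a design could resist catastrophic forgetting.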
Funder
Aristotle University of Thessaloniki
Publisher
Springer Science and Business Media LLC
Subject
Electrical and Electronic Engineering, Applied Mathematics, Artificial Intelligence, Computational Theory and Mathematics, Computer Networks and Communications, Computer Science Applications, Information Systems
Cited by
5 articles.