Affiliation:
1. Instituto de Informática, Universidade Federal do Rio Grande do Sul (UFRGS) Porto Alegre, Brazil
Abstract
Using reinforcement learning (RL) to support agents in making decisions that
consider more than one objective poses challenges. We formulate the problem
of multiple agents learning how to travel from A to B as a reinforcement
learning task modeled as a stochastic game, in which we take into account:
(i) more than one objective, (ii) non-stationarity, (iii) communication of
local and non-local information among the various actors. We use and compare
RL algorithms, both for the single objective (Q-learning), as well as for
multiple objectives (Pareto Q learning), with and without non-local
communication. We evaluate these methods in a scenario in which hundreds of
agents have to learn how to travel from their origins to their destinations,
aiming at minimizing their travel times, as well as the carbon monoxide
vehicles emit. Results show that the use of non-local communication reduces
both travel time and emissions.
Publisher
National Library of Serbia
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Application of Reinforcement Learning Algorithm in Personalized Music Teaching;2024 2nd International Conference on Mechatronics, IoT and Industrial Informatics (ICMIII);2024-06-12