Reconfigurable Embedded Devices Using Reinforcement Learning to Develop Action Policies

Published: 2020-12-31 Issue: 4 Volume: 15 Page: 1-25
ISSN: 1556-4665
Container-title: ACM Transactions on Autonomous and Adaptive Systems
Language: en
Short-container-title: ACM Trans. Auton. Adapt. Syst.

Author:

Burger Alwyn¹, Schiele Gregor¹, King David W.²

Affiliation:

1. University of Duisburg-Essen, Duisburg, Germany

2. Air Force Institute of Technology, Dayton, Ohio, USA

Abstract

The size of sensor networks supporting smart cities is ever increasing. Sensor network resiliency becomes vital for critical networks such as emergency response and wastewater treatment. One approach is to engineer “self-aware” sensors that can proactively change their component composition in response to changes in workload when critical devices fail. By extension, these devices could anticipate their own termination, such as battery depletion, and offload current tasks onto connected devices. These neighboring devices can then reconfigure themselves to process these tasks, thus avoiding catastrophic network failure. In this article, we compare and contrast two types of self-aware sensors. One type uses Q-learning to develop a policy that guides device reaction to various environmental stimuli, whereas the other uses a set of shallow neural networks to select an appropriate reaction. The novelty lies in the use of field programmable gate arrays embedded on the sensors that take into account internal system state, configuration, and learned state-action pairs, which guide device decisions to meet system demands. Experiments show that even relatively simple reward functions develop both Q-learning policies and shallow neural networks that yield positive device behaviors in dynamic environments.
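
As a rough illustration of the Q-learning approach the abstract mentions, the sketch below trains a tabular policy on a toy device model. It is not the authors' implementation: the two workload states, the two actions, the reward values, and the function names `step`, `train`, and `greedy_policy` are all hypothetical and chosen only to show how a simple reward function can shape a state-action table.

```python
import random
from collections import defaultdict

# Hypothetical, simplified device model. These states, actions, and rewards
# are illustrative assumptions and are not taken from the article.
STATES = ["low", "high"]            # current workload level
ACTIONS = ["keep_config", "reconfigure"]


def step(state, action):
    """Toy environment: reward the action that matches the workload level."""
    if state == "high":
        reward = 1.0 if action == "reconfigure" else -1.0
    else:
        reward = 1.0 if action == "keep_config" else -1.0
    next_state = random.choice(STATES)  # workload changes at random
    return next_state, reward


def train(steps=2000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Run tabular Q-learning and return the learned state-action table."""
    random.seed(seed)
    q = defaultdict(float)  # maps (state, action) to an estimated value
    state = "low"
    for _ in range(steps):
        # Epsilon-greedy selection: mostly exploit, occasionally explore.
        if random.random() < epsilon:
            action = random.choice(ACTIONS)
        else:
            action = max(ACTIONS, key=lambda a: q[(state, a)])
        next_state, reward = step(state, action)
        # Standard Q-learning update toward reward plus discounted best next value.
        best_next = max(q[(next_state, a)] for a in ACTIONS)
        q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
        state = next_state
    return q


def greedy_policy(q):
    """Pick the highest-valued action for each state."""
    return {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in STATES}


if __name__ == "__main__":
    print(greedy_policy(train()))
```

In this toy setting the learned table favors reconfiguring under high workload and keeping the current configuration otherwise; a real device would use a far richer state, including the internal system state and configuration the abstract describes.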

Funder

Federal Ministry of Education and Research of Germany

Publisher

Association for Computing Machinery (ACM)

Subject

Software, Computer Science (miscellaneous), Control and Systems Engineering

Link

https://dl.acm.org/doi/pdf/10.1145/3487920

References: 42 articles.

1. Chloe M. Barnes, Anikó Ekárt, and Peter R. Lewis. 2019. Social action in socially situated agents. In Proceedings of the International Conference on Self-Adaptive and Self-Organizing Systems (SASO’19). 97–106. https://doi.org/10.1109/SASO.2019.00021

2. On the Complexity of Neural Network Classifiers: A Comparison Between Shallow and Deep Architectures

3. Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, et al. 2020. Language models are few-shot learners. arXiv:2005.14165 [cs.CL].

Cited by 1 article.

1. Keynote: The Elastic AI Ecosystem — Towards A Holistic Pervasive System for Adaptive Artificial Intelligence;2023 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops);2023-03-13