A Real-Time Computational Learning Model for Sequential Decision-Making Problems Under Uncertainty

Published: 2009-05-20 Issue: 4 Volume: 131
ISSN: 0022-0434
Container-title: Journal of Dynamic Systems, Measurement, and Control
Language: en

Author:

Malikopoulos Andreas A.¹, Papalambros Panos Y.¹, Assanis Dennis N.¹

Affiliation:

1. Department of Mechanical Engineering, University of Michigan, Ann Arbor, MI 48109

Abstract

Modeling dynamic systems incurring stochastic disturbances for deriving a control policy is a ubiquitous task in engineering. However, in some instances obtaining a model of a system may be impractical or impossible. Alternative approaches have been developed using a simulation-based stochastic framework, in which the system interacts with its environment in real time and obtains information that can be processed to produce an optimal control policy. In this context, the problem of developing a policy for controlling the system’s behavior is formulated as a sequential decision-making problem under uncertainty. This paper considers the problem of deriving a control policy for a dynamic system with unknown dynamics in real time, formulated as a sequential decision-making problem under uncertainty. The evolution of the system is modeled as a controlled Markov chain. A new state-space representation model and a learning mechanism are proposed that can be used to improve system performance over time. The major difference between the existing methods and the proposed learning model is that the latter utilizes an evaluation function, which considers the expected cost that can be achieved by state transitions forward in time. The model allows decision-making based on gradually enhanced knowledge of system response as it transitions from one state to another, in conjunction with actions taken at each state. The proposed model is demonstrated on the single cart-pole balancing problem and a vehicle cruise-control problem.

Publisher

ASME International

Subject

Computer Science Applications, Mechanical Engineering, Instrumentation, Information Systems, Control and Systems Engineering

Link

http://asmedigitalcollection.asme.org/dynamicsystems/article-pdf/doi/10.1115/1.3117200/5781331/041010_1.pdf

References: 41 articles.

1. Reinforcement Learning for Long-Run Average Cost;Gosavi;Eur. J. Oper. Res.

2. Neuro-Dynamic Programming;Bertsekas

3. Reinforcement Learning: An Introduction;Sutton

4. A Learning Algorithm for Discrete-Time Stochastic Control;Borkar;Probability in the Engineering and Informational Sciences

Cited by 7 articles.

1. Combining learning and control in linear systems;European Journal of Control;2024-06

2. Enhanced Mobility With Connectivity and Automation: A Review of Shared Autonomous Vehicle Systems;IEEE Intelligent Transportation Systems Magazine;2022-01

3. Tracking Control of a Continuous Stirred Tank Reactor Using Direct and Tuned Reinforcement Learning Based Controllers;Chemical Product and Process Modeling;2017-11-14

4. Case Studies;Simulation-Based Optimization;2014-08-07

5. Exploring the Impact of Speed Synchronization through Connected Vehicle Technology on Fleet-Level Fuel Economy;SAE International Journal of Passenger Cars - Electronic and Electrical Systems;2013-04-08