Optimal parameter trajectory estimation in parameterized SDEs-Reference-Cited by-同舟云学术

Optimal parameter trajectory estimation in parameterized SDEs

Published:2009-03 Issue:2 Volume:19 Page:1-27
ISSN:1049-3301
Container-title:ACM Transactions on Modeling and Computer Simulation
language:en
Short-container-title:ACM Trans. Model. Comput. Simul.

Author:

Bhatnagar Shalabh¹,Karmeshu ²,Mishra Vivek Kumar¹

Affiliation:

1. Indian Institute of Science, Bangalore, India

2. Jawaharlal Nehru University, New Delhi, India

Abstract

We consider the problem of estimating the optimal parameter trajectory over a finite time interval in a parameterized stochastic differential equation (SDE), and propose a simulation-based algorithm for this purpose. Towards this end, we consider a discretization of the SDE over finite time instants and reformulate the problem as one of finding an optimal parameter at each of these instants. A stochastic approximation algorithm based on the smoothed functional technique is adapted to this setting for finding the optimal parameter trajectory. A proof of convergence of the algorithm is presented and results of numerical experiments over two different settings are shown. The algorithm is seen to exhibit good performance. We also present extensions of our framework to the case of finding optimal parameterized feedback policies for controlled SDE and present numerical results in this scenario as well.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,Modeling and Simulation

Link

https://dl.acm.org/doi/pdf/10.1145/1502787.1502791

Reference33 articles.

1. Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes

2. Bertsekas D. P. and Gallager R. G. 1991. Data Networks. Prentice-Hall New York. Bertsekas D. P. and Gallager R. G. 1991. Data Networks. Prentice-Hall New York.

3. Bertsekas D. P. and Tsitsiklis J. N. 1996. Neuro-Dynamic Programming. Athena Scientific Belmont MA. Bertsekas D. P. and Tsitsiklis J. N. 1996. Neuro-Dynamic Programming. Athena Scientific Belmont MA.

4. Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A simulation-based algorithm for optimal pricing policy under demand uncertainty;International Transactions in Operational Research;2013-12-30

2. Communication Networks;Stochastic Recursive Algorithms for Optimization;2013

3. Introduction;Stochastic Recursive Algorithms for Optimization;2013

4. An Optimized SDE Model for Slotted Aloha;IEEE Transactions on Communications;2011-06

5. Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systems;Applied Mathematical Modelling;2011-06