Affiliation:
1. School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, GA 30332
Abstract
This paper presents a numerical algorithm for finding the bang-bang control input associated with the time optimal solution of a class of nonlinear dynamic systems. The proposed algorithm directly searches for the optimal switching instants based on a projected gradient optimization method. It is shown that this algorithm can be made into a learning algorithm by using on-line measurements of the state trajectory. The learning is shown to have the potential for significant robustness to mismatch between the model and the system. It learns a nearly optimal input through repeated trials in which it utilizes the measured terminal state error of the actual system and gradients based on the theoretical state equation of the system but evaluated along the actual state trajectory. The success of the method is demonstrated on an underactuated double pendulum system called the acrobot.
Subject
Computer Science Applications,Mechanical Engineering,Instrumentation,Information Systems,Control and Systems Engineering
Reference16 articles.
1. Bazaraa, M. S., and Shetty, C. M., Nonlinear Programming, Wiley, New York, 1979.
2. Bobrow, J. E. et al., “On the Optimal Control of Robotic Manipulators with Actuator Constraints,” Proceedings of the 1983 American Control conference, Vol. 2, pp. 782–787, 1983.
3. Bryson, A. E., and Ho, Y., Applied Optimal Control, Hemisphere Publishing, New York, 1975.
4. Byers, et al., “Near-Minimum Time, Closed Loop Slewing of Flexible Spacecraft,” Journal of Guidance, Control, and Dynamics, Vol. 13, No. 1, Jan.–Feb., 1990.
5. Byers, R. M., and Vadali, S. R., “Quasi-Closed-Form Solution to the Time-Optimal Rigid Spacecraft Reorientation Problem,” Journal of Guidance, Control, and Dynamics, Vol. 16, No. 3, May-June 1993.
Cited by
12 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献