Abstract
Shift schemes are commonly used in non-convex situations when solving unconstrained discrete-time optimal control problems by the differential dynamic programming (DDP) method. However, the existing shift schemes are inefficient when the shift becomes too large. In this paper, a new method of combining the DDP method with a shift scheme and the steepest descent method is proposed to cope with non-convex situations. Under certain assumptions, the proposed method is globally convergent and has q-quadratic local conve rgence. Extensive numerical experiments on many test problems in the literature are reported. These numerical results illustrate the robustness and efficiency of the proposed method.
Similar content being viewed by others
References
Bertsekas, D. P. (1982), Projected Newton methods for optimization problems with simple constraints, SIAM Journal of Control and Optimization 20, 221–246.
Dennis, J. E. and Schnabel, R. B. (1983), Numerical Methods for Unconstrained Optimization and Nonlinear Equations, Prentice-Hall, Englewood Cliffs, NJ.
Di Pillo, G., Grippo, L. and Lampariello, F. (1983), A class of structured quasi-Newton algorithms for optimal control problems, IFAC Applications of Nonlinear Programming to Optimization and Control, Edited by H. E. Rauch, International Federation of Automatic Control, Pergamon Press, NY, 101–107.
Fletcher, R. (1987), Practical Methods of Optimization, Second Edition, John Wiley and Sons, Chichester.
Gill, P. E., Murray, W. and Wright, M. H. (1981), Practical Optimization, Academic Press, London.
Jacobson, D. H. and Mayne, D. Q. (1970), Differential Dynamic Programming, American Elsevier, New York.
Liao, L. Z. and Shoemaker, C. A. (1991), Convergence in unconstrained discrete-time differential dynamic programming, IEEE Transactions on Automatic Control 36, 692–706.
Liao, L. Z. and Shoemaker, C. A. (1992), Comparison of differential dynamic programming and Newton's method for discrete-time optimal control Problems, Technical Report CTC92TR97, Cornell University.
Luenberger, D. G. (1989), Linear and Nonlinear Programming, Second Edition, Addison-Wesley, Reading, MA.
Mayne, D. Q. (1966), A second-order gradient method for determining optimal trajectories of non-linear discrete-time systems, International Journal of Control 3:1, 85–95.
Murray, D. and Yakowitz, S. (1981), The application of optimal control methodology to nonlinear programming problems, Mathematical Programming 21, 331–347.
Murray, D. M. and Yakowitz, S. J. (1984), Differential dynamic programming and Newton's method for discrete optimal control problems, Journal of Optimization Theory and Applications 43:3, 395–414.
Pantoja, J. F. A. De O. (1988), Differential dynamic programming and Newton's method, International Journal of Control47:5, 1539–1553.
Yakowitz, S. and Rutherford, B. (1984), Computational aspects of discrete-time optimal control, Applied Mathematics and Computation 15, 29–45.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ng, C.K., Liao, LZ. & Li, D. A Globally Convergent and Efficient Method for Unconstrained Discrete-Time Optimal Control. Journal of Global Optimization 23, 401–421 (2002). https://doi.org/10.1023/A:1016595100139
Issue Date:
DOI: https://doi.org/10.1023/A:1016595100139