Adaptive Dynamic Programming

Adaptive Dynamic Programming. Murray, J., Cox, C., Lendaris, G., & Saeks, R. IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), 32(2):140–153, May, 2002.

Paper doi abstract bibtex

An Adaptive Dynamic Programming algorithm for nonlinear systems with unknown dynamics is developed. The algorithm is initialized with a positive definite cost functional / stabilizing control law pair (V0, k0) (coupled via the Hamilton Jacobi Bellman Equation). Given (Vi, ki), one runs the system using control law ki recording the state and control trajectories, with these trajectories used to define Vi+1 as the cost to take the initial state x0 to the final state using control law, ki, while ki+1 is taken to be the control law derived from Vi+1 via Hamilton Jacobi Bellman Equation.

@article{murray_adaptive_2002,
	title = {Adaptive {Dynamic} {Programming}},
	volume = {32},
	issn = {1094-6977},
	url = {http://ieeexplore.ieee.org/document/1039198/},
	doi = {10.1109/tsmcc.2002.801727},
	abstract = {An Adaptive Dynamic Programming algorithm for nonlinear systems with unknown dynamics is developed. The algorithm is initialized with a positive definite cost functional / stabilizing control law pair (V0, k0) (coupled via the Hamilton Jacobi Bellman Equation). Given (Vi, ki), one runs the system using control law ki recording the state and control trajectories, with these trajectories used to define Vi+1 as the cost to take the initial state x0 to the final state using control law, ki, while ki+1 is taken to be the control law derived from Vi+1 via Hamilton Jacobi Bellman Equation.},
	language = {en},
	number = {2},
	urldate = {2022-02-03},
	journal = {IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews)},
	author = {Murray, J.J. and Cox, C.J. and Lendaris, G.G. and Saeks, R.},
	month = may,
	year = {2002},
	keywords = {/unread},
	pages = {140--153},
}

Downloads: 0

{"_id":"ewhRuwsoqPy2b687E","bibbaseid":"murray-cox-lendaris-saeks-adaptivedynamicprogramming-2002","authorIDs":[],"author_short":["Murray, J.","Cox, C.","Lendaris, G.","Saeks, R."],"bibdata":{"bibtype":"article","type":"article","title":"Adaptive Dynamic Programming","volume":"32","issn":"1094-6977","url":"http://ieeexplore.ieee.org/document/1039198/","doi":"10.1109/tsmcc.2002.801727","abstract":"An Adaptive Dynamic Programming algorithm for nonlinear systems with unknown dynamics is developed. The algorithm is initialized with a positive definite cost functional / stabilizing control law pair (V0, k0) (coupled via the Hamilton Jacobi Bellman Equation). Given (Vi, ki), one runs the system using control law ki recording the state and control trajectories, with these trajectories used to define Vi+1 as the cost to take the initial state x0 to the final state using control law, ki, while ki+1 is taken to be the control law derived from Vi+1 via Hamilton Jacobi Bellman Equation.","language":"en","number":"2","urldate":"2022-02-03","journal":"IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews)","author":[{"propositions":[],"lastnames":["Murray"],"firstnames":["J.J."],"suffixes":[]},{"propositions":[],"lastnames":["Cox"],"firstnames":["C.J."],"suffixes":[]},{"propositions":[],"lastnames":["Lendaris"],"firstnames":["G.G."],"suffixes":[]},{"propositions":[],"lastnames":["Saeks"],"firstnames":["R."],"suffixes":[]}],"month":"May","year":"2002","keywords":"/unread","pages":"140–153","bibtex":"@article{murray_adaptive_2002,\n\ttitle = {Adaptive {Dynamic} {Programming}},\n\tvolume = {32},\n\tissn = {1094-6977},\n\turl = {http://ieeexplore.ieee.org/document/1039198/},\n\tdoi = {10.1109/tsmcc.2002.801727},\n\tabstract = {An Adaptive Dynamic Programming algorithm for nonlinear systems with unknown dynamics is developed. The algorithm is initialized with a positive definite cost functional / stabilizing control law pair (V0, k0) (coupled via the Hamilton Jacobi Bellman Equation). Given (Vi, ki), one runs the system using control law ki recording the state and control trajectories, with these trajectories used to define Vi+1 as the cost to take the initial state x0 to the final state using control law, ki, while ki+1 is taken to be the control law derived from Vi+1 via Hamilton Jacobi Bellman Equation.},\n\tlanguage = {en},\n\tnumber = {2},\n\turldate = {2022-02-03},\n\tjournal = {IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews)},\n\tauthor = {Murray, J.J. and Cox, C.J. and Lendaris, G.G. and Saeks, R.},\n\tmonth = may,\n\tyear = {2002},\n\tkeywords = {/unread},\n\tpages = {140--153},\n}\n\n","author_short":["Murray, J.","Cox, C.","Lendaris, G.","Saeks, R."],"key":"murray_adaptive_2002","id":"murray_adaptive_2002","bibbaseid":"murray-cox-lendaris-saeks-adaptivedynamicprogramming-2002","role":"author","urls":{"Paper":"http://ieeexplore.ieee.org/document/1039198/"},"keyword":["/unread"],"metadata":{"authorlinks":{}},"html":""},"bibtype":"article","biburl":"https://bibbase.org/zotero/victorjhu","creationDate":"2019-05-28T23:29:25.433Z","downloads":0,"keywords":["/unread"],"search_terms":["adaptive","dynamic","programming","murray","cox","lendaris","saeks"],"title":"Adaptive Dynamic Programming","year":2002,"dataSources":["aDrN6vnZWqY8fA7E8","CmHEoydhafhbkXXt5"]}