Imitation Learning of Stabilizing Policies for Nonlinear Systems

Imitation Learning of Stabilizing Policies for Nonlinear Systems. East, S. arXiv:2109.10854 [cs, math], September, 2021. arXiv: 2109.10854

Paper abstract bibtex

There has been a recent interest in imitation learning methods that are guaranteed to produce a stabilizing control law with respect to a known system. Work in this area has generally considered linear systems and controllers, for which stabilizing imitation learning takes the form of a biconvex optimization problem. In this paper it is demonstrated that the same methods developed for linear systems and controllers can be readily extended to polynomial systems and controllers using sum of squares techniques. A projected gradient descent algorithm and an alternating direction method of multipliers algorithm are proposed as heuristics for solving the stabilizing imitation learning problem, and their performance is illustrated through numerical experiments.

@article{east_imitation_2021,
	title = {Imitation {Learning} of {Stabilizing} {Policies} for {Nonlinear} {Systems}},
	url = {http://arxiv.org/abs/2109.10854},
	abstract = {There has been a recent interest in imitation learning methods that are guaranteed to produce a stabilizing control law with respect to a known system. Work in this area has generally considered linear systems and controllers, for which stabilizing imitation learning takes the form of a biconvex optimization problem. In this paper it is demonstrated that the same methods developed for linear systems and controllers can be readily extended to polynomial systems and controllers using sum of squares techniques. A projected gradient descent algorithm and an alternating direction method of multipliers algorithm are proposed as heuristics for solving the stabilizing imitation learning problem, and their performance is illustrated through numerical experiments.},
	urldate = {2021-09-28},
	journal = {arXiv:2109.10854 [cs, math]},
	author = {East, Sebastian},
	month = sep,
	year = {2021},
	note = {arXiv: 2109.10854},
	keywords = {machine learning, mentions sympy, optimization},
}

Downloads: 0

{"_id":"YuY67TqwgD2LFe79p","bibbaseid":"east-imitationlearningofstabilizingpoliciesfornonlinearsystems-2021","author_short":["East, S."],"bibdata":{"bibtype":"article","type":"article","title":"Imitation Learning of Stabilizing Policies for Nonlinear Systems","url":"http://arxiv.org/abs/2109.10854","abstract":"There has been a recent interest in imitation learning methods that are guaranteed to produce a stabilizing control law with respect to a known system. Work in this area has generally considered linear systems and controllers, for which stabilizing imitation learning takes the form of a biconvex optimization problem. In this paper it is demonstrated that the same methods developed for linear systems and controllers can be readily extended to polynomial systems and controllers using sum of squares techniques. A projected gradient descent algorithm and an alternating direction method of multipliers algorithm are proposed as heuristics for solving the stabilizing imitation learning problem, and their performance is illustrated through numerical experiments.","urldate":"2021-09-28","journal":"arXiv:2109.10854 [cs, math]","author":[{"propositions":[],"lastnames":["East"],"firstnames":["Sebastian"],"suffixes":[]}],"month":"September","year":"2021","note":"arXiv: 2109.10854","keywords":"machine learning, mentions sympy, optimization","bibtex":"@article{east_imitation_2021,\n\ttitle = {Imitation {Learning} of {Stabilizing} {Policies} for {Nonlinear} {Systems}},\n\turl = {http://arxiv.org/abs/2109.10854},\n\tabstract = {There has been a recent interest in imitation learning methods that are guaranteed to produce a stabilizing control law with respect to a known system. Work in this area has generally considered linear systems and controllers, for which stabilizing imitation learning takes the form of a biconvex optimization problem. In this paper it is demonstrated that the same methods developed for linear systems and controllers can be readily extended to polynomial systems and controllers using sum of squares techniques. A projected gradient descent algorithm and an alternating direction method of multipliers algorithm are proposed as heuristics for solving the stabilizing imitation learning problem, and their performance is illustrated through numerical experiments.},\n\turldate = {2021-09-28},\n\tjournal = {arXiv:2109.10854 [cs, math]},\n\tauthor = {East, Sebastian},\n\tmonth = sep,\n\tyear = {2021},\n\tnote = {arXiv: 2109.10854},\n\tkeywords = {machine learning, mentions sympy, optimization},\n}\n\n\n\n\n\n\n\n","author_short":["East, S."],"key":"east_imitation_2021","id":"east_imitation_2021","bibbaseid":"east-imitationlearningofstabilizingpoliciesfornonlinearsystems-2021","role":"author","urls":{"Paper":"http://arxiv.org/abs/2109.10854"},"keyword":["machine learning","mentions sympy","optimization"],"metadata":{"authorlinks":{}}},"bibtype":"article","biburl":"https://bibbase.org/zotero-group/nicoguaro/525293","dataSources":["YtBDXPDiQEyhyEDZC","fhHfrQgj3AaGp7e9E","qzbMjEJf5d9Lk78vE","45tA9RFoXA9XeH4MM","MeSgs2KDKZo3bEbxH","nSXCrcahhCNfzvXEY","ecatNAsyr4f2iQyGq","tpWeaaCgFjPTYCjg3"],"keywords":["machine learning","mentions sympy","optimization"],"search_terms":["imitation","learning","stabilizing","policies","nonlinear","systems","east"],"title":"Imitation Learning of Stabilizing Policies for Nonlinear Systems","year":2021}