Online Learning of Unknown Dynamics for Model-Based Controllers in Legged Locomotion

Online Learning of Unknown Dynamics for Model-Based Controllers in Legged Locomotion. Sun, Y., Ubellacker, W. L., Ma, W., Zhang, X., Wang, C., Csomay-Shanklin, N. V., Tomizuka, M., Sreenath, K., & Ames, A. D. IEEE Robotics and Automation Letters, 6(4):8442–8449, October 2021.

The performance of a model-based controller can severely suffer when its model inaccurately represents the real world dynamics. We propose to learn a time-varying, locally linear residual model along the robot’s current trajectory, to compensate for the prediction errors of the controller’s model. Supervised learning is performed online, as the robot is running in the unknown environment, using data collected from its immediate past. We theoretically investigate our method in its general formulation, then apply it to a bipedal controller derived from the full-order dynamics of virtual constraints, and a quadrupedal controller derived from a simplified model of contact forces. For a biped in simulation, our method consistently outperforms the baseline and a recent learning-based method. We also experiment with a 12 kg quadruped in simulation and real world, where the baseline fails to walk with 10 kg of payload but our method succeeds.
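
The idea in the abstract of fitting a linear residual to recent prediction errors can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a scalar state and input, a fixed-length sliding window, and a small ridge term, and the class name `SlidingWindowResidual` and its parameters are hypothetical.

```python
from collections import deque


class SlidingWindowResidual:
    """Hypothetical helper: fits e ~ a*x + b*u on the most recent samples."""

    def __init__(self, window=20, ridge=1e-9):
        self.samples = deque(maxlen=window)  # oldest data drops out automatically
        self.ridge = ridge  # small regularizer keeps the 2x2 solve well-posed
        self.a = 0.0
        self.b = 0.0

    def update(self, x, u, x_next_measured, x_next_nominal):
        """Add one observed transition and refit the residual on the window."""
        # Residual: the part of the next state the nominal model failed to predict.
        e = x_next_measured - x_next_nominal
        self.samples.append((x, u, e))

        # Normal equations of two-parameter ridge least squares.
        sxx = sum(xi * xi for xi, _, _ in self.samples) + self.ridge
        suu = sum(ui * ui for _, ui, _ in self.samples) + self.ridge
        sxu = sum(xi * ui for xi, ui, _ in self.samples)
        sxe = sum(xi * ei for xi, _, ei in self.samples)
        sue = sum(ui * ei for _, ui, ei in self.samples)

        det = sxx * suu - sxu * sxu  # strictly positive whenever ridge > 0
        self.a = (sxe * suu - sue * sxu) / det
        self.b = (sue * sxx - sxe * sxu) / det

    def corrected_prediction(self, x, u, x_next_nominal):
        """Nominal prediction plus the learned linear residual."""
        return x_next_nominal + self.a * x + self.b * u
```

Because the window only holds the immediate past, the fitted coefficients can drift as the operating point changes, which is one simple way to obtain a time-varying, locally linear correction. The method described in the abstract works with the controller's own model and a multivariate state, which this scalar sketch does not attempt to reproduce.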

@article{sun_online_2021,
	title = {Online {Learning} of {Unknown} {Dynamics} for {Model}-{Based} {Controllers} in {Legged} {Locomotion}},
	volume = {6},
	issn = {2377-3766, 2377-3774},
	url = {https://ieeexplore.ieee.org/document/9525285/},
	doi = {10.1109/LRA.2021.3108510},
	abstract = {The performance of a model-based controller can severely suffer when its model inaccurately represents the real world dynamics. We propose to learn a time-varying, locally linear residual model along the robot’s current trajectory, to compensate for the prediction errors of the controller’s model. Supervised learning is performed online, as the robot is running in the unknown environment, using data collected from its immediate past. We theoretically investigate our method in its general formulation, then apply it to a bipedal controller derived from the full-order dynamics of virtual constraints, and a quadrupedal controller derived from a simplified model of contact forces. For a biped in simulation, our method consistently outperforms the baseline and a recent learning-based method. We also experiment with a 12 kg quadruped in simulation and real world, where the baseline fails to walk with 10 kg of payload but our method succeeds.},
	language = {en},
	number = {4},
	urldate = {2021-12-14},
	journal = {IEEE Robotics and Automation Letters},
	author = {Sun, Yu and Ubellacker, Wyatt L. and Ma, Wen-Loong and Zhang, Xiang and Wang, Changhao and Csomay-Shanklin, Noel V. and Tomizuka, Masayoshi and Sreenath, Koushil and Ames, Aaron D.},
	month = oct,
	year = {2021},
	pages = {8442--8449},
}
