Imitation Learning with Stability and Safety Guarantees. Yin, H., Seiler, P., Jin, M., & Arcak, M. 2021. IEEE Control Systems Letters
Imitation Learning with Stability and Safety Guarantees [pdf]Pdf  Imitation Learning with Stability and Safety Guarantees [link]Arxiv  abstract   bibtex   1 download  
A method is presented to synthesize neural network (NN) controllers with stability and safety guarantees through imitation learning. Convex stability and safety conditions are derived for linear time-invariant plant dynamics with NN controllers. The proposed approach merges Lyapunov theory with local quadratic constraints to bound the nonlinear activation functions in the NN. The safe imitation learning problem is formulated as an optimization problem with the goal of minimizing the imitation learning loss, and maximizing volume of the region of attraction associated with the NN controller, while enforcing the stability and safety conditions. An alternating direction method of multipliers based algorithm is proposed to solve the optimization. The method is illustrated on an inverted pendulum system and aircraft longitudinal dynamics.

Downloads: 1