Gershgorin Loss Stabilizes the Recurrent Neural Network Compartment of an End-to-end Robot Learning Scheme

Lechner, Mathias; Hasani, Ramin; Rus, Daniela; Grosu, Radu

doi:10.1109/ICRA40945.2020.9196608

Record citation link:

http://hdl.handle.net/20.500.12708/15630
https://doi.org/10.34726/242

Title:

Gershgorin Loss Stabilizes the Recurrent Neural Network Compartment of an End-to-end Robot Learning Scheme

Citation:

Lechner, M., Hasani, R., Rus, D., & Grosu, R. (2020). Gershgorin Loss Stabilizes the Recurrent Neural Network Compartment of an End-to-end Robot Learning Scheme. In 2020 IEEE International Conference on Robotics and Automation (ICRA). IEEE. https://doi.org/10.34726/242

reposiTUm-DOI:

10.34726/242

CatalogPlus:

AC17204828

Publisher DOI:

10.1109/ICRA40945.2020.9196608

Publication type:

Conference contribution - Full-paper contribution

Language:

English

Authors:

Lechner, Mathias
Hasani, Ramin
Rus, Daniela
Grosu, Radu

Organizational unit:

E191-01 - Research Unit of Cyber-Physical Systems

Published in:

2020 IEEE International Conference on Robotics and Automation (ICRA)

ISBN:

978-1-7281-7395-5

Volume:

2020

Book DOI:

10.1109/ICRA40945.2020

Date (published):

31-Aug-2020

Extent:

Publisher:

IEEE

Keywords:

dynamical systems; Robot Learning; Continuous-time recurrent neural networks; Deep Learning; Machine Learning; Artificial Intelligence; neural networks

Abstract:

Traditional robotic control suites require profound task-specific knowledge for designing, building and testing control software. The rise of Deep Learning has enabled end-to-end solutions to be learned entirely from data, requiring minimal knowledge about the application area. We design a learning scheme to train end-to-end linear dynamical systems (LDSs) by gradient descent in imitation learning robotic domains. We introduce a new regularization loss component, together with a learning algorithm, that improves the stability of the learned autonomous system by forcing the eigenvalues of the internal state updates of an LDS to be negative reals. We evaluate our approach on a series of real-life and simulated robotic experiments, in comparison to linear and nonlinear Recurrent Neural Network (RNN) architectures. Our results show that our stabilizing method significantly improves the test performance of LDSs, enabling such linear models to match the performance of contemporary nonlinear RNN architectures. A video of the obstacle avoidance performance of our method on a mobile robot in unseen environments, compared to other methods, can be viewed at https://youtu.be/mhEsCoNao5E.
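
The stabilizing idea named in the abstract can be illustrated with a minimal sketch based on the Gershgorin circle theorem. This is not the authors' implementation: the function name `gershgorin_penalty`, the `margin` parameter and the hinge form of the penalty are assumptions made for illustration only. The theorem states that every eigenvalue of a square matrix A lies in some disc centred at a diagonal entry A[i][i] with radius equal to the sum of the absolute off-diagonal entries of row i. Penalizing any disc that reaches into the right half-plane therefore pushes the eigenvalues of the state-update matrix toward negative real parts. Note that a zero penalty in this sketch bounds only the real parts; it does not by itself make the eigenvalues real.

```python
def gershgorin_penalty(A, margin=0.0):
    """Hypothetical Gershgorin-style stability penalty (illustrative only).

    A is a square matrix given as a list of rows. Each row i defines a
    disc centred at A[i][i] with radius sum_{j != i} |A[i][j]|. The
    penalty adds up how far the right edge of each disc extends past
    -margin. If the result is zero, every eigenvalue of A has real
    part <= -margin.
    """
    penalty = 0.0
    for i, row in enumerate(A):
        # Radius of the i-th disc: absolute off-diagonal row sum.
        radius = sum(abs(v) for j, v in enumerate(row) if j != i)
        # Right-most point of the i-th disc on the real axis.
        rightmost = row[i] + radius
        # Hinge: only discs that cross -margin contribute.
        penalty += max(0.0, rightmost + margin)
    return penalty


# Example matrices (made up for illustration).
stable = [[-2.0, 0.5], [0.3, -1.0]]    # both discs lie left of zero
unstable = [[0.1, 0.5], [0.3, -1.0]]   # first disc crosses zero

print(gershgorin_penalty(stable))
print(gershgorin_penalty(unstable))
```

In a training loop, a term of this kind would be added to the imitation loss with a weighting coefficient, so that gradient descent trades off task error against the distance of the discs from the stable half-plane.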

Additional information:

“© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.”

License:

In Copyright

Appears in collections:

Conference Paper