Frieder, S., & Lukasiewicz, T. (2022). (Non-)Convergence Results for Predictive Coding Networks. In Proceedings of the 39th International Conference on Machine Learning (pp. 6793–6810). http://hdl.handle.net/20.500.12708/187543
E192-07 - Research Unit Artificial Intelligence Techniques
E192-03 - Research Unit Knowledge Based Systems
E192 - Institute of Logic and Computation
Published in:
Proceedings of the 39th International Conference on Machine Learning
Volume:
162
Date (published):
2022
Event name:
39th International Conference on Machine Learning (ICML 2022)
Event date:
17-Jul-2022 - 23-Jul-2022
Event place:
Baltimore, United States of America
Number of Pages:
18
Peer reviewed:
Yes
Keywords:
predictive coding; convergence analysis; dynamical systems theory
Abstract:
Predictive coding networks (PCNs) are (un)supervised learning models, originating in neuroscience, that approximate how the brain works. One major open problem concerning PCNs is their convergence behavior. In this paper, we use dynamical systems theory to formally investigate the convergence of PCNs as they are used in machine learning. In doing so, we put their theory on a firm, rigorous basis by developing a precise mathematical framework for PCNs, and we show that, for sufficiently small weights and initializations, PCNs converge for any input. We thereby provide the theoretical assurance that previous implementations, whose convergence was assessed solely by numerical experiments, indeed capture the correct behavior of PCNs. Outside the identified regime of small weights and small initializations, we show via a counterexample that PCNs can diverge, countering common beliefs held in the community. This is achieved by identifying a Neimark–Sacker bifurcation in a PCN of small size, which gives rise to an unstable fixed point and an invariant curve around it.
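The small-weights convergence regime described in the abstract can be illustrated with a minimal sketch. The code below is a hypothetical linear PCN with one clamped input layer and two relaxing layers; the update rule is plain gradient descent on the squared prediction-error energy, which is a simplification of the general formulation analyzed in the paper (function names, layer sizes, and the learning rate are illustrative assumptions, not the authors' implementation).

```python
import numpy as np

def pcn_inference(W1, W2, x0, lr=0.1, steps=500):
    """Inference (state-update) dynamics of a tiny linear 3-layer PCN.

    Layer 0 is clamped to the input x0; layers x1 and x2 relax by
    gradient descent on the prediction-error energy
        E = 0.5*||x1 - W1 @ x0||^2 + 0.5*||x2 - W2 @ x1||^2.
    Returns the final states and the norm of the last update direction.
    """
    x1 = np.zeros(W1.shape[0])
    x2 = np.zeros(W2.shape[0])
    for _ in range(steps):
        e1 = x1 - W1 @ x0          # prediction error at layer 1
        e2 = x2 - W2 @ x1          # prediction error at layer 2
        dx1 = -e1 + W2.T @ e2      # negative gradient of E w.r.t. x1
        dx2 = -e2                  # negative gradient of E w.r.t. x2
        x1 = x1 + lr * dx1
        x2 = x2 + lr * dx2
    return x1, x2, np.linalg.norm(dx1) + np.linalg.norm(dx2)

rng = np.random.default_rng(0)
# Small random weights: the regime in which convergence is guaranteed.
W1 = 0.1 * rng.standard_normal((4, 3))
W2 = 0.1 * rng.standard_normal((2, 4))
x0 = rng.standard_normal(3)
x1, x2, residual = pcn_inference(W1, W2, x0)
print(residual)  # near zero: the dynamics settled to a fixed point
```

For this linear network the fixed point is explicit (x1 = W1 @ x0, x2 = W2 @ x1), so convergence is easy to verify numerically; the paper's counterexample shows that outside the small-weights regime such relaxation can instead circle an unstable fixed point along an invariant curve.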