<div class="csl-bib-body">
<div class="csl-entry">Frieder, S., & Lukasiewicz, T. (2022). (Non-)Convergence Results for Predictive Coding Networks. In <i>Proceedings of the 39th International Conference on Machine Learning</i> (pp. 6793–6810). http://hdl.handle.net/20.500.12708/187543</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/187543
-
dc.description.abstract
Predictive coding networks (PCNs) are (un)supervised learning models, coming from neuroscience, that approximate how the brain works. One major open problem around PCNs is their convergence behavior. In this paper, we use dynamical systems theory to formally investigate the convergence of PCNs as they are used in machine learning. Doing so, we put their theory on a firm, rigorous basis, by developing a precise mathematical framework for PCN and show that for sufficiently small weights and initializations, PCNs converge for any input. Thereby, we provide the theoretical assurance that previous implementations, whose convergence was assessed solely by numerical experiments, can indeed capture the correct behavior of PCNs. Outside of the identified regime of small weights and small initializations, we show via a counterexample that PCNs can diverge, countering common beliefs held in the community. This is achieved by identifying a Neimark-Sacker bifurcation in a PCN of small size, which gives rise to an unstable fixed point and an invariant curve around it.
en
dc.language.iso
en
-
dc.relation.ispartofseries
Proceedings of Machine Learning Research
-
dc.subject
predictive coding
en
dc.subject
convergence analysis
en
dc.subject
dynamical systems theory
en
dc.title
(Non-)Convergence Results for Predictive Coding Networks
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.contributor.affiliation
University of Oxford, United Kingdom of Great Britain and Northern Ireland (the)
-
dc.description.startpage
6793
-
dc.description.endpage
6810
-
dc.type.category
Full-Paper Contribution
-
tuw.booktitle
Proceedings of the 39th International Conference on Machine Learning
-
tuw.container.volume
162
-
tuw.peerreviewed
true
-
tuw.researchTopic.id
I2
-
tuw.researchTopic.name
Computer Engineering and Software-Intensive Systems