Reconciling shared versus context-specific information in a neural network model of latent causes

It has been proposed that, when processing a stream of events, humans divide their experiences in terms of inferred latent causes (LCs) to support context-dependent learning. However, when shared structure is present across contexts, it is still unclear how the “splitting” of LCs and learning of sha...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Scientific reports 2024-07, Vol.14 (1), p.16782-15, Article 16782
Hauptverfasser:	Lu, Qihong, Nguyen, Tan T., Zhang, Qiong, Hasson, Uri, Griffiths, Thomas L., Zacks, Jeffrey M., Gershman, Samuel J., Norman, Kenneth A.
Format:	Artikel
Sprache:	eng
Schlagworte:	631/378/1595 631/477 631/477/2811 Algorithms Bayes Theorem Bayesian analysis Humanities and Social Sciences Humans Information processing Learning multidisciplinary Neural networks Neural Networks, Computer Science Science (multidisciplinary) Structure-function relationships
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	It has been proposed that, when processing a stream of events, humans divide their experiences in terms of inferred latent causes (LCs) to support context-dependent learning. However, when shared structure is present across contexts, it is still unclear how the “splitting” of LCs and learning of shared structure can be simultaneously achieved. Here, we present the Latent Cause Network (LCNet), a neural network model of LC inference. Through learning, it naturally stores structure that is shared across tasks in the network weights. Additionally, it represents context-specific structure using a context module, controlled by a Bayesian nonparametric inference algorithm, which assigns a unique context vector for each inferred LC. Across three simulations, we found that LCNet could (1) extract shared structure across LCs in a function learning task while avoiding catastrophic interference, (2) capture human data on curriculum effects in schema learning, and (3) infer the underlying event structure when processing naturalistic videos of daily events. Overall, these results demonstrate a computationally feasible approach to reconciling shared structure and context-specific structure in a model of LCs that is scalable from laboratory experiment settings to naturalistic settings.
ISSN:	2045-2322 2045-2322
DOI:	10.1038/s41598-024-64272-5