Learning Intuitive Physics and One-Shot Imitation Using State-Action-Prediction Self-Organizing Maps

Bibliographic details
Published in:	Computational Intelligence and Neuroscience, 2021, Vol. 2021 (1), p. 5590445
Main authors:	Stetter, Martin; Lang, Elmar W.
Format:	Article
Language:	English
Subjects:	Algorithms; Biomechanical Phenomena; Deep learning; Exploration; Humans; Imitative Behavior; Inference; Intelligence; Kinematics; Machine learning; Neural networks; Pattern recognition; Physical properties; Physics; Planning; Representations; Self organizing maps

Description
Abstract:	Human learning and intelligence work differently from the supervised pattern recognition approach adopted in most deep learning architectures. Humans seem to learn rich representations by exploration and imitation, build causal models of the world, and use both to flexibly solve new tasks. We suggest a simple but effective unsupervised model which develops such characteristics. The agent learns to represent the dynamical physical properties of its environment by intrinsically motivated exploration and performs inference on this representation to reach goals. For this, a set of self-organizing maps which represent state-action pairs is combined with a causal model for sequence prediction. The proposed system is evaluated in the cartpole environment. After an initial phase of playful exploration, the agent can execute kinematic simulations of the environment’s future and use those for action planning. We demonstrate its performance on a set of related but different one-shot imitation tasks, which the agent flexibly solves in an active inference style.
ISSN:	1687-5265, 1687-5273
DOI:	10.1155/2021/5590445
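
Note:	The abstract mentions self-organizing maps that represent state-action pairs. As a rough illustration of that building block only, the following is a minimal sketch of a generic Kohonen self-organizing map trained on concatenated state-action vectors. It is not the authors' implementation: the grid size, learning-rate schedule, the random stand-in data, and all function names (`best_matching_unit`, `som_update`, `make_som`, `quantization_error`) are assumptions made for this example, and the sequence-prediction and planning parts described in the abstract are not covered.

```python
import math
import random


def best_matching_unit(weights, x):
    """Return the index of the unit whose weight vector is closest to x."""
    return min(
        range(len(weights)),
        key=lambda i: sum((w - v) ** 2 for w, v in zip(weights[i], x)),
    )


def som_update(weights, coords, x, lr=0.5, sigma=1.0):
    """One Kohonen step: pull each unit toward x, scaled by a Gaussian
    of its grid distance to the best-matching unit (BMU)."""
    bmu = best_matching_unit(weights, x)
    for i, w in enumerate(weights):
        d2 = sum((a - b) ** 2 for a, b in zip(coords[i], coords[bmu]))
        h = math.exp(-d2 / (2.0 * sigma ** 2))  # neighborhood strength
        weights[i] = [wj + lr * h * (xj - wj) for wj, xj in zip(w, x)]
    return bmu


def make_som(rows, cols, dim, rng):
    """Create grid coordinates and randomly initialized weight vectors."""
    coords = [(r, c) for r in range(rows) for c in range(cols)]
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in coords]
    return weights, coords


def quantization_error(weights, samples):
    """Mean Euclidean distance from each sample to its BMU."""
    total = 0.0
    for x in samples:
        w = weights[best_matching_unit(weights, x)]
        total += math.sqrt(sum((a - b) ** 2 for a, b in zip(w, x)))
    return total / len(samples)


rng = random.Random(0)

# Hypothetical stand-in data: four state values plus one action value
# per sample. A real agent would collect these by exploring cartpole.
samples = [
    [rng.uniform(-1.0, 1.0) for _ in range(4)] + [rng.choice([-1.0, 1.0])]
    for _ in range(200)
]

weights, coords = make_som(rows=5, cols=5, dim=5, rng=rng)

epochs = 20
for epoch in range(epochs):
    # Assumed schedule: shrink learning rate and neighborhood over time.
    lr = 0.5 * (1.0 - epoch / epochs)
    sigma = 2.0 * (1.0 - epoch / epochs) + 0.3
    for x in samples:
        som_update(weights, coords, x, lr, sigma)
```

Calling `quantization_error(weights, samples)` before and after the training loop gives a simple sanity check on how well the map covers the sampled state-action vectors.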