Changes in the stimulus‐preceding negativity and lateralized readiness potential during reinforcement learning

According to reinforcement learning theory, dopamine‐dependent anticipatory processes play a critical role in learning from action outcomes such as feedback or reward. To better understand outcome anticipation, we examined variation in slow cortical potentials and assessed their changes over the cou...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Psychophysiology 2017-07, Vol.54 (7), p.969-981
Hauptverfasser: Ren, Xi, Valle‐Inclán, Fernando, Tukaiev, Sergii, Hackley, Steven A.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:According to reinforcement learning theory, dopamine‐dependent anticipatory processes play a critical role in learning from action outcomes such as feedback or reward. To better understand outcome anticipation, we examined variation in slow cortical potentials and assessed their changes over the course of motor‐skill acquisition. Healthy young adults learned a series of precisely timed, key press sequences. Feedback was delivered at a delay of either 2.5 or 8 s, to encourage use of either the striatally mediated, habit learning system or the hippocampus‐dependent, episodic memory system, respectively. During the 2.5‐s delay, the stimulus‐preceding negativity (SPN) was shown to decline in amplitude across trials, confirming previous results from a perceptual categorization task (Morís, Luque, & Rodríguez‐Fornells, 2013). This falsifies the hypothesis that SPN reflects specific outcome predictions, on the assumption that the ability to make such predictions should improve as a task is mastered. An SPN was also evident during the 8‐s delay, but it increased in amplitude across trials. At the conclusion of the 8‐s but not the 2.5‐s prefeedback interval, a reversed‐polarity lateralized readiness potential (LRP) was noted. It was suggested that this might indicate maintenance of an action representation for comparison with the feedback display. If so, this would constitute the first direct psychophysiological evidence for a popular hypothetical construct in quantitative models of reinforcement learning, the so‐called eligibility trace.
ISSN:0048-5772
1469-8986
1540-5958
DOI:10.1111/psyp.12859