When theory and biology differ: The relationship between reward prediction errors and expectancy

•The relationship between reward expectancy and the reward positivity is sigmoidal.•This non-linear trend follows the biological principles of the dopamine system.•Our results suggest a revision of reinforcement learning mathematical models. Comparisons between expectations and outcomes are critical...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Biological psychology 2017-10, Vol.129, p.265-272
Hauptverfasser:	Williams, Chad C., Hassall, Cameron D., Trska, Robert, Holroyd, Clay B., Krigolson, Olave E.
Format:	Artikel
Sprache:	eng
Schlagworte:	Brain - physiology Dopamine Electroencephalography Evoked Potentials - physiology Feedback-related negativity Female Humans Learning - physiology Male Prediction error Reinforcement learning Rescorla-Wagner learning rule Reward Reward positivity Young Adult
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•The relationship between reward expectancy and the reward positivity is sigmoidal.•This non-linear trend follows the biological principles of the dopamine system.•Our results suggest a revision of reinforcement learning mathematical models. Comparisons between expectations and outcomes are critical for learning. Termed prediction errors, the violations of expectancy that occur when outcomes differ from expectations are used to modify value and shape behaviour. In the present study, we examined how a wide range of expectancy violations impacted neural signals associated with feedback processing. Participants performed a time estimation task in which they had to guess the duration of one second while their electroencephalogram was recorded. In a key manipulation, we varied task difficulty across the experiment to create a range of different feedback expectancies − reward feedback was either very expected, expected, 50/50, unexpected, or very unexpected. As predicted, the amplitude of the reward positivity, a component of the human event-related brain potential associated with feedback processing, scaled inversely with expectancy (e.g., unexpected feedback yielded a larger reward positivity than expected feedback). Interestingly, the scaling of the reward positivity to outcome expectancy was not linear as would be predicted by some theoretical models. Specifically, we found that the amplitude of the reward positivity was about equivalent for very expected and expected feedback, and for very unexpected and unexpected feedback. As such, our results demonstrate a sigmoidal relationship between reward expectancy and the amplitude of the reward positivity, with interesting implications for theories of reinforcement learning.
ISSN:	0301-0511 1873-6246
DOI:	10.1016/j.biopsycho.2017.09.007