Reconciling Reinforcement Learning Models With Behavioral Extinction and Renewal: Implications for Addiction, Relapse, and Problem Gambling

Because learned associations are quickly renewed following extinction, the extinction process must include processes other than unlearning. However, reinforcement learning models, such as the temporal difference reinforcement learning (TDRL) model, treat extinction as an unlearning of associated val...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Psychological review 2007-07, Vol.114 (3), p.784-805
Hauptverfasser:	Redish, A. David, Jensen, Steve, Johnson, Adam, Kurth-Nelson, Zeb
Format:	Artikel
Sprache:	eng
Schlagworte:	Addiction Addictive behaviors Adult and adolescent clinical studies Animals Association Learning Behavior Change Behavior Problems Behavioural psychology Biochemistry Biological and medical sciences Cognitive psychology Cues Decision Making Dopamine Drug addiction Educational theory Extinction (Learning) Extinction, Psychological Gambling Gambling - psychology Gambling Disorder Humans Learning Learning Processes Medical sciences Mental Recall Miscellaneous Models, Statistical Motivation Probability Learning Psychology of learning Psychology. Psychoanalysis. Psychiatry Psychopathology. Psychiatry Recidivism Recurrence Reinforcement Reinforcement Schedule Relapse (Disorders) Rewards Substance-Related Disorders - psychology Time
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Because learned associations are quickly renewed following extinction, the extinction process must include processes other than unlearning. However, reinforcement learning models, such as the temporal difference reinforcement learning (TDRL) model, treat extinction as an unlearning of associated value and are thus unable to capture renewal. TDRL models are based on the hypothesis that dopamine carries a reward prediction error signal; these models predict reward by driving that reward error to zero. The authors construct a TDRL model that can accommodate extinction and renewal through two simple processes: (a) a TDRL process that learns the value of situation-action pairs and (b) a situation recognition process that categorizes the observed cues into situations. This model has implications for dysfunctional states, including relapse after addiction and problem gambling.
ISSN:	0033-295X 1939-1471
DOI:	10.1037/0033-295X.114.3.784