Learning, Reward, and Decision Making

In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a simil...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Annual review of psychology 2017-01, Vol.68 (1), p.73-100
Hauptverfasser:	O'Doherty, John P, Cockburn, Jeffrey, Pauli, Wolfgang M
Format:	Artikel
Sprache:	eng
Schlagworte:	Behavior Brain - physiology cognitive map Conditioning (Psychology) - physiology Decision making Decision Making - physiology Goals Humans instrumental Learning Learning - physiology model based model free Neurosciences outcome valuation Pavlovian Reinforcement (Psychology) Reward Rewards
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	100
container_issue	1
container_start_page	73
container_title	Annual review of psychology
container_volume	68
creator	O'Doherty, John P Cockburn, Jeffrey Pauli, Wolfgang M
description	In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a similar dichotomy in the realm of Pavlovian conditioning. We evaluate evidence from neuroscience supporting the existence of at least partly distinct neuronal substrates contributing to the key computations necessary for the function of these different control systems. We consider the nature of the interactions between these systems and show how these interactions can lead to either adaptive or maladaptive behavioral outcomes. We then review evidence that an additional system guides inference concerning the hidden states of other agents, such as their beliefs, preferences, and intentions, in a social context. We also describe emerging evidence for an arbitration mechanism between model-based and model-free reinforcement learning, placing such a mechanism within the broader context of the hierarchical control of behavior.
doi_str_mv	10.1146/annurev-psych-010416-044216
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_1855788400</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>4309552231</sourcerecordid><originalsourceid>FETCH-LOGICAL-a613t-f9998dadc2a68092013e407535f488d4012fc3712fbf669af4c4b9d92318baf33</originalsourceid><addsrcrecordid>eNqVkV1LwzAUhoMobk7_ggyG4MWqOWmaJgiizE-YCKLXIW3TrbNLZ7Ju7N-b2TnUO3ORXJznvMnJg1AP8BkAZefKmNrqRTBzq3QcYMAUWIApJcB2UBsiGgUE82gXtTFmLKAh5i104NwE-8Uivo9aJGY8BhBtdDLUyprCjPrdF71UNut3lcm6NzotXFGZ7pN698VDtJer0umjzdlBb3e3r4OHYPh8_zi4HgaKQTgPciEEz1SWEsU4FgRDqCmOozDKKecZxUDyNIz9nuSMCZXTlCYiEyQEnqg8DDvossmd1clUZ6k2c6tKObPFVNmVrFQhf1dMMZajaiEZCMLi2AecbgJs9VFrN5fTwqW6LJXRVe0k8CiKOacYe7T3B51UtTV-PE8xwhgAp566aKjUVs5ZnW8fA1iudciNDvmlQzY6ZKPDdx__nGfb-_3_HrhqgHWKKn1OoZfuX3d8Aos9ntw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1862661184</pqid></control><display><type>article</type><title>Learning, Reward, and Decision Making</title><source>Annual Reviews Complete A-Z List</source><source>MEDLINE</source><creator>O'Doherty, John P ; Cockburn, Jeffrey ; Pauli, Wolfgang M</creator><creatorcontrib>O'Doherty, John P ; Cockburn, Jeffrey ; Pauli, Wolfgang M</creatorcontrib><description>In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a similar dichotomy in the realm of Pavlovian conditioning. We evaluate evidence from neuroscience supporting the existence of at least partly distinct neuronal substrates contributing to the key computations necessary for the function of these different control systems. We consider the nature of the interactions between these systems and show how these interactions can lead to either adaptive or maladaptive behavioral outcomes. We then review evidence that an additional system guides inference concerning the hidden states of other agents, such as their beliefs, preferences, and intentions, in a social context. We also describe emerging evidence for an arbitration mechanism between model-based and model-free reinforcement learning, placing such a mechanism within the broader context of the hierarchical control of behavior.</description><identifier>ISSN: 0066-4308</identifier><identifier>EISSN: 1545-2085</identifier><identifier>DOI: 10.1146/annurev-psych-010416-044216</identifier><identifier>PMID: 27687119</identifier><identifier>CODEN: ARPSAC</identifier><language>eng</language><publisher>United States: Annual Reviews</publisher><subject>Behavior ; Brain - physiology ; cognitive map ; Conditioning (Psychology) - physiology ; Decision making ; Decision Making - physiology ; Goals ; Humans ; instrumental ; Learning ; Learning - physiology ; model based ; model free ; Neurosciences ; outcome valuation ; Pavlovian ; Reinforcement (Psychology) ; Reward ; Rewards</subject><ispartof>Annual review of psychology, 2017-01, Vol.68 (1), p.73-100</ispartof><rights>Copyright © 2017 by Annual Reviews. All rights reserved 2017</rights><rights>Copyright Annual Reviews, Inc. 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a613t-f9998dadc2a68092013e407535f488d4012fc3712fbf669af4c4b9d92318baf33</citedby><cites>FETCH-LOGICAL-a613t-f9998dadc2a68092013e407535f488d4012fc3712fbf669af4c4b9d92318baf33</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.annualreviews.org/content/journals/10.1146/annurev-psych-010416-044216?crawler=true&mimetype=application/pdf$$EPDF$$P50$$Gannualreviews$$H</linktopdf><linktohtml>$$Uhttps://www.annualreviews.org/content/journals/10.1146/annurev-psych-010416-044216$$EHTML$$P50$$Gannualreviews$$H</linktohtml><link.rule.ids>70,230,314,777,781,882,4168,27905,27906,78003,78004</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27687119$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>O'Doherty, John P</creatorcontrib><creatorcontrib>Cockburn, Jeffrey</creatorcontrib><creatorcontrib>Pauli, Wolfgang M</creatorcontrib><title>Learning, Reward, and Decision Making</title><title>Annual review of psychology</title><addtitle>Annu Rev Psychol</addtitle><description>In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a similar dichotomy in the realm of Pavlovian conditioning. We evaluate evidence from neuroscience supporting the existence of at least partly distinct neuronal substrates contributing to the key computations necessary for the function of these different control systems. We consider the nature of the interactions between these systems and show how these interactions can lead to either adaptive or maladaptive behavioral outcomes. We then review evidence that an additional system guides inference concerning the hidden states of other agents, such as their beliefs, preferences, and intentions, in a social context. We also describe emerging evidence for an arbitration mechanism between model-based and model-free reinforcement learning, placing such a mechanism within the broader context of the hierarchical control of behavior.</description><subject>Behavior</subject><subject>Brain - physiology</subject><subject>cognitive map</subject><subject>Conditioning (Psychology) - physiology</subject><subject>Decision making</subject><subject>Decision Making - physiology</subject><subject>Goals</subject><subject>Humans</subject><subject>instrumental</subject><subject>Learning</subject><subject>Learning - physiology</subject><subject>model based</subject><subject>model free</subject><subject>Neurosciences</subject><subject>outcome valuation</subject><subject>Pavlovian</subject><subject>Reinforcement (Psychology)</subject><subject>Reward</subject><subject>Rewards</subject><issn>0066-4308</issn><issn>1545-2085</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqVkV1LwzAUhoMobk7_ggyG4MWqOWmaJgiizE-YCKLXIW3TrbNLZ7Ju7N-b2TnUO3ORXJznvMnJg1AP8BkAZefKmNrqRTBzq3QcYMAUWIApJcB2UBsiGgUE82gXtTFmLKAh5i104NwE-8Uivo9aJGY8BhBtdDLUyprCjPrdF71UNut3lcm6NzotXFGZ7pN698VDtJer0umjzdlBb3e3r4OHYPh8_zi4HgaKQTgPciEEz1SWEsU4FgRDqCmOozDKKecZxUDyNIz9nuSMCZXTlCYiEyQEnqg8DDvossmd1clUZ6k2c6tKObPFVNmVrFQhf1dMMZajaiEZCMLi2AecbgJs9VFrN5fTwqW6LJXRVe0k8CiKOacYe7T3B51UtTV-PE8xwhgAp566aKjUVs5ZnW8fA1iudciNDvmlQzY6ZKPDdx__nGfb-_3_HrhqgHWKKn1OoZfuX3d8Aos9ntw</recordid><startdate>20170103</startdate><enddate>20170103</enddate><creator>O'Doherty, John P</creator><creator>Cockburn, Jeffrey</creator><creator>Pauli, Wolfgang M</creator><general>Annual Reviews</general><general>Annual Reviews, Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8BJ</scope><scope>FQK</scope><scope>JBE</scope><scope>K9.</scope><scope>NAPCQ</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20170103</creationdate><title>Learning, Reward, and Decision Making</title><author>O'Doherty, John P ; Cockburn, Jeffrey ; Pauli, Wolfgang M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a613t-f9998dadc2a68092013e407535f488d4012fc3712fbf669af4c4b9d92318baf33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Behavior</topic><topic>Brain - physiology</topic><topic>cognitive map</topic><topic>Conditioning (Psychology) - physiology</topic><topic>Decision making</topic><topic>Decision Making - physiology</topic><topic>Goals</topic><topic>Humans</topic><topic>instrumental</topic><topic>Learning</topic><topic>Learning - physiology</topic><topic>model based</topic><topic>model free</topic><topic>Neurosciences</topic><topic>outcome valuation</topic><topic>Pavlovian</topic><topic>Reinforcement (Psychology)</topic><topic>Reward</topic><topic>Rewards</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>O'Doherty, John P</creatorcontrib><creatorcontrib>Cockburn, Jeffrey</creatorcontrib><creatorcontrib>Pauli, Wolfgang M</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>International Bibliography of the Social Sciences (IBSS)</collection><collection>International Bibliography of the Social Sciences</collection><collection>International Bibliography of the Social Sciences</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Nursing & Allied Health Premium</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Annual review of psychology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>O'Doherty, John P</au><au>Cockburn, Jeffrey</au><au>Pauli, Wolfgang M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Learning, Reward, and Decision Making</atitle><jtitle>Annual review of psychology</jtitle><addtitle>Annu Rev Psychol</addtitle><date>2017-01-03</date><risdate>2017</risdate><volume>68</volume><issue>1</issue><spage>73</spage><epage>100</epage><pages>73-100</pages><issn>0066-4308</issn><eissn>1545-2085</eissn><coden>ARPSAC</coden><abstract>In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a similar dichotomy in the realm of Pavlovian conditioning. We evaluate evidence from neuroscience supporting the existence of at least partly distinct neuronal substrates contributing to the key computations necessary for the function of these different control systems. We consider the nature of the interactions between these systems and show how these interactions can lead to either adaptive or maladaptive behavioral outcomes. We then review evidence that an additional system guides inference concerning the hidden states of other agents, such as their beliefs, preferences, and intentions, in a social context. We also describe emerging evidence for an arbitration mechanism between model-based and model-free reinforcement learning, placing such a mechanism within the broader context of the hierarchical control of behavior.</abstract><cop>United States</cop><pub>Annual Reviews</pub><pmid>27687119</pmid><doi>10.1146/annurev-psych-010416-044216</doi><tpages>28</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0066-4308
ispartof	Annual review of psychology, 2017-01, Vol.68 (1), p.73-100
issn	0066-4308 1545-2085
language	eng
recordid	cdi_proquest_miscellaneous_1855788400
source	Annual Reviews Complete A-Z List; MEDLINE
subjects	Behavior Brain - physiology cognitive map Conditioning (Psychology) - physiology Decision making Decision Making - physiology Goals Humans instrumental Learning Learning - physiology model based model free Neurosciences outcome valuation Pavlovian Reinforcement (Psychology) Reward Rewards
title	Learning, Reward, and Decision Making
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T09%3A59%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Learning,%20Reward,%20and%20Decision%20Making&rft.jtitle=Annual%20review%20of%20psychology&rft.au=O'Doherty,%20John%20P&rft.date=2017-01-03&rft.volume=68&rft.issue=1&rft.spage=73&rft.epage=100&rft.pages=73-100&rft.issn=0066-4308&rft.eissn=1545-2085&rft.coden=ARPSAC&rft_id=info:doi/10.1146/annurev-psych-010416-044216&rft_dat=%3Cproquest_pubme%3E4309552231%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1862661184&rft_id=info:pmid/27687119&rfr_iscdi=true