Learning, Reward, and Decision Making
In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a simil...
Gespeichert in:
Veröffentlicht in: | Annual review of psychology 2017-01, Vol.68 (1), p.73-100 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 100 |
---|---|
container_issue | 1 |
container_start_page | 73 |
container_title | Annual review of psychology |
container_volume | 68 |
creator | O'Doherty, John P Cockburn, Jeffrey Pauli, Wolfgang M |
description | In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a similar dichotomy in the realm of Pavlovian conditioning. We evaluate evidence from neuroscience supporting the existence of at least partly distinct neuronal substrates contributing to the key computations necessary for the function of these different control systems. We consider the nature of the interactions between these systems and show how these interactions can lead to either adaptive or maladaptive behavioral outcomes. We then review evidence that an additional system guides inference concerning the hidden states of other agents, such as their beliefs, preferences, and intentions, in a social context. We also describe emerging evidence for an arbitration mechanism between model-based and model-free reinforcement learning, placing such a mechanism within the broader context of the hierarchical control of behavior. |
doi_str_mv | 10.1146/annurev-psych-010416-044216 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_1855788400</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>4309552231</sourcerecordid><originalsourceid>FETCH-LOGICAL-a613t-f9998dadc2a68092013e407535f488d4012fc3712fbf669af4c4b9d92318baf33</originalsourceid><addsrcrecordid>eNqVkV1LwzAUhoMobk7_ggyG4MWqOWmaJgiizE-YCKLXIW3TrbNLZ7Ju7N-b2TnUO3ORXJznvMnJg1AP8BkAZefKmNrqRTBzq3QcYMAUWIApJcB2UBsiGgUE82gXtTFmLKAh5i104NwE-8Uivo9aJGY8BhBtdDLUyprCjPrdF71UNut3lcm6NzotXFGZ7pN698VDtJer0umjzdlBb3e3r4OHYPh8_zi4HgaKQTgPciEEz1SWEsU4FgRDqCmOozDKKecZxUDyNIz9nuSMCZXTlCYiEyQEnqg8DDvossmd1clUZ6k2c6tKObPFVNmVrFQhf1dMMZajaiEZCMLi2AecbgJs9VFrN5fTwqW6LJXRVe0k8CiKOacYe7T3B51UtTV-PE8xwhgAp566aKjUVs5ZnW8fA1iudciNDvmlQzY6ZKPDdx__nGfb-_3_HrhqgHWKKn1OoZfuX3d8Aos9ntw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1862661184</pqid></control><display><type>article</type><title>Learning, Reward, and Decision Making</title><source>Annual Reviews Complete A-Z List</source><source>MEDLINE</source><creator>O'Doherty, John P ; Cockburn, Jeffrey ; Pauli, Wolfgang M</creator><creatorcontrib>O'Doherty, John P ; Cockburn, Jeffrey ; Pauli, Wolfgang M</creatorcontrib><description>In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a similar dichotomy in the realm of Pavlovian conditioning. We evaluate evidence from neuroscience supporting the existence of at least partly distinct neuronal substrates contributing to the key computations necessary for the function of these different control systems. We consider the nature of the interactions between these systems and show how these interactions can lead to either adaptive or maladaptive behavioral outcomes. We then review evidence that an additional system guides inference concerning the hidden states of other agents, such as their beliefs, preferences, and intentions, in a social context. We also describe emerging evidence for an arbitration mechanism between model-based and model-free reinforcement learning, placing such a mechanism within the broader context of the hierarchical control of behavior.</description><identifier>ISSN: 0066-4308</identifier><identifier>EISSN: 1545-2085</identifier><identifier>DOI: 10.1146/annurev-psych-010416-044216</identifier><identifier>PMID: 27687119</identifier><identifier>CODEN: ARPSAC</identifier><language>eng</language><publisher>United States: Annual Reviews</publisher><subject>Behavior ; Brain - physiology ; cognitive map ; Conditioning (Psychology) - physiology ; Decision making ; Decision Making - physiology ; Goals ; Humans ; instrumental ; Learning ; Learning - physiology ; model based ; model free ; Neurosciences ; outcome valuation ; Pavlovian ; Reinforcement (Psychology) ; Reward ; Rewards</subject><ispartof>Annual review of psychology, 2017-01, Vol.68 (1), p.73-100</ispartof><rights>Copyright © 2017 by Annual Reviews. All rights reserved 2017</rights><rights>Copyright Annual Reviews, Inc. 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a613t-f9998dadc2a68092013e407535f488d4012fc3712fbf669af4c4b9d92318baf33</citedby><cites>FETCH-LOGICAL-a613t-f9998dadc2a68092013e407535f488d4012fc3712fbf669af4c4b9d92318baf33</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.annualreviews.org/content/journals/10.1146/annurev-psych-010416-044216?crawler=true&mimetype=application/pdf$$EPDF$$P50$$Gannualreviews$$H</linktopdf><linktohtml>$$Uhttps://www.annualreviews.org/content/journals/10.1146/annurev-psych-010416-044216$$EHTML$$P50$$Gannualreviews$$H</linktohtml><link.rule.ids>70,230,314,777,781,882,4168,27905,27906,78003,78004</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27687119$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>O'Doherty, John P</creatorcontrib><creatorcontrib>Cockburn, Jeffrey</creatorcontrib><creatorcontrib>Pauli, Wolfgang M</creatorcontrib><title>Learning, Reward, and Decision Making</title><title>Annual review of psychology</title><addtitle>Annu Rev Psychol</addtitle><description>In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a similar dichotomy in the realm of Pavlovian conditioning. We evaluate evidence from neuroscience supporting the existence of at least partly distinct neuronal substrates contributing to the key computations necessary for the function of these different control systems. We consider the nature of the interactions between these systems and show how these interactions can lead to either adaptive or maladaptive behavioral outcomes. We then review evidence that an additional system guides inference concerning the hidden states of other agents, such as their beliefs, preferences, and intentions, in a social context. We also describe emerging evidence for an arbitration mechanism between model-based and model-free reinforcement learning, placing such a mechanism within the broader context of the hierarchical control of behavior.</description><subject>Behavior</subject><subject>Brain - physiology</subject><subject>cognitive map</subject><subject>Conditioning (Psychology) - physiology</subject><subject>Decision making</subject><subject>Decision Making - physiology</subject><subject>Goals</subject><subject>Humans</subject><subject>instrumental</subject><subject>Learning</subject><subject>Learning - physiology</subject><subject>model based</subject><subject>model free</subject><subject>Neurosciences</subject><subject>outcome valuation</subject><subject>Pavlovian</subject><subject>Reinforcement (Psychology)</subject><subject>Reward</subject><subject>Rewards</subject><issn>0066-4308</issn><issn>1545-2085</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqVkV1LwzAUhoMobk7_ggyG4MWqOWmaJgiizE-YCKLXIW3TrbNLZ7Ju7N-b2TnUO3ORXJznvMnJg1AP8BkAZefKmNrqRTBzq3QcYMAUWIApJcB2UBsiGgUE82gXtTFmLKAh5i104NwE-8Uivo9aJGY8BhBtdDLUyprCjPrdF71UNut3lcm6NzotXFGZ7pN698VDtJer0umjzdlBb3e3r4OHYPh8_zi4HgaKQTgPciEEz1SWEsU4FgRDqCmOozDKKecZxUDyNIz9nuSMCZXTlCYiEyQEnqg8DDvossmd1clUZ6k2c6tKObPFVNmVrFQhf1dMMZajaiEZCMLi2AecbgJs9VFrN5fTwqW6LJXRVe0k8CiKOacYe7T3B51UtTV-PE8xwhgAp566aKjUVs5ZnW8fA1iudciNDvmlQzY6ZKPDdx__nGfb-_3_HrhqgHWKKn1OoZfuX3d8Aos9ntw</recordid><startdate>20170103</startdate><enddate>20170103</enddate><creator>O'Doherty, John P</creator><creator>Cockburn, Jeffrey</creator><creator>Pauli, Wolfgang M</creator><general>Annual Reviews</general><general>Annual Reviews, Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8BJ</scope><scope>FQK</scope><scope>JBE</scope><scope>K9.</scope><scope>NAPCQ</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20170103</creationdate><title>Learning, Reward, and Decision Making</title><author>O'Doherty, John P ; Cockburn, Jeffrey ; Pauli, Wolfgang M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a613t-f9998dadc2a68092013e407535f488d4012fc3712fbf669af4c4b9d92318baf33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Behavior</topic><topic>Brain - physiology</topic><topic>cognitive map</topic><topic>Conditioning (Psychology) - physiology</topic><topic>Decision making</topic><topic>Decision Making - physiology</topic><topic>Goals</topic><topic>Humans</topic><topic>instrumental</topic><topic>Learning</topic><topic>Learning - physiology</topic><topic>model based</topic><topic>model free</topic><topic>Neurosciences</topic><topic>outcome valuation</topic><topic>Pavlovian</topic><topic>Reinforcement (Psychology)</topic><topic>Reward</topic><topic>Rewards</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>O'Doherty, John P</creatorcontrib><creatorcontrib>Cockburn, Jeffrey</creatorcontrib><creatorcontrib>Pauli, Wolfgang M</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>International Bibliography of the Social Sciences (IBSS)</collection><collection>International Bibliography of the Social Sciences</collection><collection>International Bibliography of the Social Sciences</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Nursing & Allied Health Premium</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Annual review of psychology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>O'Doherty, John P</au><au>Cockburn, Jeffrey</au><au>Pauli, Wolfgang M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Learning, Reward, and Decision Making</atitle><jtitle>Annual review of psychology</jtitle><addtitle>Annu Rev Psychol</addtitle><date>2017-01-03</date><risdate>2017</risdate><volume>68</volume><issue>1</issue><spage>73</spage><epage>100</epage><pages>73-100</pages><issn>0066-4308</issn><eissn>1545-2085</eissn><coden>ARPSAC</coden><abstract>In this review, we summarize findings supporting the existence of multiple behavioral strategies for controlling reward-related behavior, including a dichotomy between the goal-directed or model-based system and the habitual or model-free system in the domain of instrumental conditioning and a similar dichotomy in the realm of Pavlovian conditioning. We evaluate evidence from neuroscience supporting the existence of at least partly distinct neuronal substrates contributing to the key computations necessary for the function of these different control systems. We consider the nature of the interactions between these systems and show how these interactions can lead to either adaptive or maladaptive behavioral outcomes. We then review evidence that an additional system guides inference concerning the hidden states of other agents, such as their beliefs, preferences, and intentions, in a social context. We also describe emerging evidence for an arbitration mechanism between model-based and model-free reinforcement learning, placing such a mechanism within the broader context of the hierarchical control of behavior.</abstract><cop>United States</cop><pub>Annual Reviews</pub><pmid>27687119</pmid><doi>10.1146/annurev-psych-010416-044216</doi><tpages>28</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0066-4308 |
ispartof | Annual review of psychology, 2017-01, Vol.68 (1), p.73-100 |
issn | 0066-4308 1545-2085 |
language | eng |
recordid | cdi_proquest_miscellaneous_1855788400 |
source | Annual Reviews Complete A-Z List; MEDLINE |
subjects | Behavior Brain - physiology cognitive map Conditioning (Psychology) - physiology Decision making Decision Making - physiology Goals Humans instrumental Learning Learning - physiology model based model free Neurosciences outcome valuation Pavlovian Reinforcement (Psychology) Reward Rewards |
title | Learning, Reward, and Decision Making |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T09%3A59%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Learning,%20Reward,%20and%20Decision%20Making&rft.jtitle=Annual%20review%20of%20psychology&rft.au=O'Doherty,%20John%20P&rft.date=2017-01-03&rft.volume=68&rft.issue=1&rft.spage=73&rft.epage=100&rft.pages=73-100&rft.issn=0066-4308&rft.eissn=1545-2085&rft.coden=ARPSAC&rft_id=info:doi/10.1146/annurev-psych-010416-044216&rft_dat=%3Cproquest_pubme%3E4309552231%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1862661184&rft_id=info:pmid/27687119&rfr_iscdi=true |