Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation
It has been suggested that dopamine (DA) represents reward-prediction-error (RPE) defined in reinforcement learning and therefore DA responds to unpredicted but not predicted reward. However, recent studies have found DA response sustained towards predictable reward in tasks involving self-paced beh...
Gespeichert in:
Veröffentlicht in: | PLoS computational biology 2016-10, Vol.12 (10), p.e1005145-e1005145 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | e1005145 |
---|---|
container_issue | 10 |
container_start_page | e1005145 |
container_title | PLoS computational biology |
container_volume | 12 |
creator | Kato, Ayaka Morita, Kenji |
description | It has been suggested that dopamine (DA) represents reward-prediction-error (RPE) defined in reinforcement learning and therefore DA responds to unpredicted but not predicted reward. However, recent studies have found DA response sustained towards predictable reward in tasks involving self-paced behavior, and suggested that this response represents a motivational signal. We have previously shown that RPE can sustain if there is decay/forgetting of learned-values, which can be implemented as decay of synaptic strengths storing learned-values. This account, however, did not explain the suggested link between tonic/sustained DA and motivation. In the present work, we explored the motivational effects of the value-decay in self-paced approach behavior, modeled as a series of 'Go' or 'No-Go' selections towards a goal. Through simulations, we found that the value-decay can enhance motivation, specifically, facilitate fast goal-reaching, albeit counterintuitively. Mathematical analyses revealed that underlying potential mechanisms are twofold: (1) decay-induced sustained RPE creates a gradient of 'Go' values towards a goal, and (2) value-contrasts between 'Go' and 'No-Go' are generated because while chosen values are continually updated, unchosen values simply decay. Our model provides potential explanations for the key experimental findings that suggest DA's roles in motivation: (i) slowdown of behavior by post-training blockade of DA signaling, (ii) observations that DA blockade severely impairs effortful actions to obtain rewards while largely sparing seeking of easily obtainable rewards, and (iii) relationships between the reward amount, the level of motivation reflected in the speed of behavior, and the average level of DA. These results indicate that reinforcement learning with value-decay, or forgetting, provides a parsimonious mechanistic account for the DA's roles in value-learning and motivation. Our results also suggest that when biological systems for value-learning are active even though learning has apparently converged, the systems might be in a state of dynamic equilibrium, where learning and forgetting are balanced. |
doi_str_mv | 10.1371/journal.pcbi.1005145 |
format | Article |
fullrecord | <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_1840944030</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A472228544</galeid><doaj_id>oai_doaj_org_article_4e537a01f2224c9a9da82adb503d203a</doaj_id><sourcerecordid>A472228544</sourcerecordid><originalsourceid>FETCH-LOGICAL-c830t-b64bd213096362466393b39aafbde61b8c999a860961dc5a0dfe93d45a70af513</originalsourceid><addsrcrecordid>eNqVkk1v1DAQhiMEoqXwDxBE4gKHXez4I8mlUlUorLSA1IWzNbGd4CWxF9up4N_jdNOqi3oA5ZDRzDOv5yvLnmO0xKTEb7du9Bb65U42ZokRYpiyB9kxZowsSsKqh3fso-xJCFuEklnzx9lRUZaEVxU-zjYXznc6RmO73Nj8UhvbOi_1oG3M1xq8nSJrY3-EfDOGCMZqlb9zOxiSlW9Ml2oIeXT5JxfNFUTj7NPsUZuc-tn8P8m-Xbz_ev5xsf7yYXV-tl7IiqC4aDhtVIEJqjnhBeWc1KQhNUDbKM1xU8m6rqHiKY6VZIBUq2uiKIMSQcswOcle7nV3vQtinkcQuKKophQRlIjVnlAOtmLnzQD-t3BgxLUjtS7ARyN7LahmpASE26IoqKyhVlAVoBqGiCoQgaR1Or82NoNWMg3IQ38gehix5rvo3JVgiBOKSRJ4PQt493PUIYrBBKn7Hqx243XdnKKi-ieUMJqWiKchvPoLvX8QM9VB6nXacSpRTqLijJap5YpRmqjlPVT6lB6MdFa3JvkPEt4cJCQm6l-xgzEEsdpc_gf7-ZCle1Z6F4LX7e2YMRLT9d80KabrF_P1p7QXd1d0m3Rz7uQP7JT-pg</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1840944030</pqid></control><display><type>article</type><title>Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Public Library of Science (PLoS) Journals Open Access</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Kato, Ayaka ; Morita, Kenji</creator><contributor>Gutkin, Boris S.</contributor><creatorcontrib>Kato, Ayaka ; Morita, Kenji ; Gutkin, Boris S.</creatorcontrib><description>It has been suggested that dopamine (DA) represents reward-prediction-error (RPE) defined in reinforcement learning and therefore DA responds to unpredicted but not predicted reward. However, recent studies have found DA response sustained towards predictable reward in tasks involving self-paced behavior, and suggested that this response represents a motivational signal. We have previously shown that RPE can sustain if there is decay/forgetting of learned-values, which can be implemented as decay of synaptic strengths storing learned-values. This account, however, did not explain the suggested link between tonic/sustained DA and motivation. In the present work, we explored the motivational effects of the value-decay in self-paced approach behavior, modeled as a series of 'Go' or 'No-Go' selections towards a goal. Through simulations, we found that the value-decay can enhance motivation, specifically, facilitate fast goal-reaching, albeit counterintuitively. Mathematical analyses revealed that underlying potential mechanisms are twofold: (1) decay-induced sustained RPE creates a gradient of 'Go' values towards a goal, and (2) value-contrasts between 'Go' and 'No-Go' are generated because while chosen values are continually updated, unchosen values simply decay. Our model provides potential explanations for the key experimental findings that suggest DA's roles in motivation: (i) slowdown of behavior by post-training blockade of DA signaling, (ii) observations that DA blockade severely impairs effortful actions to obtain rewards while largely sparing seeking of easily obtainable rewards, and (iii) relationships between the reward amount, the level of motivation reflected in the speed of behavior, and the average level of DA. These results indicate that reinforcement learning with value-decay, or forgetting, provides a parsimonious mechanistic account for the DA's roles in value-learning and motivation. Our results also suggest that when biological systems for value-learning are active even though learning has apparently converged, the systems might be in a state of dynamic equilibrium, where learning and forgetting are balanced.</description><identifier>ISSN: 1553-7358</identifier><identifier>ISSN: 1553-734X</identifier><identifier>EISSN: 1553-7358</identifier><identifier>DOI: 10.1371/journal.pcbi.1005145</identifier><identifier>PMID: 27736881</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Biology and Life Sciences ; Computer Simulation ; Corpus Striatum - physiology ; Decision Making - physiology ; Dopamine ; Dopamine - metabolism ; Dopaminergic Neurons - physiology ; Education ; Funding ; Health aspects ; Humans ; Medicine and Health Sciences ; Mental Recall - physiology ; Metabolism ; Models, Neurological ; Motivation ; Motivation - physiology ; Observations ; Physical Sciences ; R&D ; Reinforcement (Psychology) ; Research & development ; Research and Analysis Methods ; Roles ; Social Sciences ; Studies ; Velocity</subject><ispartof>PLoS computational biology, 2016-10, Vol.12 (10), p.e1005145-e1005145</ispartof><rights>COPYRIGHT 2016 Public Library of Science</rights><rights>2016 Public Library of Science. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited: Kato A, Morita K (2016) Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation. PLoS Comput Biol 12(10): e1005145. doi:10.1371/journal.pcbi.1005145</rights><rights>2016 Kato, Morita 2016 Kato, Morita</rights><rights>2016 Public Library of Science. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited: Kato A, Morita K (2016) Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation. PLoS Comput Biol 12(10): e1005145. doi:10.1371/journal.pcbi.1005145</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c830t-b64bd213096362466393b39aafbde61b8c999a860961dc5a0dfe93d45a70af513</citedby><cites>FETCH-LOGICAL-c830t-b64bd213096362466393b39aafbde61b8c999a860961dc5a0dfe93d45a70af513</cites><orcidid>0000-0002-6306-6600</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5063413/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5063413/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,2102,2928,23866,27924,27925,53791,53793,79600,79601</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27736881$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Gutkin, Boris S.</contributor><creatorcontrib>Kato, Ayaka</creatorcontrib><creatorcontrib>Morita, Kenji</creatorcontrib><title>Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation</title><title>PLoS computational biology</title><addtitle>PLoS Comput Biol</addtitle><description>It has been suggested that dopamine (DA) represents reward-prediction-error (RPE) defined in reinforcement learning and therefore DA responds to unpredicted but not predicted reward. However, recent studies have found DA response sustained towards predictable reward in tasks involving self-paced behavior, and suggested that this response represents a motivational signal. We have previously shown that RPE can sustain if there is decay/forgetting of learned-values, which can be implemented as decay of synaptic strengths storing learned-values. This account, however, did not explain the suggested link between tonic/sustained DA and motivation. In the present work, we explored the motivational effects of the value-decay in self-paced approach behavior, modeled as a series of 'Go' or 'No-Go' selections towards a goal. Through simulations, we found that the value-decay can enhance motivation, specifically, facilitate fast goal-reaching, albeit counterintuitively. Mathematical analyses revealed that underlying potential mechanisms are twofold: (1) decay-induced sustained RPE creates a gradient of 'Go' values towards a goal, and (2) value-contrasts between 'Go' and 'No-Go' are generated because while chosen values are continually updated, unchosen values simply decay. Our model provides potential explanations for the key experimental findings that suggest DA's roles in motivation: (i) slowdown of behavior by post-training blockade of DA signaling, (ii) observations that DA blockade severely impairs effortful actions to obtain rewards while largely sparing seeking of easily obtainable rewards, and (iii) relationships between the reward amount, the level of motivation reflected in the speed of behavior, and the average level of DA. These results indicate that reinforcement learning with value-decay, or forgetting, provides a parsimonious mechanistic account for the DA's roles in value-learning and motivation. Our results also suggest that when biological systems for value-learning are active even though learning has apparently converged, the systems might be in a state of dynamic equilibrium, where learning and forgetting are balanced.</description><subject>Biology and Life Sciences</subject><subject>Computer Simulation</subject><subject>Corpus Striatum - physiology</subject><subject>Decision Making - physiology</subject><subject>Dopamine</subject><subject>Dopamine - metabolism</subject><subject>Dopaminergic Neurons - physiology</subject><subject>Education</subject><subject>Funding</subject><subject>Health aspects</subject><subject>Humans</subject><subject>Medicine and Health Sciences</subject><subject>Mental Recall - physiology</subject><subject>Metabolism</subject><subject>Models, Neurological</subject><subject>Motivation</subject><subject>Motivation - physiology</subject><subject>Observations</subject><subject>Physical Sciences</subject><subject>R&D</subject><subject>Reinforcement (Psychology)</subject><subject>Research & development</subject><subject>Research and Analysis Methods</subject><subject>Roles</subject><subject>Social Sciences</subject><subject>Studies</subject><subject>Velocity</subject><issn>1553-7358</issn><issn>1553-734X</issn><issn>1553-7358</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>DOA</sourceid><recordid>eNqVkk1v1DAQhiMEoqXwDxBE4gKHXez4I8mlUlUorLSA1IWzNbGd4CWxF9up4N_jdNOqi3oA5ZDRzDOv5yvLnmO0xKTEb7du9Bb65U42ZokRYpiyB9kxZowsSsKqh3fso-xJCFuEklnzx9lRUZaEVxU-zjYXznc6RmO73Nj8UhvbOi_1oG3M1xq8nSJrY3-EfDOGCMZqlb9zOxiSlW9Ml2oIeXT5JxfNFUTj7NPsUZuc-tn8P8m-Xbz_ev5xsf7yYXV-tl7IiqC4aDhtVIEJqjnhBeWc1KQhNUDbKM1xU8m6rqHiKY6VZIBUq2uiKIMSQcswOcle7nV3vQtinkcQuKKophQRlIjVnlAOtmLnzQD-t3BgxLUjtS7ARyN7LahmpASE26IoqKyhVlAVoBqGiCoQgaR1Or82NoNWMg3IQ38gehix5rvo3JVgiBOKSRJ4PQt493PUIYrBBKn7Hqx243XdnKKi-ieUMJqWiKchvPoLvX8QM9VB6nXacSpRTqLijJap5YpRmqjlPVT6lB6MdFa3JvkPEt4cJCQm6l-xgzEEsdpc_gf7-ZCle1Z6F4LX7e2YMRLT9d80KabrF_P1p7QXd1d0m3Rz7uQP7JT-pg</recordid><startdate>20161001</startdate><enddate>20161001</enddate><creator>Kato, Ayaka</creator><creator>Morita, Kenji</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISN</scope><scope>ISR</scope><scope>3V.</scope><scope>7QO</scope><scope>7QP</scope><scope>7TK</scope><scope>7TM</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>LK8</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-6306-6600</orcidid></search><sort><creationdate>20161001</creationdate><title>Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation</title><author>Kato, Ayaka ; Morita, Kenji</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c830t-b64bd213096362466393b39aafbde61b8c999a860961dc5a0dfe93d45a70af513</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Biology and Life Sciences</topic><topic>Computer Simulation</topic><topic>Corpus Striatum - physiology</topic><topic>Decision Making - physiology</topic><topic>Dopamine</topic><topic>Dopamine - metabolism</topic><topic>Dopaminergic Neurons - physiology</topic><topic>Education</topic><topic>Funding</topic><topic>Health aspects</topic><topic>Humans</topic><topic>Medicine and Health Sciences</topic><topic>Mental Recall - physiology</topic><topic>Metabolism</topic><topic>Models, Neurological</topic><topic>Motivation</topic><topic>Motivation - physiology</topic><topic>Observations</topic><topic>Physical Sciences</topic><topic>R&D</topic><topic>Reinforcement (Psychology)</topic><topic>Research & development</topic><topic>Research and Analysis Methods</topic><topic>Roles</topic><topic>Social Sciences</topic><topic>Studies</topic><topic>Velocity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kato, Ayaka</creatorcontrib><creatorcontrib>Morita, Kenji</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Canada</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Calcium & Calcified Tissue Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Computing Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PLoS computational biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kato, Ayaka</au><au>Morita, Kenji</au><au>Gutkin, Boris S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation</atitle><jtitle>PLoS computational biology</jtitle><addtitle>PLoS Comput Biol</addtitle><date>2016-10-01</date><risdate>2016</risdate><volume>12</volume><issue>10</issue><spage>e1005145</spage><epage>e1005145</epage><pages>e1005145-e1005145</pages><issn>1553-7358</issn><issn>1553-734X</issn><eissn>1553-7358</eissn><abstract>It has been suggested that dopamine (DA) represents reward-prediction-error (RPE) defined in reinforcement learning and therefore DA responds to unpredicted but not predicted reward. However, recent studies have found DA response sustained towards predictable reward in tasks involving self-paced behavior, and suggested that this response represents a motivational signal. We have previously shown that RPE can sustain if there is decay/forgetting of learned-values, which can be implemented as decay of synaptic strengths storing learned-values. This account, however, did not explain the suggested link between tonic/sustained DA and motivation. In the present work, we explored the motivational effects of the value-decay in self-paced approach behavior, modeled as a series of 'Go' or 'No-Go' selections towards a goal. Through simulations, we found that the value-decay can enhance motivation, specifically, facilitate fast goal-reaching, albeit counterintuitively. Mathematical analyses revealed that underlying potential mechanisms are twofold: (1) decay-induced sustained RPE creates a gradient of 'Go' values towards a goal, and (2) value-contrasts between 'Go' and 'No-Go' are generated because while chosen values are continually updated, unchosen values simply decay. Our model provides potential explanations for the key experimental findings that suggest DA's roles in motivation: (i) slowdown of behavior by post-training blockade of DA signaling, (ii) observations that DA blockade severely impairs effortful actions to obtain rewards while largely sparing seeking of easily obtainable rewards, and (iii) relationships between the reward amount, the level of motivation reflected in the speed of behavior, and the average level of DA. These results indicate that reinforcement learning with value-decay, or forgetting, provides a parsimonious mechanistic account for the DA's roles in value-learning and motivation. Our results also suggest that when biological systems for value-learning are active even though learning has apparently converged, the systems might be in a state of dynamic equilibrium, where learning and forgetting are balanced.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>27736881</pmid><doi>10.1371/journal.pcbi.1005145</doi><orcidid>https://orcid.org/0000-0002-6306-6600</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1553-7358 |
ispartof | PLoS computational biology, 2016-10, Vol.12 (10), p.e1005145-e1005145 |
issn | 1553-7358 1553-734X 1553-7358 |
language | eng |
recordid | cdi_plos_journals_1840944030 |
source | MEDLINE; DOAJ Directory of Open Access Journals; Public Library of Science (PLoS) Journals Open Access; EZB-FREE-00999 freely available EZB journals; PubMed Central |
subjects | Biology and Life Sciences Computer Simulation Corpus Striatum - physiology Decision Making - physiology Dopamine Dopamine - metabolism Dopaminergic Neurons - physiology Education Funding Health aspects Humans Medicine and Health Sciences Mental Recall - physiology Metabolism Models, Neurological Motivation Motivation - physiology Observations Physical Sciences R&D Reinforcement (Psychology) Research & development Research and Analysis Methods Roles Social Sciences Studies Velocity |
title | Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T17%3A42%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Forgetting%20in%20Reinforcement%20Learning%20Links%20Sustained%20Dopamine%20Signals%20to%20Motivation&rft.jtitle=PLoS%20computational%20biology&rft.au=Kato,%20Ayaka&rft.date=2016-10-01&rft.volume=12&rft.issue=10&rft.spage=e1005145&rft.epage=e1005145&rft.pages=e1005145-e1005145&rft.issn=1553-7358&rft.eissn=1553-7358&rft_id=info:doi/10.1371/journal.pcbi.1005145&rft_dat=%3Cgale_plos_%3EA472228544%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1840944030&rft_id=info:pmid/27736881&rft_galeid=A472228544&rft_doaj_id=oai_doaj_org_article_4e537a01f2224c9a9da82adb503d203a&rfr_iscdi=true |