Recurrent networks endowed with structural priors explain suboptimal animal behavior

The strategies found by animals facing a new task are determined both by individual experience and by structural priors evolved to leverage the statistics of natural environments. Rats quickly learn to capitalize on the trial sequence correlations of two-alternative forced choice (2AFC) tasks after...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Current biology 2023-02, Vol.33 (4), p.622-638.e7
Hauptverfasser: Molano-Mazón, Manuel, Shao, Yuxiu, Duque, Daniel, Yang, Guangyu Robert, Ostojic, Srdjan, de la Rocha, Jaime
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 638.e7
container_issue 4
container_start_page 622
container_title Current biology
container_volume 33
creator Molano-Mazón, Manuel
Shao, Yuxiu
Duque, Daniel
Yang, Guangyu Robert
Ostojic, Srdjan
de la Rocha, Jaime
description The strategies found by animals facing a new task are determined both by individual experience and by structural priors evolved to leverage the statistics of natural environments. Rats quickly learn to capitalize on the trial sequence correlations of two-alternative forced choice (2AFC) tasks after correct trials but consistently deviate from optimal behavior after error trials. To understand this outcome-dependent gating, we first show that recurrent neural networks (RNNs) trained in the same 2AFC task outperform rats as they can readily learn to use across-trial information both after correct and error trials. We hypothesize that, although RNNs can optimize their behavior in the 2AFC task without any a priori restrictions, rats’ strategy is constrained by a structural prior adapted to a natural environment in which rewarded and non-rewarded actions provide largely asymmetric information. When pre-training RNNs in a more ecological task with more than two possible choices, networks develop a strategy by which they gate off the across-trial evidence after errors, mimicking rats’ behavior. Population analyses show that the pre-trained networks form an accurate representation of the sequence statistics independently of the outcome in the previous trial. After error trials, gating is implemented by a change in the network dynamics that temporarily decouple the categorization of the stimulus from the across-trial accumulated evidence. Our results suggest that the rats’ suboptimal behavior reflects the influence of a structural prior that reacts to errors by isolating the network decision dynamics from the context, ultimately constraining the performance in a 2AFC laboratory task. •Rats waive across-trial accumulated evidence after an error in a 2AFC task•This suboptimal behavior may reflect the influence of an adaptive structural prior•RNNs pre-trained in a more naturalistic environment mimic rat’s suboptimal behavior•After errors, pre-trained RNNs decouple trial accumulated evidence from the decision Molano-Mazon et al. show that pre-trained recurrent networks replicate the robust, suboptimal reset behavior displayed by rats in a 2AFC task presenting serial correlations. They then use population analyses to understand the network mechanisms implemented by the pre-trained RNNs and provide specific predictions to be tested experimentally.
doi_str_mv 10.1016/j.cub.2022.12.044
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2768226049</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0960982222019819</els_id><sourcerecordid>2768226049</sourcerecordid><originalsourceid>FETCH-LOGICAL-c353t-12f9ed6acb594f8708123d23571dac77238196d4993ae510a33538445d52d44f3</originalsourceid><addsrcrecordid>eNp9kEtLxDAUhYMozjj6A9xIl25a82ra4EoGXzAgiK5Dmtxixk47JumM_nujoy5dnQv3OwfOQeiU4IJgIi6WhRmbgmJKC0ILzPkempK6knk6y300xVLgXNaUTtBRCEuMCa2lOEQTJkRZcV5P0dMjmNF76GPWQ9wO_jVk0NthCzbbuviShehHE0evu2zt3eDT-33daddnYWyGdXSr9NH9tzTwojeJOUYHre4CnPzoDD3fXD_N7_LFw-39_GqRG1aymBPaSrBCm6aUvK0rXBPKLGVlRaw2VUVZTaSwXEqmoSRYs2SrUzVbUst5y2bofJe79sPbCCGqlQsGuk73MIxB0Uqk8gJzmVCyQ40fQvDQqtRmpf2HIlh9jamWKo2pvsZUhKq0YPKc_cSPzQrsn-N3vQRc7gBIJTcOvArGQW_AOg8mKju4f-I_AT7lhVI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2768226049</pqid></control><display><type>article</type><title>Recurrent networks endowed with structural priors explain suboptimal animal behavior</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><source>Cell Press Free Archives</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Molano-Mazón, Manuel ; Shao, Yuxiu ; Duque, Daniel ; Yang, Guangyu Robert ; Ostojic, Srdjan ; de la Rocha, Jaime</creator><creatorcontrib>Molano-Mazón, Manuel ; Shao, Yuxiu ; Duque, Daniel ; Yang, Guangyu Robert ; Ostojic, Srdjan ; de la Rocha, Jaime</creatorcontrib><description>The strategies found by animals facing a new task are determined both by individual experience and by structural priors evolved to leverage the statistics of natural environments. Rats quickly learn to capitalize on the trial sequence correlations of two-alternative forced choice (2AFC) tasks after correct trials but consistently deviate from optimal behavior after error trials. To understand this outcome-dependent gating, we first show that recurrent neural networks (RNNs) trained in the same 2AFC task outperform rats as they can readily learn to use across-trial information both after correct and error trials. We hypothesize that, although RNNs can optimize their behavior in the 2AFC task without any a priori restrictions, rats’ strategy is constrained by a structural prior adapted to a natural environment in which rewarded and non-rewarded actions provide largely asymmetric information. When pre-training RNNs in a more ecological task with more than two possible choices, networks develop a strategy by which they gate off the across-trial evidence after errors, mimicking rats’ behavior. Population analyses show that the pre-trained networks form an accurate representation of the sequence statistics independently of the outcome in the previous trial. After error trials, gating is implemented by a change in the network dynamics that temporarily decouple the categorization of the stimulus from the across-trial accumulated evidence. Our results suggest that the rats’ suboptimal behavior reflects the influence of a structural prior that reacts to errors by isolating the network decision dynamics from the context, ultimately constraining the performance in a 2AFC laboratory task. •Rats waive across-trial accumulated evidence after an error in a 2AFC task•This suboptimal behavior may reflect the influence of an adaptive structural prior•RNNs pre-trained in a more naturalistic environment mimic rat’s suboptimal behavior•After errors, pre-trained RNNs decouple trial accumulated evidence from the decision Molano-Mazon et al. show that pre-trained recurrent networks replicate the robust, suboptimal reset behavior displayed by rats in a 2AFC task presenting serial correlations. They then use population analyses to understand the network mechanisms implemented by the pre-trained RNNs and provide specific predictions to be tested experimentally.</description><identifier>ISSN: 0960-9822</identifier><identifier>EISSN: 1879-0445</identifier><identifier>DOI: 10.1016/j.cub.2022.12.044</identifier><identifier>PMID: 36657448</identifier><language>eng</language><publisher>England: Elsevier Inc</publisher><subject>Animals ; Behavior, Animal ; Choice Behavior ; Learning ; Neural Networks, Computer ; Rats</subject><ispartof>Current biology, 2023-02, Vol.33 (4), p.622-638.e7</ispartof><rights>2022 Elsevier Inc.</rights><rights>Copyright © 2022 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c353t-12f9ed6acb594f8708123d23571dac77238196d4993ae510a33538445d52d44f3</citedby><cites>FETCH-LOGICAL-c353t-12f9ed6acb594f8708123d23571dac77238196d4993ae510a33538445d52d44f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.cub.2022.12.044$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,777,781,3537,27905,27906,45976</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36657448$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Molano-Mazón, Manuel</creatorcontrib><creatorcontrib>Shao, Yuxiu</creatorcontrib><creatorcontrib>Duque, Daniel</creatorcontrib><creatorcontrib>Yang, Guangyu Robert</creatorcontrib><creatorcontrib>Ostojic, Srdjan</creatorcontrib><creatorcontrib>de la Rocha, Jaime</creatorcontrib><title>Recurrent networks endowed with structural priors explain suboptimal animal behavior</title><title>Current biology</title><addtitle>Curr Biol</addtitle><description>The strategies found by animals facing a new task are determined both by individual experience and by structural priors evolved to leverage the statistics of natural environments. Rats quickly learn to capitalize on the trial sequence correlations of two-alternative forced choice (2AFC) tasks after correct trials but consistently deviate from optimal behavior after error trials. To understand this outcome-dependent gating, we first show that recurrent neural networks (RNNs) trained in the same 2AFC task outperform rats as they can readily learn to use across-trial information both after correct and error trials. We hypothesize that, although RNNs can optimize their behavior in the 2AFC task without any a priori restrictions, rats’ strategy is constrained by a structural prior adapted to a natural environment in which rewarded and non-rewarded actions provide largely asymmetric information. When pre-training RNNs in a more ecological task with more than two possible choices, networks develop a strategy by which they gate off the across-trial evidence after errors, mimicking rats’ behavior. Population analyses show that the pre-trained networks form an accurate representation of the sequence statistics independently of the outcome in the previous trial. After error trials, gating is implemented by a change in the network dynamics that temporarily decouple the categorization of the stimulus from the across-trial accumulated evidence. Our results suggest that the rats’ suboptimal behavior reflects the influence of a structural prior that reacts to errors by isolating the network decision dynamics from the context, ultimately constraining the performance in a 2AFC laboratory task. •Rats waive across-trial accumulated evidence after an error in a 2AFC task•This suboptimal behavior may reflect the influence of an adaptive structural prior•RNNs pre-trained in a more naturalistic environment mimic rat’s suboptimal behavior•After errors, pre-trained RNNs decouple trial accumulated evidence from the decision Molano-Mazon et al. show that pre-trained recurrent networks replicate the robust, suboptimal reset behavior displayed by rats in a 2AFC task presenting serial correlations. They then use population analyses to understand the network mechanisms implemented by the pre-trained RNNs and provide specific predictions to be tested experimentally.</description><subject>Animals</subject><subject>Behavior, Animal</subject><subject>Choice Behavior</subject><subject>Learning</subject><subject>Neural Networks, Computer</subject><subject>Rats</subject><issn>0960-9822</issn><issn>1879-0445</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kEtLxDAUhYMozjj6A9xIl25a82ra4EoGXzAgiK5Dmtxixk47JumM_nujoy5dnQv3OwfOQeiU4IJgIi6WhRmbgmJKC0ILzPkempK6knk6y300xVLgXNaUTtBRCEuMCa2lOEQTJkRZcV5P0dMjmNF76GPWQ9wO_jVk0NthCzbbuviShehHE0evu2zt3eDT-33daddnYWyGdXSr9NH9tzTwojeJOUYHre4CnPzoDD3fXD_N7_LFw-39_GqRG1aymBPaSrBCm6aUvK0rXBPKLGVlRaw2VUVZTaSwXEqmoSRYs2SrUzVbUst5y2bofJe79sPbCCGqlQsGuk73MIxB0Uqk8gJzmVCyQ40fQvDQqtRmpf2HIlh9jamWKo2pvsZUhKq0YPKc_cSPzQrsn-N3vQRc7gBIJTcOvArGQW_AOg8mKju4f-I_AT7lhVI</recordid><startdate>20230227</startdate><enddate>20230227</enddate><creator>Molano-Mazón, Manuel</creator><creator>Shao, Yuxiu</creator><creator>Duque, Daniel</creator><creator>Yang, Guangyu Robert</creator><creator>Ostojic, Srdjan</creator><creator>de la Rocha, Jaime</creator><general>Elsevier Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20230227</creationdate><title>Recurrent networks endowed with structural priors explain suboptimal animal behavior</title><author>Molano-Mazón, Manuel ; Shao, Yuxiu ; Duque, Daniel ; Yang, Guangyu Robert ; Ostojic, Srdjan ; de la Rocha, Jaime</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c353t-12f9ed6acb594f8708123d23571dac77238196d4993ae510a33538445d52d44f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Animals</topic><topic>Behavior, Animal</topic><topic>Choice Behavior</topic><topic>Learning</topic><topic>Neural Networks, Computer</topic><topic>Rats</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Molano-Mazón, Manuel</creatorcontrib><creatorcontrib>Shao, Yuxiu</creatorcontrib><creatorcontrib>Duque, Daniel</creatorcontrib><creatorcontrib>Yang, Guangyu Robert</creatorcontrib><creatorcontrib>Ostojic, Srdjan</creatorcontrib><creatorcontrib>de la Rocha, Jaime</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Current biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Molano-Mazón, Manuel</au><au>Shao, Yuxiu</au><au>Duque, Daniel</au><au>Yang, Guangyu Robert</au><au>Ostojic, Srdjan</au><au>de la Rocha, Jaime</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Recurrent networks endowed with structural priors explain suboptimal animal behavior</atitle><jtitle>Current biology</jtitle><addtitle>Curr Biol</addtitle><date>2023-02-27</date><risdate>2023</risdate><volume>33</volume><issue>4</issue><spage>622</spage><epage>638.e7</epage><pages>622-638.e7</pages><issn>0960-9822</issn><eissn>1879-0445</eissn><abstract>The strategies found by animals facing a new task are determined both by individual experience and by structural priors evolved to leverage the statistics of natural environments. Rats quickly learn to capitalize on the trial sequence correlations of two-alternative forced choice (2AFC) tasks after correct trials but consistently deviate from optimal behavior after error trials. To understand this outcome-dependent gating, we first show that recurrent neural networks (RNNs) trained in the same 2AFC task outperform rats as they can readily learn to use across-trial information both after correct and error trials. We hypothesize that, although RNNs can optimize their behavior in the 2AFC task without any a priori restrictions, rats’ strategy is constrained by a structural prior adapted to a natural environment in which rewarded and non-rewarded actions provide largely asymmetric information. When pre-training RNNs in a more ecological task with more than two possible choices, networks develop a strategy by which they gate off the across-trial evidence after errors, mimicking rats’ behavior. Population analyses show that the pre-trained networks form an accurate representation of the sequence statistics independently of the outcome in the previous trial. After error trials, gating is implemented by a change in the network dynamics that temporarily decouple the categorization of the stimulus from the across-trial accumulated evidence. Our results suggest that the rats’ suboptimal behavior reflects the influence of a structural prior that reacts to errors by isolating the network decision dynamics from the context, ultimately constraining the performance in a 2AFC laboratory task. •Rats waive across-trial accumulated evidence after an error in a 2AFC task•This suboptimal behavior may reflect the influence of an adaptive structural prior•RNNs pre-trained in a more naturalistic environment mimic rat’s suboptimal behavior•After errors, pre-trained RNNs decouple trial accumulated evidence from the decision Molano-Mazon et al. show that pre-trained recurrent networks replicate the robust, suboptimal reset behavior displayed by rats in a 2AFC task presenting serial correlations. They then use population analyses to understand the network mechanisms implemented by the pre-trained RNNs and provide specific predictions to be tested experimentally.</abstract><cop>England</cop><pub>Elsevier Inc</pub><pmid>36657448</pmid><doi>10.1016/j.cub.2022.12.044</doi></addata></record>
fulltext fulltext
identifier ISSN: 0960-9822
ispartof Current biology, 2023-02, Vol.33 (4), p.622-638.e7
issn 0960-9822
1879-0445
language eng
recordid cdi_proquest_miscellaneous_2768226049
source MEDLINE; Elsevier ScienceDirect Journals Complete; Cell Press Free Archives; EZB-FREE-00999 freely available EZB journals
subjects Animals
Behavior, Animal
Choice Behavior
Learning
Neural Networks, Computer
Rats
title Recurrent networks endowed with structural priors explain suboptimal animal behavior
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T17%3A06%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Recurrent%20networks%20endowed%20with%20structural%20priors%20explain%20suboptimal%20animal%20behavior&rft.jtitle=Current%20biology&rft.au=Molano-Maz%C3%B3n,%20Manuel&rft.date=2023-02-27&rft.volume=33&rft.issue=4&rft.spage=622&rft.epage=638.e7&rft.pages=622-638.e7&rft.issn=0960-9822&rft.eissn=1879-0445&rft_id=info:doi/10.1016/j.cub.2022.12.044&rft_dat=%3Cproquest_cross%3E2768226049%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2768226049&rft_id=info:pmid/36657448&rft_els_id=S0960982222019819&rfr_iscdi=true