C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals

When facing the problem of autonomously learning to achieve multiple goals, researchers typically focus on problems where each goal can be solved using just one policy. However, in environments presenting different contexts, the same goal might need different skills to be solved. These situations po...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on cognitive and developmental systems 2023-03, Vol.15 (1), p.210-222
Hauptverfasser:	Santucci, Vieri Giuliano, Montella, Davide, Baldassarre, Gianluca
Format:	Artikel
Sprache:	eng
Schlagworte:	Autonomous robotics context-dependent goals developmental robotics Face recognition intrinsic motivations (IMs) multitask reinforcement learning (RL) Multitasking Policies Reinforcement learning Robot sensing systems Robots Task analysis Training
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	222
container_issue	1
container_start_page	210
container_title	IEEE transactions on cognitive and developmental systems
container_volume	15
creator	Santucci, Vieri Giuliano Montella, Davide Baldassarre, Gianluca
description	When facing the problem of autonomously learning to achieve multiple goals, researchers typically focus on problems where each goal can be solved using just one policy. However, in environments presenting different contexts, the same goal might need different skills to be solved. These situations pose two challenges: 1) recognize which are the contexts that need different policies to perform the goals and 2) learn the policies to accomplish the same goal in the identified relevant contexts. These two challenges are even harder if faced within an open-ended learning framework where potentially an agent has no information on the environment, possibly not even about the goals it can pursue. We propose a novel robotic architecture, contextual GRAIL (C-GRAIL), that solves these challenges in an integrated fashion. The architecture is able to autonomously detect new relevant contexts and ignore irrelevant ones, on the basis of the decrease of the expected performance for a given goal. Moreover, C-GRAIL can quickly learn the policies for new contexts leveraging on transfer learning techniques. The architecture is tested in a simulated robotic environment involving a robot that autonomously discovers and learns to reach relevant target objects in the presence of multiple obstacles generating several different contexts.
doi_str_mv	10.1109/TCDS.2022.3152081
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2784549686</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9714489</ieee_id><sourcerecordid>2784549686</sourcerecordid><originalsourceid>FETCH-LOGICAL-c336t-9227ae81a90860f0bc12c6bd1ccac56358c6e217405d47b41ecc6f75f0c2bd293</originalsourceid><addsrcrecordid>eNo9kE1Lw0AQhhdRsNT-APES8Jy6H8l-eCupxkJEqPW8JJuJpLS7cTcB_fcmtPT0zuF5Z5gHoXuCl4Rg9bTL1p9LiildMpJSLMkVmlEmVCwVU9eXmeJbtAhhjzEmnAmZiBnaZnG-XW2K52g19M66oxtCtIXWNs4bOILtowJKb1v7Hbkmeh8OfdsdICptHWXO9vDbx2vowNYTmrvyEO7QTTMGLM45R1-vL7vsLS4-8k22KmLDGO9jRakoQZJSYclxgytDqOFVTYwpTcpZKg0HSkSC0zoRVULAGN6ItMGGVjVVbI4eT3s7734GCL3eu8Hb8aSm43NporjkI0VOlPEuBA-N7nx7LP2fJlhP9vRkT0_29Nne2Hk4dVoAuPBKkCQZjf4DLupqMg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2784549686</pqid></control><display><type>article</type><title>C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals</title><source>IEEE Electronic Library (IEL)</source><creator>Santucci, Vieri Giuliano ; Montella, Davide ; Baldassarre, Gianluca</creator><creatorcontrib>Santucci, Vieri Giuliano ; Montella, Davide ; Baldassarre, Gianluca</creatorcontrib><description>When facing the problem of autonomously learning to achieve multiple goals, researchers typically focus on problems where each goal can be solved using just one policy. However, in environments presenting different contexts, the same goal might need different skills to be solved. These situations pose two challenges: 1) recognize which are the contexts that need different policies to perform the goals and 2) learn the policies to accomplish the same goal in the identified relevant contexts. These two challenges are even harder if faced within an open-ended learning framework where potentially an agent has no information on the environment, possibly not even about the goals it can pursue. We propose a novel robotic architecture, contextual GRAIL (C-GRAIL), that solves these challenges in an integrated fashion. The architecture is able to autonomously detect new relevant contexts and ignore irrelevant ones, on the basis of the decrease of the expected performance for a given goal. Moreover, C-GRAIL can quickly learn the policies for new contexts leveraging on transfer learning techniques. The architecture is tested in a simulated robotic environment involving a robot that autonomously discovers and learns to reach relevant target objects in the presence of multiple obstacles generating several different contexts.</description><identifier>ISSN: 2379-8920</identifier><identifier>EISSN: 2379-8939</identifier><identifier>DOI: 10.1109/TCDS.2022.3152081</identifier><identifier>CODEN: ITCDA4</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Autonomous robotics ; context-dependent goals ; developmental robotics ; Face recognition ; intrinsic motivations (IMs) ; multitask reinforcement learning (RL) ; Multitasking ; Policies ; Reinforcement learning ; Robot sensing systems ; Robots ; Task analysis ; Training</subject><ispartof>IEEE transactions on cognitive and developmental systems, 2023-03, Vol.15 (1), p.210-222</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c336t-9227ae81a90860f0bc12c6bd1ccac56358c6e217405d47b41ecc6f75f0c2bd293</citedby><cites>FETCH-LOGICAL-c336t-9227ae81a90860f0bc12c6bd1ccac56358c6e217405d47b41ecc6f75f0c2bd293</cites><orcidid>0000-0002-8748-9632 ; 0000-0002-1277-4447</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9714489$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,796,27923,27924,54757</link.rule.ids></links><search><creatorcontrib>Santucci, Vieri Giuliano</creatorcontrib><creatorcontrib>Montella, Davide</creatorcontrib><creatorcontrib>Baldassarre, Gianluca</creatorcontrib><title>C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals</title><title>IEEE transactions on cognitive and developmental systems</title><addtitle>TCDS</addtitle><description>When facing the problem of autonomously learning to achieve multiple goals, researchers typically focus on problems where each goal can be solved using just one policy. However, in environments presenting different contexts, the same goal might need different skills to be solved. These situations pose two challenges: 1) recognize which are the contexts that need different policies to perform the goals and 2) learn the policies to accomplish the same goal in the identified relevant contexts. These two challenges are even harder if faced within an open-ended learning framework where potentially an agent has no information on the environment, possibly not even about the goals it can pursue. We propose a novel robotic architecture, contextual GRAIL (C-GRAIL), that solves these challenges in an integrated fashion. The architecture is able to autonomously detect new relevant contexts and ignore irrelevant ones, on the basis of the decrease of the expected performance for a given goal. Moreover, C-GRAIL can quickly learn the policies for new contexts leveraging on transfer learning techniques. The architecture is tested in a simulated robotic environment involving a robot that autonomously discovers and learns to reach relevant target objects in the presence of multiple obstacles generating several different contexts.</description><subject>Autonomous robotics</subject><subject>context-dependent goals</subject><subject>developmental robotics</subject><subject>Face recognition</subject><subject>intrinsic motivations (IMs)</subject><subject>multitask reinforcement learning (RL)</subject><subject>Multitasking</subject><subject>Policies</subject><subject>Reinforcement learning</subject><subject>Robot sensing systems</subject><subject>Robots</subject><subject>Task analysis</subject><subject>Training</subject><issn>2379-8920</issn><issn>2379-8939</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><recordid>eNo9kE1Lw0AQhhdRsNT-APES8Jy6H8l-eCupxkJEqPW8JJuJpLS7cTcB_fcmtPT0zuF5Z5gHoXuCl4Rg9bTL1p9LiildMpJSLMkVmlEmVCwVU9eXmeJbtAhhjzEmnAmZiBnaZnG-XW2K52g19M66oxtCtIXWNs4bOILtowJKb1v7Hbkmeh8OfdsdICptHWXO9vDbx2vowNYTmrvyEO7QTTMGLM45R1-vL7vsLS4-8k22KmLDGO9jRakoQZJSYclxgytDqOFVTYwpTcpZKg0HSkSC0zoRVULAGN6ItMGGVjVVbI4eT3s7734GCL3eu8Hb8aSm43NporjkI0VOlPEuBA-N7nx7LP2fJlhP9vRkT0_29Nne2Hk4dVoAuPBKkCQZjf4DLupqMg</recordid><startdate>20230301</startdate><enddate>20230301</enddate><creator>Santucci, Vieri Giuliano</creator><creator>Montella, Davide</creator><creator>Baldassarre, Gianluca</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-8748-9632</orcidid><orcidid>https://orcid.org/0000-0002-1277-4447</orcidid></search><sort><creationdate>20230301</creationdate><title>C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals</title><author>Santucci, Vieri Giuliano ; Montella, Davide ; Baldassarre, Gianluca</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c336t-9227ae81a90860f0bc12c6bd1ccac56358c6e217405d47b41ecc6f75f0c2bd293</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Autonomous robotics</topic><topic>context-dependent goals</topic><topic>developmental robotics</topic><topic>Face recognition</topic><topic>intrinsic motivations (IMs)</topic><topic>multitask reinforcement learning (RL)</topic><topic>Multitasking</topic><topic>Policies</topic><topic>Reinforcement learning</topic><topic>Robot sensing systems</topic><topic>Robots</topic><topic>Task analysis</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>Santucci, Vieri Giuliano</creatorcontrib><creatorcontrib>Montella, Davide</creatorcontrib><creatorcontrib>Baldassarre, Gianluca</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on cognitive and developmental systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Santucci, Vieri Giuliano</au><au>Montella, Davide</au><au>Baldassarre, Gianluca</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals</atitle><jtitle>IEEE transactions on cognitive and developmental systems</jtitle><stitle>TCDS</stitle><date>2023-03-01</date><risdate>2023</risdate><volume>15</volume><issue>1</issue><spage>210</spage><epage>222</epage><pages>210-222</pages><issn>2379-8920</issn><eissn>2379-8939</eissn><coden>ITCDA4</coden><abstract>When facing the problem of autonomously learning to achieve multiple goals, researchers typically focus on problems where each goal can be solved using just one policy. However, in environments presenting different contexts, the same goal might need different skills to be solved. These situations pose two challenges: 1) recognize which are the contexts that need different policies to perform the goals and 2) learn the policies to accomplish the same goal in the identified relevant contexts. These two challenges are even harder if faced within an open-ended learning framework where potentially an agent has no information on the environment, possibly not even about the goals it can pursue. We propose a novel robotic architecture, contextual GRAIL (C-GRAIL), that solves these challenges in an integrated fashion. The architecture is able to autonomously detect new relevant contexts and ignore irrelevant ones, on the basis of the decrease of the expected performance for a given goal. Moreover, C-GRAIL can quickly learn the policies for new contexts leveraging on transfer learning techniques. The architecture is tested in a simulated robotic environment involving a robot that autonomously discovers and learns to reach relevant target objects in the presence of multiple obstacles generating several different contexts.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TCDS.2022.3152081</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-8748-9632</orcidid><orcidid>https://orcid.org/0000-0002-1277-4447</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2379-8920
ispartof	IEEE transactions on cognitive and developmental systems, 2023-03, Vol.15 (1), p.210-222
issn	2379-8920 2379-8939
language	eng
recordid	cdi_proquest_journals_2784549686
source	IEEE Electronic Library (IEL)
subjects	Autonomous robotics context-dependent goals developmental robotics Face recognition intrinsic motivations (IMs) multitask reinforcement learning (RL) Multitasking Policies Reinforcement learning Robot sensing systems Robots Task analysis Training
title	C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T06%3A27%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=C-GRAIL:%20Autonomous%20Reinforcement%20Learning%20of%20Multiple%20and%20Context-Dependent%20Goals&rft.jtitle=IEEE%20transactions%20on%20cognitive%20and%20developmental%20systems&rft.au=Santucci,%20Vieri%20Giuliano&rft.date=2023-03-01&rft.volume=15&rft.issue=1&rft.spage=210&rft.epage=222&rft.pages=210-222&rft.issn=2379-8920&rft.eissn=2379-8939&rft.coden=ITCDA4&rft_id=info:doi/10.1109/TCDS.2022.3152081&rft_dat=%3Cproquest_cross%3E2784549686%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2784549686&rft_id=info:pmid/&rft_ieee_id=9714489&rfr_iscdi=true