Fast exploration and learning of latent graphs with aliased observations

We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which no...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Lazaro-Gredilla, Miguel, Deshpande, Ishan, Swaminathan, Sivaramakrishnan, Dave, Meet, George, Dileep
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Artificial Intelligence Computer Science - Learning
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Lazaro-Gredilla, Miguel Deshpande, Ishan Swaminathan, Sivaramakrishnan Dave, Meet George, Dileep
description	We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which node it is located. The agent needs to uncover the hidden topology as accurately as possible and in as few steps as possible. This is equivalent to efficient recovery of the transition probabilities of a partially observable Markov decision process (POMDP) in which the observation probabilities are known. An algorithm for efficiently exploring (and ultimately recovering) the latent graph is provided. Our approach is exponentially faster than naive exploration in a variety of challenging topologies with aliased observations while remaining competitive with existing baselines in the unaliased regime.
doi_str_mv	10.48550/arxiv.2303.07397
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2303_07397</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2303_07397</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-d40655c53531ffab04095ca3ac56fd6d0c17baa52c21ecb53f9791330a98e30c3</originalsourceid><addsrcrecordid>eNotz71OwzAUhmEvHVDLBTDVN5BwnBPHzYgqSpEqsXSPTvzTWjJOZFul3D0iMH3T-0kPY08C6nYnJTxTuvtb3SBgDQp79cCOB8qF2_scpkTFT5FTNDxYStHHC58cD1RsLPySaL5m_uXLlVPwlK3h05htui1Z3rCVo5Dt4_-u2fnwet4fq9PH2_v-5VRRp1RlWuik1BIlCudohBZ6qQlJy86ZzoAWaiSSjW6E1aNE16teIAL1O4ugcc22f7cLZZiT_6T0PfyShoWEPwqhR2I</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Fast exploration and learning of latent graphs with aliased observations</title><source>arXiv.org</source><creator>Lazaro-Gredilla, Miguel ; Deshpande, Ishan ; Swaminathan, Sivaramakrishnan ; Dave, Meet ; George, Dileep</creator><creatorcontrib>Lazaro-Gredilla, Miguel ; Deshpande, Ishan ; Swaminathan, Sivaramakrishnan ; Dave, Meet ; George, Dileep</creatorcontrib><description>We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which node it is located. The agent needs to uncover the hidden topology as accurately as possible and in as few steps as possible. This is equivalent to efficient recovery of the transition probabilities of a partially observable Markov decision process (POMDP) in which the observation probabilities are known. An algorithm for efficiently exploring (and ultimately recovering) the latent graph is provided. Our approach is exponentially faster than naive exploration in a variety of challenging topologies with aliased observations while remaining competitive with existing baselines in the unaliased regime.</description><identifier>DOI: 10.48550/arxiv.2303.07397</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Learning</subject><creationdate>2023-03</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2303.07397$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2303.07397$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Lazaro-Gredilla, Miguel</creatorcontrib><creatorcontrib>Deshpande, Ishan</creatorcontrib><creatorcontrib>Swaminathan, Sivaramakrishnan</creatorcontrib><creatorcontrib>Dave, Meet</creatorcontrib><creatorcontrib>George, Dileep</creatorcontrib><title>Fast exploration and learning of latent graphs with aliased observations</title><description>We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which node it is located. The agent needs to uncover the hidden topology as accurately as possible and in as few steps as possible. This is equivalent to efficient recovery of the transition probabilities of a partially observable Markov decision process (POMDP) in which the observation probabilities are known. An algorithm for efficiently exploring (and ultimately recovering) the latent graph is provided. Our approach is exponentially faster than naive exploration in a variety of challenging topologies with aliased observations while remaining competitive with existing baselines in the unaliased regime.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz71OwzAUhmEvHVDLBTDVN5BwnBPHzYgqSpEqsXSPTvzTWjJOZFul3D0iMH3T-0kPY08C6nYnJTxTuvtb3SBgDQp79cCOB8qF2_scpkTFT5FTNDxYStHHC58cD1RsLPySaL5m_uXLlVPwlK3h05htui1Z3rCVo5Dt4_-u2fnwet4fq9PH2_v-5VRRp1RlWuik1BIlCudohBZ6qQlJy86ZzoAWaiSSjW6E1aNE16teIAL1O4ugcc22f7cLZZiT_6T0PfyShoWEPwqhR2I</recordid><startdate>20230313</startdate><enddate>20230313</enddate><creator>Lazaro-Gredilla, Miguel</creator><creator>Deshpande, Ishan</creator><creator>Swaminathan, Sivaramakrishnan</creator><creator>Dave, Meet</creator><creator>George, Dileep</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230313</creationdate><title>Fast exploration and learning of latent graphs with aliased observations</title><author>Lazaro-Gredilla, Miguel ; Deshpande, Ishan ; Swaminathan, Sivaramakrishnan ; Dave, Meet ; George, Dileep</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-d40655c53531ffab04095ca3ac56fd6d0c17baa52c21ecb53f9791330a98e30c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Lazaro-Gredilla, Miguel</creatorcontrib><creatorcontrib>Deshpande, Ishan</creatorcontrib><creatorcontrib>Swaminathan, Sivaramakrishnan</creatorcontrib><creatorcontrib>Dave, Meet</creatorcontrib><creatorcontrib>George, Dileep</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Lazaro-Gredilla, Miguel</au><au>Deshpande, Ishan</au><au>Swaminathan, Sivaramakrishnan</au><au>Dave, Meet</au><au>George, Dileep</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Fast exploration and learning of latent graphs with aliased observations</atitle><date>2023-03-13</date><risdate>2023</risdate><abstract>We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which node it is located. The agent needs to uncover the hidden topology as accurately as possible and in as few steps as possible. This is equivalent to efficient recovery of the transition probabilities of a partially observable Markov decision process (POMDP) in which the observation probabilities are known. An algorithm for efficiently exploring (and ultimately recovering) the latent graph is provided. Our approach is exponentially faster than naive exploration in a variety of challenging topologies with aliased observations while remaining competitive with existing baselines in the unaliased regime.</abstract><doi>10.48550/arxiv.2303.07397</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2303.07397
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2303_07397
source	arXiv.org
subjects	Computer Science - Artificial Intelligence Computer Science - Learning
title	Fast exploration and learning of latent graphs with aliased observations
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T00%3A49%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Fast%20exploration%20and%20learning%20of%20latent%20graphs%20with%20aliased%20observations&rft.au=Lazaro-Gredilla,%20Miguel&rft.date=2023-03-13&rft_id=info:doi/10.48550/arxiv.2303.07397&rft_dat=%3Carxiv_GOX%3E2303_07397%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true