Fast exploration and learning of latent graphs with aliased observations

We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which no...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Lazaro-Gredilla, Miguel, Deshpande, Ishan, Swaminathan, Sivaramakrishnan, Dave, Meet, George, Dileep
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Lazaro-Gredilla, Miguel
Deshpande, Ishan
Swaminathan, Sivaramakrishnan
Dave, Meet
George, Dileep
description We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which node it is located. The agent needs to uncover the hidden topology as accurately as possible and in as few steps as possible. This is equivalent to efficient recovery of the transition probabilities of a partially observable Markov decision process (POMDP) in which the observation probabilities are known. An algorithm for efficiently exploring (and ultimately recovering) the latent graph is provided. Our approach is exponentially faster than naive exploration in a variety of challenging topologies with aliased observations while remaining competitive with existing baselines in the unaliased regime.
doi_str_mv 10.48550/arxiv.2303.07397
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2303_07397</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2303_07397</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-d40655c53531ffab04095ca3ac56fd6d0c17baa52c21ecb53f9791330a98e30c3</originalsourceid><addsrcrecordid>eNotz71OwzAUhmEvHVDLBTDVN5BwnBPHzYgqSpEqsXSPTvzTWjJOZFul3D0iMH3T-0kPY08C6nYnJTxTuvtb3SBgDQp79cCOB8qF2_scpkTFT5FTNDxYStHHC58cD1RsLPySaL5m_uXLlVPwlK3h05htui1Z3rCVo5Dt4_-u2fnwet4fq9PH2_v-5VRRp1RlWuik1BIlCudohBZ6qQlJy86ZzoAWaiSSjW6E1aNE16teIAL1O4ugcc22f7cLZZiT_6T0PfyShoWEPwqhR2I</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Fast exploration and learning of latent graphs with aliased observations</title><source>arXiv.org</source><creator>Lazaro-Gredilla, Miguel ; Deshpande, Ishan ; Swaminathan, Sivaramakrishnan ; Dave, Meet ; George, Dileep</creator><creatorcontrib>Lazaro-Gredilla, Miguel ; Deshpande, Ishan ; Swaminathan, Sivaramakrishnan ; Dave, Meet ; George, Dileep</creatorcontrib><description>We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which node it is located. The agent needs to uncover the hidden topology as accurately as possible and in as few steps as possible. This is equivalent to efficient recovery of the transition probabilities of a partially observable Markov decision process (POMDP) in which the observation probabilities are known. An algorithm for efficiently exploring (and ultimately recovering) the latent graph is provided. Our approach is exponentially faster than naive exploration in a variety of challenging topologies with aliased observations while remaining competitive with existing baselines in the unaliased regime.</description><identifier>DOI: 10.48550/arxiv.2303.07397</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Learning</subject><creationdate>2023-03</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2303.07397$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2303.07397$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Lazaro-Gredilla, Miguel</creatorcontrib><creatorcontrib>Deshpande, Ishan</creatorcontrib><creatorcontrib>Swaminathan, Sivaramakrishnan</creatorcontrib><creatorcontrib>Dave, Meet</creatorcontrib><creatorcontrib>George, Dileep</creatorcontrib><title>Fast exploration and learning of latent graphs with aliased observations</title><description>We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which node it is located. The agent needs to uncover the hidden topology as accurately as possible and in as few steps as possible. This is equivalent to efficient recovery of the transition probabilities of a partially observable Markov decision process (POMDP) in which the observation probabilities are known. An algorithm for efficiently exploring (and ultimately recovering) the latent graph is provided. Our approach is exponentially faster than naive exploration in a variety of challenging topologies with aliased observations while remaining competitive with existing baselines in the unaliased regime.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz71OwzAUhmEvHVDLBTDVN5BwnBPHzYgqSpEqsXSPTvzTWjJOZFul3D0iMH3T-0kPY08C6nYnJTxTuvtb3SBgDQp79cCOB8qF2_scpkTFT5FTNDxYStHHC58cD1RsLPySaL5m_uXLlVPwlK3h05htui1Z3rCVo5Dt4_-u2fnwet4fq9PH2_v-5VRRp1RlWuik1BIlCudohBZ6qQlJy86ZzoAWaiSSjW6E1aNE16teIAL1O4ugcc22f7cLZZiT_6T0PfyShoWEPwqhR2I</recordid><startdate>20230313</startdate><enddate>20230313</enddate><creator>Lazaro-Gredilla, Miguel</creator><creator>Deshpande, Ishan</creator><creator>Swaminathan, Sivaramakrishnan</creator><creator>Dave, Meet</creator><creator>George, Dileep</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230313</creationdate><title>Fast exploration and learning of latent graphs with aliased observations</title><author>Lazaro-Gredilla, Miguel ; Deshpande, Ishan ; Swaminathan, Sivaramakrishnan ; Dave, Meet ; George, Dileep</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-d40655c53531ffab04095ca3ac56fd6d0c17baa52c21ecb53f9791330a98e30c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Lazaro-Gredilla, Miguel</creatorcontrib><creatorcontrib>Deshpande, Ishan</creatorcontrib><creatorcontrib>Swaminathan, Sivaramakrishnan</creatorcontrib><creatorcontrib>Dave, Meet</creatorcontrib><creatorcontrib>George, Dileep</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Lazaro-Gredilla, Miguel</au><au>Deshpande, Ishan</au><au>Swaminathan, Sivaramakrishnan</au><au>Dave, Meet</au><au>George, Dileep</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Fast exploration and learning of latent graphs with aliased observations</atitle><date>2023-03-13</date><risdate>2023</risdate><abstract>We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which node it is located. The agent needs to uncover the hidden topology as accurately as possible and in as few steps as possible. This is equivalent to efficient recovery of the transition probabilities of a partially observable Markov decision process (POMDP) in which the observation probabilities are known. An algorithm for efficiently exploring (and ultimately recovering) the latent graph is provided. Our approach is exponentially faster than naive exploration in a variety of challenging topologies with aliased observations while remaining competitive with existing baselines in the unaliased regime.</abstract><doi>10.48550/arxiv.2303.07397</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2303.07397
ispartof
issn
language eng
recordid cdi_arxiv_primary_2303_07397
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Learning
title Fast exploration and learning of latent graphs with aliased observations
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T00%3A49%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Fast%20exploration%20and%20learning%20of%20latent%20graphs%20with%20aliased%20observations&rft.au=Lazaro-Gredilla,%20Miguel&rft.date=2023-03-13&rft_id=info:doi/10.48550/arxiv.2303.07397&rft_dat=%3Carxiv_GOX%3E2303_07397%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true