Fast exploration and learning of latent graphs with aliased observations
We consider the problem of recovering a latent graph where the observations at each node are \emph{aliased}, and transitions are stochastic. Observations are gathered by an agent traversing the graph. Aliasing means that multiple nodes emit the same observation, so the agent can not know in which no...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Lazaro-Gredilla, Miguel Deshpande, Ishan Swaminathan, Sivaramakrishnan Dave, Meet George, Dileep |
description | We consider the problem of recovering a latent graph where the observations
at each node are \emph{aliased}, and transitions are stochastic. Observations
are gathered by an agent traversing the graph. Aliasing means that multiple
nodes emit the same observation, so the agent can not know in which node it is
located. The agent needs to uncover the hidden topology as accurately as
possible and in as few steps as possible. This is equivalent to efficient
recovery of the transition probabilities of a partially observable Markov
decision process (POMDP) in which the observation probabilities are known. An
algorithm for efficiently exploring (and ultimately recovering) the latent
graph is provided. Our approach is exponentially faster than naive exploration
in a variety of challenging topologies with aliased observations while
remaining competitive with existing baselines in the unaliased regime. |
doi_str_mv | 10.48550/arxiv.2303.07397 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2303_07397</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2303_07397</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-d40655c53531ffab04095ca3ac56fd6d0c17baa52c21ecb53f9791330a98e30c3</originalsourceid><addsrcrecordid>eNotz71OwzAUhmEvHVDLBTDVN5BwnBPHzYgqSpEqsXSPTvzTWjJOZFul3D0iMH3T-0kPY08C6nYnJTxTuvtb3SBgDQp79cCOB8qF2_scpkTFT5FTNDxYStHHC58cD1RsLPySaL5m_uXLlVPwlK3h05htui1Z3rCVo5Dt4_-u2fnwet4fq9PH2_v-5VRRp1RlWuik1BIlCudohBZ6qQlJy86ZzoAWaiSSjW6E1aNE16teIAL1O4ugcc22f7cLZZiT_6T0PfyShoWEPwqhR2I</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Fast exploration and learning of latent graphs with aliased observations</title><source>arXiv.org</source><creator>Lazaro-Gredilla, Miguel ; Deshpande, Ishan ; Swaminathan, Sivaramakrishnan ; Dave, Meet ; George, Dileep</creator><creatorcontrib>Lazaro-Gredilla, Miguel ; Deshpande, Ishan ; Swaminathan, Sivaramakrishnan ; Dave, Meet ; George, Dileep</creatorcontrib><description>We consider the problem of recovering a latent graph where the observations
at each node are \emph{aliased}, and transitions are stochastic. Observations
are gathered by an agent traversing the graph. Aliasing means that multiple
nodes emit the same observation, so the agent can not know in which node it is
located. The agent needs to uncover the hidden topology as accurately as
possible and in as few steps as possible. This is equivalent to efficient
recovery of the transition probabilities of a partially observable Markov
decision process (POMDP) in which the observation probabilities are known. An
algorithm for efficiently exploring (and ultimately recovering) the latent
graph is provided. Our approach is exponentially faster than naive exploration
in a variety of challenging topologies with aliased observations while
remaining competitive with existing baselines in the unaliased regime.</description><identifier>DOI: 10.48550/arxiv.2303.07397</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Learning</subject><creationdate>2023-03</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2303.07397$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2303.07397$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Lazaro-Gredilla, Miguel</creatorcontrib><creatorcontrib>Deshpande, Ishan</creatorcontrib><creatorcontrib>Swaminathan, Sivaramakrishnan</creatorcontrib><creatorcontrib>Dave, Meet</creatorcontrib><creatorcontrib>George, Dileep</creatorcontrib><title>Fast exploration and learning of latent graphs with aliased observations</title><description>We consider the problem of recovering a latent graph where the observations
at each node are \emph{aliased}, and transitions are stochastic. Observations
are gathered by an agent traversing the graph. Aliasing means that multiple
nodes emit the same observation, so the agent can not know in which node it is
located. The agent needs to uncover the hidden topology as accurately as
possible and in as few steps as possible. This is equivalent to efficient
recovery of the transition probabilities of a partially observable Markov
decision process (POMDP) in which the observation probabilities are known. An
algorithm for efficiently exploring (and ultimately recovering) the latent
graph is provided. Our approach is exponentially faster than naive exploration
in a variety of challenging topologies with aliased observations while
remaining competitive with existing baselines in the unaliased regime.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz71OwzAUhmEvHVDLBTDVN5BwnBPHzYgqSpEqsXSPTvzTWjJOZFul3D0iMH3T-0kPY08C6nYnJTxTuvtb3SBgDQp79cCOB8qF2_scpkTFT5FTNDxYStHHC58cD1RsLPySaL5m_uXLlVPwlK3h05htui1Z3rCVo5Dt4_-u2fnwet4fq9PH2_v-5VRRp1RlWuik1BIlCudohBZ6qQlJy86ZzoAWaiSSjW6E1aNE16teIAL1O4ugcc22f7cLZZiT_6T0PfyShoWEPwqhR2I</recordid><startdate>20230313</startdate><enddate>20230313</enddate><creator>Lazaro-Gredilla, Miguel</creator><creator>Deshpande, Ishan</creator><creator>Swaminathan, Sivaramakrishnan</creator><creator>Dave, Meet</creator><creator>George, Dileep</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230313</creationdate><title>Fast exploration and learning of latent graphs with aliased observations</title><author>Lazaro-Gredilla, Miguel ; Deshpande, Ishan ; Swaminathan, Sivaramakrishnan ; Dave, Meet ; George, Dileep</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-d40655c53531ffab04095ca3ac56fd6d0c17baa52c21ecb53f9791330a98e30c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Lazaro-Gredilla, Miguel</creatorcontrib><creatorcontrib>Deshpande, Ishan</creatorcontrib><creatorcontrib>Swaminathan, Sivaramakrishnan</creatorcontrib><creatorcontrib>Dave, Meet</creatorcontrib><creatorcontrib>George, Dileep</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Lazaro-Gredilla, Miguel</au><au>Deshpande, Ishan</au><au>Swaminathan, Sivaramakrishnan</au><au>Dave, Meet</au><au>George, Dileep</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Fast exploration and learning of latent graphs with aliased observations</atitle><date>2023-03-13</date><risdate>2023</risdate><abstract>We consider the problem of recovering a latent graph where the observations
at each node are \emph{aliased}, and transitions are stochastic. Observations
are gathered by an agent traversing the graph. Aliasing means that multiple
nodes emit the same observation, so the agent can not know in which node it is
located. The agent needs to uncover the hidden topology as accurately as
possible and in as few steps as possible. This is equivalent to efficient
recovery of the transition probabilities of a partially observable Markov
decision process (POMDP) in which the observation probabilities are known. An
algorithm for efficiently exploring (and ultimately recovering) the latent
graph is provided. Our approach is exponentially faster than naive exploration
in a variety of challenging topologies with aliased observations while
remaining competitive with existing baselines in the unaliased regime.</abstract><doi>10.48550/arxiv.2303.07397</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2303.07397 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2303_07397 |
source | arXiv.org |
subjects | Computer Science - Artificial Intelligence Computer Science - Learning |
title | Fast exploration and learning of latent graphs with aliased observations |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T00%3A49%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Fast%20exploration%20and%20learning%20of%20latent%20graphs%20with%20aliased%20observations&rft.au=Lazaro-Gredilla,%20Miguel&rft.date=2023-03-13&rft_id=info:doi/10.48550/arxiv.2303.07397&rft_dat=%3Carxiv_GOX%3E2303_07397%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |