Active World Model Learning with Progress Curiosity

World models are self-supervised predictive models of how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Kim, Kuno, Sano, Megumi, De Freitas, Julian, Haber, Nick, Yamins, Daniel
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Kim, Kuno
Sano, Megumi
De Freitas, Julian
Haber, Nick
Yamins, Daniel
description World models are self-supervised predictive models of how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of the behavioral patterns of other agents. In this work, we study how to design such a curiosity-driven Active World Model Learning (AWML) system. To do so, we construct a curious agent building world models while visually exploring a 3D physical environment rich with distillations of representative real-world agents. We propose an AWML system driven by $\gamma$-Progress: a scalable and effective learning progress-based curiosity signal. We show that $\gamma$-Progress naturally gives rise to an exploration policy that directs attention to complex but learnable dynamics in a balanced manner, thus overcoming the "white noise problem". As a result, our $\gamma$-Progress-driven controller achieves significantly higher AWML performance than baseline controllers equipped with state-of-the-art exploration strategies such as Random Network Distillation and Model Disagreement.
doi_str_mv 10.48550/arxiv.2007.07853
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2007_07853</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2007_07853</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-92f2806ad90b0744de5e3f6853a432be6239930d415692706d9e1fc6377fce5f3</originalsourceid><addsrcrecordid>eNotzr1uwjAUhmEvHSrgApjqG0h64uOfeERRC5WC6IDUMTLxMbUUSOWktNw9BTp92_s9jM0LyGWpFDy79BtPuQAwOZhS4SPDRTvGE_GPPnWer3tPHa_JpWM87vlPHD_5e-r3iYaBV98p9kMcz1P2EFw30Ox_J2z7-rKtVlm9Wb5Vizpz2mBmRRAlaOct7MBI6UkRBv336iSKHWmB1iJ4WShthQHtLRWh1WhMaEkFnLCne_ambr5SPLh0bq765qbHC8OpPjM</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Active World Model Learning with Progress Curiosity</title><source>arXiv.org</source><creator>Kim, Kuno ; Sano, Megumi ; De Freitas, Julian ; Haber, Nick ; Yamins, Daniel</creator><creatorcontrib>Kim, Kuno ; Sano, Megumi ; De Freitas, Julian ; Haber, Nick ; Yamins, Daniel</creatorcontrib><description>World models are self-supervised predictive models of how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of the behavioral patterns of other agents. In this work, we study how to design such a curiosity-driven Active World Model Learning (AWML) system. To do so, we construct a curious agent building world models while visually exploring a 3D physical environment rich with distillations of representative real-world agents. We propose an AWML system driven by $\gamma$-Progress: a scalable and effective learning progress-based curiosity signal. We show that $\gamma$-Progress naturally gives rise to an exploration policy that directs attention to complex but learnable dynamics in a balanced manner, thus overcoming the "white noise problem". As a result, our $\gamma$-Progress-driven controller achieves significantly higher AWML performance than baseline controllers equipped with state-of-the-art exploration strategies such as Random Network Distillation and Model Disagreement.</description><identifier>DOI: 10.48550/arxiv.2007.07853</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Learning ; Statistics - Machine Learning</subject><creationdate>2020-07</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,778,883</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2007.07853$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2007.07853$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Kim, Kuno</creatorcontrib><creatorcontrib>Sano, Megumi</creatorcontrib><creatorcontrib>De Freitas, Julian</creatorcontrib><creatorcontrib>Haber, Nick</creatorcontrib><creatorcontrib>Yamins, Daniel</creatorcontrib><title>Active World Model Learning with Progress Curiosity</title><description>World models are self-supervised predictive models of how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of the behavioral patterns of other agents. In this work, we study how to design such a curiosity-driven Active World Model Learning (AWML) system. To do so, we construct a curious agent building world models while visually exploring a 3D physical environment rich with distillations of representative real-world agents. We propose an AWML system driven by $\gamma$-Progress: a scalable and effective learning progress-based curiosity signal. We show that $\gamma$-Progress naturally gives rise to an exploration policy that directs attention to complex but learnable dynamics in a balanced manner, thus overcoming the "white noise problem". As a result, our $\gamma$-Progress-driven controller achieves significantly higher AWML performance than baseline controllers equipped with state-of-the-art exploration strategies such as Random Network Distillation and Model Disagreement.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Learning</subject><subject>Statistics - Machine Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzr1uwjAUhmEvHSrgApjqG0h64uOfeERRC5WC6IDUMTLxMbUUSOWktNw9BTp92_s9jM0LyGWpFDy79BtPuQAwOZhS4SPDRTvGE_GPPnWer3tPHa_JpWM87vlPHD_5e-r3iYaBV98p9kMcz1P2EFw30Ox_J2z7-rKtVlm9Wb5Vizpz2mBmRRAlaOct7MBI6UkRBv336iSKHWmB1iJ4WShthQHtLRWh1WhMaEkFnLCne_ambr5SPLh0bq765qbHC8OpPjM</recordid><startdate>20200715</startdate><enddate>20200715</enddate><creator>Kim, Kuno</creator><creator>Sano, Megumi</creator><creator>De Freitas, Julian</creator><creator>Haber, Nick</creator><creator>Yamins, Daniel</creator><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20200715</creationdate><title>Active World Model Learning with Progress Curiosity</title><author>Kim, Kuno ; Sano, Megumi ; De Freitas, Julian ; Haber, Nick ; Yamins, Daniel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-92f2806ad90b0744de5e3f6853a432be6239930d415692706d9e1fc6377fce5f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Learning</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Kim, Kuno</creatorcontrib><creatorcontrib>Sano, Megumi</creatorcontrib><creatorcontrib>De Freitas, Julian</creatorcontrib><creatorcontrib>Haber, Nick</creatorcontrib><creatorcontrib>Yamins, Daniel</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Kim, Kuno</au><au>Sano, Megumi</au><au>De Freitas, Julian</au><au>Haber, Nick</au><au>Yamins, Daniel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Active World Model Learning with Progress Curiosity</atitle><date>2020-07-15</date><risdate>2020</risdate><abstract>World models are self-supervised predictive models of how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of the behavioral patterns of other agents. In this work, we study how to design such a curiosity-driven Active World Model Learning (AWML) system. To do so, we construct a curious agent building world models while visually exploring a 3D physical environment rich with distillations of representative real-world agents. We propose an AWML system driven by $\gamma$-Progress: a scalable and effective learning progress-based curiosity signal. We show that $\gamma$-Progress naturally gives rise to an exploration policy that directs attention to complex but learnable dynamics in a balanced manner, thus overcoming the "white noise problem". As a result, our $\gamma$-Progress-driven controller achieves significantly higher AWML performance than baseline controllers equipped with state-of-the-art exploration strategies such as Random Network Distillation and Model Disagreement.</abstract><doi>10.48550/arxiv.2007.07853</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2007.07853
ispartof
issn
language eng
recordid cdi_arxiv_primary_2007_07853
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Learning
Statistics - Machine Learning
title Active World Model Learning with Progress Curiosity
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T06%3A54%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Active%20World%20Model%20Learning%20with%20Progress%20Curiosity&rft.au=Kim,%20Kuno&rft.date=2020-07-15&rft_id=info:doi/10.48550/arxiv.2007.07853&rft_dat=%3Carxiv_GOX%3E2007_07853%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true