Active World Model Learning with Progress Curiosity

World models are self-supervised predictive models of how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Kim, Kuno, Sano, Megumi, De Freitas, Julian, Haber, Nick, Yamins, Daniel
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Artificial Intelligence Computer Science - Learning Statistics - Machine Learning
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Kim, Kuno Sano, Megumi De Freitas, Julian Haber, Nick Yamins, Daniel
description	World models are self-supervised predictive models of how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of the behavioral patterns of other agents. In this work, we study how to design such a curiosity-driven Active World Model Learning (AWML) system. To do so, we construct a curious agent building world models while visually exploring a 3D physical environment rich with distillations of representative real-world agents. We propose an AWML system driven by $\gamma$-Progress: a scalable and effective learning progress-based curiosity signal. We show that $\gamma$-Progress naturally gives rise to an exploration policy that directs attention to complex but learnable dynamics in a balanced manner, thus overcoming the "white noise problem". As a result, our $\gamma$-Progress-driven controller achieves significantly higher AWML performance than baseline controllers equipped with state-of-the-art exploration strategies such as Random Network Distillation and Model Disagreement.
doi_str_mv	10.48550/arxiv.2007.07853
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2007_07853</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2007_07853</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-92f2806ad90b0744de5e3f6853a432be6239930d415692706d9e1fc6377fce5f3</originalsourceid><addsrcrecordid>eNotzr1uwjAUhmEvHSrgApjqG0h64uOfeERRC5WC6IDUMTLxMbUUSOWktNw9BTp92_s9jM0LyGWpFDy79BtPuQAwOZhS4SPDRTvGE_GPPnWer3tPHa_JpWM87vlPHD_5e-r3iYaBV98p9kMcz1P2EFw30Ox_J2z7-rKtVlm9Wb5Vizpz2mBmRRAlaOct7MBI6UkRBv336iSKHWmB1iJ4WShthQHtLRWh1WhMaEkFnLCne_ambr5SPLh0bq765qbHC8OpPjM</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Active World Model Learning with Progress Curiosity</title><source>arXiv.org</source><creator>Kim, Kuno ; Sano, Megumi ; De Freitas, Julian ; Haber, Nick ; Yamins, Daniel</creator><creatorcontrib>Kim, Kuno ; Sano, Megumi ; De Freitas, Julian ; Haber, Nick ; Yamins, Daniel</creatorcontrib><description>World models are self-supervised predictive models of how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of the behavioral patterns of other agents. In this work, we study how to design such a curiosity-driven Active World Model Learning (AWML) system. To do so, we construct a curious agent building world models while visually exploring a 3D physical environment rich with distillations of representative real-world agents. We propose an AWML system driven by $\gamma$-Progress: a scalable and effective learning progress-based curiosity signal. We show that $\gamma$-Progress naturally gives rise to an exploration policy that directs attention to complex but learnable dynamics in a balanced manner, thus overcoming the "white noise problem". As a result, our $\gamma$-Progress-driven controller achieves significantly higher AWML performance than baseline controllers equipped with state-of-the-art exploration strategies such as Random Network Distillation and Model Disagreement.</description><identifier>DOI: 10.48550/arxiv.2007.07853</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Learning ; Statistics - Machine Learning</subject><creationdate>2020-07</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,778,883</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2007.07853$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2007.07853$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Kim, Kuno</creatorcontrib><creatorcontrib>Sano, Megumi</creatorcontrib><creatorcontrib>De Freitas, Julian</creatorcontrib><creatorcontrib>Haber, Nick</creatorcontrib><creatorcontrib>Yamins, Daniel</creatorcontrib><title>Active World Model Learning with Progress Curiosity</title><description>World models are self-supervised predictive models of how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of the behavioral patterns of other agents. In this work, we study how to design such a curiosity-driven Active World Model Learning (AWML) system. To do so, we construct a curious agent building world models while visually exploring a 3D physical environment rich with distillations of representative real-world agents. We propose an AWML system driven by $\gamma$-Progress: a scalable and effective learning progress-based curiosity signal. We show that $\gamma$-Progress naturally gives rise to an exploration policy that directs attention to complex but learnable dynamics in a balanced manner, thus overcoming the "white noise problem". As a result, our $\gamma$-Progress-driven controller achieves significantly higher AWML performance than baseline controllers equipped with state-of-the-art exploration strategies such as Random Network Distillation and Model Disagreement.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Learning</subject><subject>Statistics - Machine Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzr1uwjAUhmEvHSrgApjqG0h64uOfeERRC5WC6IDUMTLxMbUUSOWktNw9BTp92_s9jM0LyGWpFDy79BtPuQAwOZhS4SPDRTvGE_GPPnWer3tPHa_JpWM87vlPHD_5e-r3iYaBV98p9kMcz1P2EFw30Ox_J2z7-rKtVlm9Wb5Vizpz2mBmRRAlaOct7MBI6UkRBv336iSKHWmB1iJ4WShthQHtLRWh1WhMaEkFnLCne_ambr5SPLh0bq765qbHC8OpPjM</recordid><startdate>20200715</startdate><enddate>20200715</enddate><creator>Kim, Kuno</creator><creator>Sano, Megumi</creator><creator>De Freitas, Julian</creator><creator>Haber, Nick</creator><creator>Yamins, Daniel</creator><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20200715</creationdate><title>Active World Model Learning with Progress Curiosity</title><author>Kim, Kuno ; Sano, Megumi ; De Freitas, Julian ; Haber, Nick ; Yamins, Daniel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-92f2806ad90b0744de5e3f6853a432be6239930d415692706d9e1fc6377fce5f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Learning</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Kim, Kuno</creatorcontrib><creatorcontrib>Sano, Megumi</creatorcontrib><creatorcontrib>De Freitas, Julian</creatorcontrib><creatorcontrib>Haber, Nick</creatorcontrib><creatorcontrib>Yamins, Daniel</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Kim, Kuno</au><au>Sano, Megumi</au><au>De Freitas, Julian</au><au>Haber, Nick</au><au>Yamins, Daniel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Active World Model Learning with Progress Curiosity</atitle><date>2020-07-15</date><risdate>2020</risdate><abstract>World models are self-supervised predictive models of how the world evolves. Humans learn world models by curiously exploring their environment, in the process acquiring compact abstractions of high bandwidth sensory inputs, the ability to plan across long temporal horizons, and an understanding of the behavioral patterns of other agents. In this work, we study how to design such a curiosity-driven Active World Model Learning (AWML) system. To do so, we construct a curious agent building world models while visually exploring a 3D physical environment rich with distillations of representative real-world agents. We propose an AWML system driven by $\gamma$-Progress: a scalable and effective learning progress-based curiosity signal. We show that $\gamma$-Progress naturally gives rise to an exploration policy that directs attention to complex but learnable dynamics in a balanced manner, thus overcoming the "white noise problem". As a result, our $\gamma$-Progress-driven controller achieves significantly higher AWML performance than baseline controllers equipped with state-of-the-art exploration strategies such as Random Network Distillation and Model Disagreement.</abstract><doi>10.48550/arxiv.2007.07853</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2007.07853
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2007_07853
source	arXiv.org
subjects	Computer Science - Artificial Intelligence Computer Science - Learning Statistics - Machine Learning
title	Active World Model Learning with Progress Curiosity
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T06%3A54%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Active%20World%20Model%20Learning%20with%20Progress%20Curiosity&rft.au=Kim,%20Kuno&rft.date=2020-07-15&rft_id=info:doi/10.48550/arxiv.2007.07853&rft_dat=%3Carxiv_GOX%3E2007_07853%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true