DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots
We present DemoStart, a novel auto-curriculum reinforcement learning method capable of learning complex manipulation behaviors on an arm equipped with a three-fingered robotic hand, from only a sparse reward and a handful of demonstrations in simulation. Learning from simulation drastically reduces...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Bauza, Maria Chen, Jose Enrique Dalibard, Valentin Gileadi, Nimrod Hafner, Roland Martins, Murilo F Moore, Joss Pevceviciute, Rugile Laurens, Antoine Rao, Dushyant Zambelli, Martina Riedmiller, Martin Scholz, Jon Bousmalis, Konstantinos Nori, Francesco Heess, Nicolas |
description | We present DemoStart, a novel auto-curriculum reinforcement learning method
capable of learning complex manipulation behaviors on an arm equipped with a
three-fingered robotic hand, from only a sparse reward and a handful of
demonstrations in simulation. Learning from simulation drastically reduces the
development cycle of behavior generation, and domain randomization techniques
are leveraged to achieve successful zero-shot sim-to-real transfer. Transferred
policies are learned directly from raw pixels from multiple cameras and robot
proprioception. Our approach outperforms policies learned from demonstrations
on the real robot and requires 100 times fewer demonstrations, collected in
simulation. More details and videos in https://sites.google.com/view/demostart. |
doi_str_mv | 10.48550/arxiv.2409.06613 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2409_06613</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2409_06613</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2409_066133</originalsourceid><addsrcrecordid>eNqFjrkOwjAQRN1QIOADqPAPOARyCGg5RA810RIcWMmOo_Wa4-9JInqqGc284gkxXcRRusqyeA70xme0TON1FOf5IhmKy05bd2Ig3siu1p4JGF2tjL5JCOxUGYiwDCZYCU1jsN3ZSY9WtSdpMPKF_JA2GEZVYX3X1CLkro79WAwqMF5PfjkSs8P-vD2q3qRoCC3Qp-iMit4o-U98ATnxQpM</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots</title><source>arXiv.org</source><creator>Bauza, Maria ; Chen, Jose Enrique ; Dalibard, Valentin ; Gileadi, Nimrod ; Hafner, Roland ; Martins, Murilo F ; Moore, Joss ; Pevceviciute, Rugile ; Laurens, Antoine ; Rao, Dushyant ; Zambelli, Martina ; Riedmiller, Martin ; Scholz, Jon ; Bousmalis, Konstantinos ; Nori, Francesco ; Heess, Nicolas</creator><creatorcontrib>Bauza, Maria ; Chen, Jose Enrique ; Dalibard, Valentin ; Gileadi, Nimrod ; Hafner, Roland ; Martins, Murilo F ; Moore, Joss ; Pevceviciute, Rugile ; Laurens, Antoine ; Rao, Dushyant ; Zambelli, Martina ; Riedmiller, Martin ; Scholz, Jon ; Bousmalis, Konstantinos ; Nori, Francesco ; Heess, Nicolas</creatorcontrib><description>We present DemoStart, a novel auto-curriculum reinforcement learning method
capable of learning complex manipulation behaviors on an arm equipped with a
three-fingered robotic hand, from only a sparse reward and a handful of
demonstrations in simulation. Learning from simulation drastically reduces the
development cycle of behavior generation, and domain randomization techniques
are leveraged to achieve successful zero-shot sim-to-real transfer. Transferred
policies are learned directly from raw pixels from multiple cameras and robot
proprioception. Our approach outperforms policies learned from demonstrations
on the real robot and requires 100 times fewer demonstrations, collected in
simulation. More details and videos in https://sites.google.com/view/demostart.</description><identifier>DOI: 10.48550/arxiv.2409.06613</identifier><language>eng</language><subject>Computer Science - Learning ; Computer Science - Robotics</subject><creationdate>2024-09</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2409.06613$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2409.06613$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Bauza, Maria</creatorcontrib><creatorcontrib>Chen, Jose Enrique</creatorcontrib><creatorcontrib>Dalibard, Valentin</creatorcontrib><creatorcontrib>Gileadi, Nimrod</creatorcontrib><creatorcontrib>Hafner, Roland</creatorcontrib><creatorcontrib>Martins, Murilo F</creatorcontrib><creatorcontrib>Moore, Joss</creatorcontrib><creatorcontrib>Pevceviciute, Rugile</creatorcontrib><creatorcontrib>Laurens, Antoine</creatorcontrib><creatorcontrib>Rao, Dushyant</creatorcontrib><creatorcontrib>Zambelli, Martina</creatorcontrib><creatorcontrib>Riedmiller, Martin</creatorcontrib><creatorcontrib>Scholz, Jon</creatorcontrib><creatorcontrib>Bousmalis, Konstantinos</creatorcontrib><creatorcontrib>Nori, Francesco</creatorcontrib><creatorcontrib>Heess, Nicolas</creatorcontrib><title>DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots</title><description>We present DemoStart, a novel auto-curriculum reinforcement learning method
capable of learning complex manipulation behaviors on an arm equipped with a
three-fingered robotic hand, from only a sparse reward and a handful of
demonstrations in simulation. Learning from simulation drastically reduces the
development cycle of behavior generation, and domain randomization techniques
are leveraged to achieve successful zero-shot sim-to-real transfer. Transferred
policies are learned directly from raw pixels from multiple cameras and robot
proprioception. Our approach outperforms policies learned from demonstrations
on the real robot and requires 100 times fewer demonstrations, collected in
simulation. More details and videos in https://sites.google.com/view/demostart.</description><subject>Computer Science - Learning</subject><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFjrkOwjAQRN1QIOADqPAPOARyCGg5RA810RIcWMmOo_Wa4-9JInqqGc284gkxXcRRusqyeA70xme0TON1FOf5IhmKy05bd2Ig3siu1p4JGF2tjL5JCOxUGYiwDCZYCU1jsN3ZSY9WtSdpMPKF_JA2GEZVYX3X1CLkro79WAwqMF5PfjkSs8P-vD2q3qRoCC3Qp-iMit4o-U98ATnxQpM</recordid><startdate>20240910</startdate><enddate>20240910</enddate><creator>Bauza, Maria</creator><creator>Chen, Jose Enrique</creator><creator>Dalibard, Valentin</creator><creator>Gileadi, Nimrod</creator><creator>Hafner, Roland</creator><creator>Martins, Murilo F</creator><creator>Moore, Joss</creator><creator>Pevceviciute, Rugile</creator><creator>Laurens, Antoine</creator><creator>Rao, Dushyant</creator><creator>Zambelli, Martina</creator><creator>Riedmiller, Martin</creator><creator>Scholz, Jon</creator><creator>Bousmalis, Konstantinos</creator><creator>Nori, Francesco</creator><creator>Heess, Nicolas</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240910</creationdate><title>DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots</title><author>Bauza, Maria ; Chen, Jose Enrique ; Dalibard, Valentin ; Gileadi, Nimrod ; Hafner, Roland ; Martins, Murilo F ; Moore, Joss ; Pevceviciute, Rugile ; Laurens, Antoine ; Rao, Dushyant ; Zambelli, Martina ; Riedmiller, Martin ; Scholz, Jon ; Bousmalis, Konstantinos ; Nori, Francesco ; Heess, Nicolas</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2409_066133</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Learning</topic><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Bauza, Maria</creatorcontrib><creatorcontrib>Chen, Jose Enrique</creatorcontrib><creatorcontrib>Dalibard, Valentin</creatorcontrib><creatorcontrib>Gileadi, Nimrod</creatorcontrib><creatorcontrib>Hafner, Roland</creatorcontrib><creatorcontrib>Martins, Murilo F</creatorcontrib><creatorcontrib>Moore, Joss</creatorcontrib><creatorcontrib>Pevceviciute, Rugile</creatorcontrib><creatorcontrib>Laurens, Antoine</creatorcontrib><creatorcontrib>Rao, Dushyant</creatorcontrib><creatorcontrib>Zambelli, Martina</creatorcontrib><creatorcontrib>Riedmiller, Martin</creatorcontrib><creatorcontrib>Scholz, Jon</creatorcontrib><creatorcontrib>Bousmalis, Konstantinos</creatorcontrib><creatorcontrib>Nori, Francesco</creatorcontrib><creatorcontrib>Heess, Nicolas</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bauza, Maria</au><au>Chen, Jose Enrique</au><au>Dalibard, Valentin</au><au>Gileadi, Nimrod</au><au>Hafner, Roland</au><au>Martins, Murilo F</au><au>Moore, Joss</au><au>Pevceviciute, Rugile</au><au>Laurens, Antoine</au><au>Rao, Dushyant</au><au>Zambelli, Martina</au><au>Riedmiller, Martin</au><au>Scholz, Jon</au><au>Bousmalis, Konstantinos</au><au>Nori, Francesco</au><au>Heess, Nicolas</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots</atitle><date>2024-09-10</date><risdate>2024</risdate><abstract>We present DemoStart, a novel auto-curriculum reinforcement learning method
capable of learning complex manipulation behaviors on an arm equipped with a
three-fingered robotic hand, from only a sparse reward and a handful of
demonstrations in simulation. Learning from simulation drastically reduces the
development cycle of behavior generation, and domain randomization techniques
are leveraged to achieve successful zero-shot sim-to-real transfer. Transferred
policies are learned directly from raw pixels from multiple cameras and robot
proprioception. Our approach outperforms policies learned from demonstrations
on the real robot and requires 100 times fewer demonstrations, collected in
simulation. More details and videos in https://sites.google.com/view/demostart.</abstract><doi>10.48550/arxiv.2409.06613</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2409.06613 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2409_06613 |
source | arXiv.org |
subjects | Computer Science - Learning Computer Science - Robotics |
title | DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T18%3A19%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DemoStart:%20Demonstration-led%20auto-curriculum%20applied%20to%20sim-to-real%20with%20multi-fingered%20robots&rft.au=Bauza,%20Maria&rft.date=2024-09-10&rft_id=info:doi/10.48550/arxiv.2409.06613&rft_dat=%3Carxiv_GOX%3E2409_06613%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |