Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers

Recent advances in diffusion-based robot policies have demonstrated significant potential in imitating multi-modal behaviors. However, these approaches typically require large quantities of demonstration data paired with corresponding robot action labels, creating a substantial data collection burde...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Bi, Jianxin, Lim, Kelvin, Chen, Kaiqi, Huang, Yifei, Soh, Harold
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Learning Computer Science - Robotics
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Bi, Jianxin Lim, Kelvin Chen, Kaiqi Huang, Yifei Soh, Harold
description	Recent advances in diffusion-based robot policies have demonstrated significant potential in imitating multi-modal behaviors. However, these approaches typically require large quantities of demonstration data paired with corresponding robot action labels, creating a substantial data collection burden. In this work, we propose a plan-then-control framework aimed at improving the action-data efficiency of inverse dynamics controllers by leveraging observational demonstration data. Specifically, we adopt a Deep Koopman Operator framework to model the dynamical system and utilize observation-only trajectories to learn a latent action representation. This latent representation can then be effectively mapped to real high-dimensional continuous actions using a linear action decoder, requiring minimal action-labeled data. Through experiments on simulated robot manipulation tasks and a real robot experiment with multi-modal expert demonstrations, we demonstrate that our approach significantly enhances action-data efficiency and achieves high task success rates with limited action data.
doi_str_mv	10.48550/arxiv.2410.07584
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2410_07584</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2410_07584</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2410_075843</originalsourceid><addsrcrecordid>eNqFjrEOgjAURbs4GPUDnHw_IKJCZDWg0cjg4ObQvEjRl5RX0lbUvxeIu9NN7jnDEWK6DIMoieNwgfZNTbCK2iPcxEk0FNdjRR49GYZcoWXiO7zIPyCnFqgCtrcOOmgIIaOyfLrOPWtkVtYBcgGZUjWcjKkrZEgNe2u0buFYDErUTk1-OxKz_e6SHuZ9hqwtVWg_ssuRfc76v_EFBE5Axg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers</title><source>arXiv.org</source><creator>Bi, Jianxin ; Lim, Kelvin ; Chen, Kaiqi ; Huang, Yifei ; Soh, Harold</creator><creatorcontrib>Bi, Jianxin ; Lim, Kelvin ; Chen, Kaiqi ; Huang, Yifei ; Soh, Harold</creatorcontrib><description>Recent advances in diffusion-based robot policies have demonstrated significant potential in imitating multi-modal behaviors. However, these approaches typically require large quantities of demonstration data paired with corresponding robot action labels, creating a substantial data collection burden. In this work, we propose a plan-then-control framework aimed at improving the action-data efficiency of inverse dynamics controllers by leveraging observational demonstration data. Specifically, we adopt a Deep Koopman Operator framework to model the dynamical system and utilize observation-only trajectories to learn a latent action representation. This latent representation can then be effectively mapped to real high-dimensional continuous actions using a linear action decoder, requiring minimal action-labeled data. Through experiments on simulated robot manipulation tasks and a real robot experiment with multi-modal expert demonstrations, we demonstrate that our approach significantly enhances action-data efficiency and achieves high task success rates with limited action data.</description><identifier>DOI: 10.48550/arxiv.2410.07584</identifier><language>eng</language><subject>Computer Science - Learning ; Computer Science - Robotics</subject><creationdate>2024-10</creationdate><rights>http://creativecommons.org/licenses/by-nc-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2410.07584$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2410.07584$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Bi, Jianxin</creatorcontrib><creatorcontrib>Lim, Kelvin</creatorcontrib><creatorcontrib>Chen, Kaiqi</creatorcontrib><creatorcontrib>Huang, Yifei</creatorcontrib><creatorcontrib>Soh, Harold</creatorcontrib><title>Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers</title><description>Recent advances in diffusion-based robot policies have demonstrated significant potential in imitating multi-modal behaviors. However, these approaches typically require large quantities of demonstration data paired with corresponding robot action labels, creating a substantial data collection burden. In this work, we propose a plan-then-control framework aimed at improving the action-data efficiency of inverse dynamics controllers by leveraging observational demonstration data. Specifically, we adopt a Deep Koopman Operator framework to model the dynamical system and utilize observation-only trajectories to learn a latent action representation. This latent representation can then be effectively mapped to real high-dimensional continuous actions using a linear action decoder, requiring minimal action-labeled data. Through experiments on simulated robot manipulation tasks and a real robot experiment with multi-modal expert demonstrations, we demonstrate that our approach significantly enhances action-data efficiency and achieves high task success rates with limited action data.</description><subject>Computer Science - Learning</subject><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFjrEOgjAURbs4GPUDnHw_IKJCZDWg0cjg4ObQvEjRl5RX0lbUvxeIu9NN7jnDEWK6DIMoieNwgfZNTbCK2iPcxEk0FNdjRR49GYZcoWXiO7zIPyCnFqgCtrcOOmgIIaOyfLrOPWtkVtYBcgGZUjWcjKkrZEgNe2u0buFYDErUTk1-OxKz_e6SHuZ9hqwtVWg_ssuRfc76v_EFBE5Axg</recordid><startdate>20241009</startdate><enddate>20241009</enddate><creator>Bi, Jianxin</creator><creator>Lim, Kelvin</creator><creator>Chen, Kaiqi</creator><creator>Huang, Yifei</creator><creator>Soh, Harold</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241009</creationdate><title>Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers</title><author>Bi, Jianxin ; Lim, Kelvin ; Chen, Kaiqi ; Huang, Yifei ; Soh, Harold</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2410_075843</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Learning</topic><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Bi, Jianxin</creatorcontrib><creatorcontrib>Lim, Kelvin</creatorcontrib><creatorcontrib>Chen, Kaiqi</creatorcontrib><creatorcontrib>Huang, Yifei</creatorcontrib><creatorcontrib>Soh, Harold</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bi, Jianxin</au><au>Lim, Kelvin</au><au>Chen, Kaiqi</au><au>Huang, Yifei</au><au>Soh, Harold</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers</atitle><date>2024-10-09</date><risdate>2024</risdate><abstract>Recent advances in diffusion-based robot policies have demonstrated significant potential in imitating multi-modal behaviors. However, these approaches typically require large quantities of demonstration data paired with corresponding robot action labels, creating a substantial data collection burden. In this work, we propose a plan-then-control framework aimed at improving the action-data efficiency of inverse dynamics controllers by leveraging observational demonstration data. Specifically, we adopt a Deep Koopman Operator framework to model the dynamical system and utilize observation-only trajectories to learn a latent action representation. This latent representation can then be effectively mapped to real high-dimensional continuous actions using a linear action decoder, requiring minimal action-labeled data. Through experiments on simulated robot manipulation tasks and a real robot experiment with multi-modal expert demonstrations, we demonstrate that our approach significantly enhances action-data efficiency and achieves high task success rates with limited action data.</abstract><doi>10.48550/arxiv.2410.07584</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2410.07584
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2410_07584
source	arXiv.org
subjects	Computer Science - Learning Computer Science - Robotics
title	Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T05%3A12%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Imitation%20Learning%20with%20Limited%20Actions%20via%20Diffusion%20Planners%20and%20Deep%20Koopman%20Controllers&rft.au=Bi,%20Jianxin&rft.date=2024-10-09&rft_id=info:doi/10.48550/arxiv.2410.07584&rft_dat=%3Carxiv_GOX%3E2410_07584%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true