Admittance Visuomotor Policy Learning for General-Purpose Contact-Rich Manipulations

Bibliographic details
Main authors: Zhou, Bo; Jiao, Ruixuan; Li, Yi; Yuan, Xiaogang; Fang, Fang; Li, Shihua
Format: Article
Language: eng
creator Zhou, Bo
Jiao, Ruixuan
Li, Yi
Yuan, Xiaogang
Fang, Fang
Li, Shihua
description Contact force is an essential modality for robots performing general-purpose manipulation in contact-rich environments, because it compensates for the deficiencies of visual and proprioceptive data in collision perception, high-precision grasping, and efficient manipulation. In this paper, we propose an admittance visuomotor policy framework for continuous, general-purpose, contact-rich manipulation. For demonstrations, we design a low-cost, user-friendly teleoperation system with contact interaction that gathers compliant robot demonstrations and accelerates data collection. For training and inference, we propose a diffusion-based model that plans action trajectories and desired contact forces from multimodal observations comprising contact force, vision, and proprioception. An admittance controller executes the planned actions compliantly. We compare the framework with two state-of-the-art methods on five challenging tasks, each focusing on a different action primitive, to demonstrate its generalization capabilities. The results show that our framework achieves the highest success rate and exhibits smoother, more efficient contact than the other methods: on average, the contact force required to complete each task is reduced by 48.8% and the success rate is increased by 15.3%. Videos are available at https://ryanjiao.github.io/AdmitDiffPolicy/.
doi_str_mv 10.48550/arxiv.2409.14440
format Article
creationdate 2024-09-22
rights http://arxiv.org/licenses/nonexclusive-distrib/1.0
oa free_for_read
sourcetype Open Access Repository
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2409.14440
language eng
recordid cdi_arxiv_primary_2409_14440
source arXiv.org
subjects Computer Science - Robotics
title Admittance Visuomotor Policy Learning for General-Purpose Contact-Rich Manipulations
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T09%3A38%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Admittance%20Visuomotor%20Policy%20Learning%20for%20General-Purpose%20Contact-Rich%20Manipulations&rft.au=Zhou,%20Bo&rft.date=2024-09-22&rft_id=info:doi/10.48550/arxiv.2409.14440&rft_dat=%3Carxiv_GOX%3E2409_14440%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true