MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning

Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Wu, Bohan, Akinola, Iretiayo, Varley, Jacob, Allen, Peter
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Wu, Bohan
Akinola, Iretiayo
Varley, Jacob
Allen, Peter
description Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it attempts another open-loop regrasp. This work presents MAT, a tactile closed-loop method capable of realizing grasps provided by a coarse initial positioning of the hand above an object. Our algorithm is a deep reinforcement learning (RL) policy optimized through the clipped surrogate objective within a maximum entropy RL framework to balance exploitation and exploration. The method utilizes tactile and proprioceptive information to act through both fine finger motions and larger regrasp movements to execute stable grasps. A novel curriculum of action motion magnitude makes learning more tractable and helps turn common failure cases into successes. Careful selection of features that exhibit small sim-to-real gaps enables this tactile grasping policy, trained purely in simulation, to transfer well to real world environments without the need for additional learning. Experimentally, this methodology improves over a vision-only grasp success rate substantially on a multi-fingered robot hand. When this methodology is used to realize grasps from coarse initial positions provided by a vision-only planner, the system is made dramatically more robust to calibration errors in the camera-robot transform.
doi_str_mv 10.48550/arxiv.1909.04787
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1909_04787</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1909_04787</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-24ad0bf2593f8e4f7e8821f1ea9697bc2208c46cf3fa9362ae2d08a98cddf0a3</originalsourceid><addsrcrecordid>eNotz7FOwzAUhWEvDKjwAEz4BRIc24lttqjQgpQIiWaPbu17K0tpiNwQwdtTCtMZfulIH2N3hci1LUvxAOkrLnnhhMuFNtZcs7atu0fefg5zzDZxPGDCwOsA0xwX5B34OQ7ItwlO07nyJQJ_Qpz4O8aRPpLHI44zbxDSeO437IpgOOHt_67YbvPcrV-y5m37uq6bDCpjMqkhiD3J0imyqMmgtbKgAsFVzuy9lMJ6XXlSBE5VElAGYcFZHwIJUCt2__d64fRTikdI3_0vq7-w1A-PREg_</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</title><source>arXiv.org</source><creator>Wu, Bohan ; Akinola, Iretiayo ; Varley, Jacob ; Allen, Peter</creator><creatorcontrib>Wu, Bohan ; Akinola, Iretiayo ; Varley, Jacob ; Allen, Peter</creatorcontrib><description>Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it attempts another open-loop regrasp. This work presents MAT, a tactile closed-loop method capable of realizing grasps provided by a coarse initial positioning of the hand above an object. Our algorithm is a deep reinforcement learning (RL) policy optimized through the clipped surrogate objective within a maximum entropy RL framework to balance exploitation and exploration. The method utilizes tactile and proprioceptive information to act through both fine finger motions and larger regrasp movements to execute stable grasps. A novel curriculum of action motion magnitude makes learning more tractable and helps turn common failure cases into successes. Careful selection of features that exhibit small sim-to-real gaps enables this tactile grasping policy, trained purely in simulation, to transfer well to real world environments without the need for additional learning. Experimentally, this methodology improves over a vision-only grasp success rate substantially on a multi-fingered robot hand. When this methodology is used to realize grasps from coarse initial positions provided by a vision-only planner, the system is made dramatically more robust to calibration errors in the camera-robot transform.</description><identifier>DOI: 10.48550/arxiv.1909.04787</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Learning ; Computer Science - Robotics</subject><creationdate>2019-09</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1909.04787$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1909.04787$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Wu, Bohan</creatorcontrib><creatorcontrib>Akinola, Iretiayo</creatorcontrib><creatorcontrib>Varley, Jacob</creatorcontrib><creatorcontrib>Allen, Peter</creatorcontrib><title>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</title><description>Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it attempts another open-loop regrasp. This work presents MAT, a tactile closed-loop method capable of realizing grasps provided by a coarse initial positioning of the hand above an object. Our algorithm is a deep reinforcement learning (RL) policy optimized through the clipped surrogate objective within a maximum entropy RL framework to balance exploitation and exploration. The method utilizes tactile and proprioceptive information to act through both fine finger motions and larger regrasp movements to execute stable grasps. A novel curriculum of action motion magnitude makes learning more tractable and helps turn common failure cases into successes. Careful selection of features that exhibit small sim-to-real gaps enables this tactile grasping policy, trained purely in simulation, to transfer well to real world environments without the need for additional learning. Experimentally, this methodology improves over a vision-only grasp success rate substantially on a multi-fingered robot hand. When this methodology is used to realize grasps from coarse initial positions provided by a vision-only planner, the system is made dramatically more robust to calibration errors in the camera-robot transform.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Learning</subject><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz7FOwzAUhWEvDKjwAEz4BRIc24lttqjQgpQIiWaPbu17K0tpiNwQwdtTCtMZfulIH2N3hci1LUvxAOkrLnnhhMuFNtZcs7atu0fefg5zzDZxPGDCwOsA0xwX5B34OQ7ItwlO07nyJQJ_Qpz4O8aRPpLHI44zbxDSeO437IpgOOHt_67YbvPcrV-y5m37uq6bDCpjMqkhiD3J0imyqMmgtbKgAsFVzuy9lMJ6XXlSBE5VElAGYcFZHwIJUCt2__d64fRTikdI3_0vq7-w1A-PREg_</recordid><startdate>20190910</startdate><enddate>20190910</enddate><creator>Wu, Bohan</creator><creator>Akinola, Iretiayo</creator><creator>Varley, Jacob</creator><creator>Allen, Peter</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20190910</creationdate><title>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</title><author>Wu, Bohan ; Akinola, Iretiayo ; Varley, Jacob ; Allen, Peter</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-24ad0bf2593f8e4f7e8821f1ea9697bc2208c46cf3fa9362ae2d08a98cddf0a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Learning</topic><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Wu, Bohan</creatorcontrib><creatorcontrib>Akinola, Iretiayo</creatorcontrib><creatorcontrib>Varley, Jacob</creatorcontrib><creatorcontrib>Allen, Peter</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Wu, Bohan</au><au>Akinola, Iretiayo</au><au>Varley, Jacob</au><au>Allen, Peter</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</atitle><date>2019-09-10</date><risdate>2019</risdate><abstract>Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it attempts another open-loop regrasp. This work presents MAT, a tactile closed-loop method capable of realizing grasps provided by a coarse initial positioning of the hand above an object. Our algorithm is a deep reinforcement learning (RL) policy optimized through the clipped surrogate objective within a maximum entropy RL framework to balance exploitation and exploration. The method utilizes tactile and proprioceptive information to act through both fine finger motions and larger regrasp movements to execute stable grasps. A novel curriculum of action motion magnitude makes learning more tractable and helps turn common failure cases into successes. Careful selection of features that exhibit small sim-to-real gaps enables this tactile grasping policy, trained purely in simulation, to transfer well to real world environments without the need for additional learning. Experimentally, this methodology improves over a vision-only grasp success rate substantially on a multi-fingered robot hand. When this methodology is used to realize grasps from coarse initial positions provided by a vision-only planner, the system is made dramatically more robust to calibration errors in the camera-robot transform.</abstract><doi>10.48550/arxiv.1909.04787</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1909.04787
ispartof
issn
language eng
recordid cdi_arxiv_primary_1909_04787
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Learning
Computer Science - Robotics
title MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T17%3A29%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MAT:%20Multi-Fingered%20Adaptive%20Tactile%20Grasping%20via%20Deep%20Reinforcement%20Learning&rft.au=Wu,%20Bohan&rft.date=2019-09-10&rft_id=info:doi/10.48550/arxiv.1909.04787&rft_dat=%3Carxiv_GOX%3E1909_04787%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true