MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning

Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Wu, Bohan, Akinola, Iretiayo, Varley, Jacob, Allen, Peter
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Artificial Intelligence Computer Science - Learning Computer Science - Robotics
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Wu, Bohan Akinola, Iretiayo Varley, Jacob Allen, Peter
description	Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it attempts another open-loop regrasp. This work presents MAT, a tactile closed-loop method capable of realizing grasps provided by a coarse initial positioning of the hand above an object. Our algorithm is a deep reinforcement learning (RL) policy optimized through the clipped surrogate objective within a maximum entropy RL framework to balance exploitation and exploration. The method utilizes tactile and proprioceptive information to act through both fine finger motions and larger regrasp movements to execute stable grasps. A novel curriculum of action motion magnitude makes learning more tractable and helps turn common failure cases into successes. Careful selection of features that exhibit small sim-to-real gaps enables this tactile grasping policy, trained purely in simulation, to transfer well to real world environments without the need for additional learning. Experimentally, this methodology improves over a vision-only grasp success rate substantially on a multi-fingered robot hand. When this methodology is used to realize grasps from coarse initial positions provided by a vision-only planner, the system is made dramatically more robust to calibration errors in the camera-robot transform.
doi_str_mv	10.48550/arxiv.1909.04787
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1909_04787</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1909_04787</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-24ad0bf2593f8e4f7e8821f1ea9697bc2208c46cf3fa9362ae2d08a98cddf0a3</originalsourceid><addsrcrecordid>eNotz7FOwzAUhWEvDKjwAEz4BRIc24lttqjQgpQIiWaPbu17K0tpiNwQwdtTCtMZfulIH2N3hci1LUvxAOkrLnnhhMuFNtZcs7atu0fefg5zzDZxPGDCwOsA0xwX5B34OQ7ItwlO07nyJQJ_Qpz4O8aRPpLHI44zbxDSeO437IpgOOHt_67YbvPcrV-y5m37uq6bDCpjMqkhiD3J0imyqMmgtbKgAsFVzuy9lMJ6XXlSBE5VElAGYcFZHwIJUCt2__d64fRTikdI3_0vq7-w1A-PREg_</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</title><source>arXiv.org</source><creator>Wu, Bohan ; Akinola, Iretiayo ; Varley, Jacob ; Allen, Peter</creator><creatorcontrib>Wu, Bohan ; Akinola, Iretiayo ; Varley, Jacob ; Allen, Peter</creatorcontrib><description>Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it attempts another open-loop regrasp. This work presents MAT, a tactile closed-loop method capable of realizing grasps provided by a coarse initial positioning of the hand above an object. Our algorithm is a deep reinforcement learning (RL) policy optimized through the clipped surrogate objective within a maximum entropy RL framework to balance exploitation and exploration. The method utilizes tactile and proprioceptive information to act through both fine finger motions and larger regrasp movements to execute stable grasps. A novel curriculum of action motion magnitude makes learning more tractable and helps turn common failure cases into successes. Careful selection of features that exhibit small sim-to-real gaps enables this tactile grasping policy, trained purely in simulation, to transfer well to real world environments without the need for additional learning. Experimentally, this methodology improves over a vision-only grasp success rate substantially on a multi-fingered robot hand. When this methodology is used to realize grasps from coarse initial positions provided by a vision-only planner, the system is made dramatically more robust to calibration errors in the camera-robot transform.</description><identifier>DOI: 10.48550/arxiv.1909.04787</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Learning ; Computer Science - Robotics</subject><creationdate>2019-09</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1909.04787$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1909.04787$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Wu, Bohan</creatorcontrib><creatorcontrib>Akinola, Iretiayo</creatorcontrib><creatorcontrib>Varley, Jacob</creatorcontrib><creatorcontrib>Allen, Peter</creatorcontrib><title>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</title><description>Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it attempts another open-loop regrasp. This work presents MAT, a tactile closed-loop method capable of realizing grasps provided by a coarse initial positioning of the hand above an object. Our algorithm is a deep reinforcement learning (RL) policy optimized through the clipped surrogate objective within a maximum entropy RL framework to balance exploitation and exploration. The method utilizes tactile and proprioceptive information to act through both fine finger motions and larger regrasp movements to execute stable grasps. A novel curriculum of action motion magnitude makes learning more tractable and helps turn common failure cases into successes. Careful selection of features that exhibit small sim-to-real gaps enables this tactile grasping policy, trained purely in simulation, to transfer well to real world environments without the need for additional learning. Experimentally, this methodology improves over a vision-only grasp success rate substantially on a multi-fingered robot hand. When this methodology is used to realize grasps from coarse initial positions provided by a vision-only planner, the system is made dramatically more robust to calibration errors in the camera-robot transform.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Learning</subject><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz7FOwzAUhWEvDKjwAEz4BRIc24lttqjQgpQIiWaPbu17K0tpiNwQwdtTCtMZfulIH2N3hci1LUvxAOkrLnnhhMuFNtZcs7atu0fefg5zzDZxPGDCwOsA0xwX5B34OQ7ItwlO07nyJQJ_Qpz4O8aRPpLHI44zbxDSeO437IpgOOHt_67YbvPcrV-y5m37uq6bDCpjMqkhiD3J0imyqMmgtbKgAsFVzuy9lMJ6XXlSBE5VElAGYcFZHwIJUCt2__d64fRTikdI3_0vq7-w1A-PREg_</recordid><startdate>20190910</startdate><enddate>20190910</enddate><creator>Wu, Bohan</creator><creator>Akinola, Iretiayo</creator><creator>Varley, Jacob</creator><creator>Allen, Peter</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20190910</creationdate><title>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</title><author>Wu, Bohan ; Akinola, Iretiayo ; Varley, Jacob ; Allen, Peter</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-24ad0bf2593f8e4f7e8821f1ea9697bc2208c46cf3fa9362ae2d08a98cddf0a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Learning</topic><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Wu, Bohan</creatorcontrib><creatorcontrib>Akinola, Iretiayo</creatorcontrib><creatorcontrib>Varley, Jacob</creatorcontrib><creatorcontrib>Allen, Peter</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Wu, Bohan</au><au>Akinola, Iretiayo</au><au>Varley, Jacob</au><au>Allen, Peter</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</atitle><date>2019-09-10</date><risdate>2019</risdate><abstract>Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it attempts another open-loop regrasp. This work presents MAT, a tactile closed-loop method capable of realizing grasps provided by a coarse initial positioning of the hand above an object. Our algorithm is a deep reinforcement learning (RL) policy optimized through the clipped surrogate objective within a maximum entropy RL framework to balance exploitation and exploration. The method utilizes tactile and proprioceptive information to act through both fine finger motions and larger regrasp movements to execute stable grasps. A novel curriculum of action motion magnitude makes learning more tractable and helps turn common failure cases into successes. Careful selection of features that exhibit small sim-to-real gaps enables this tactile grasping policy, trained purely in simulation, to transfer well to real world environments without the need for additional learning. Experimentally, this methodology improves over a vision-only grasp success rate substantially on a multi-fingered robot hand. When this methodology is used to realize grasps from coarse initial positions provided by a vision-only planner, the system is made dramatically more robust to calibration errors in the camera-robot transform.</abstract><doi>10.48550/arxiv.1909.04787</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.1909.04787
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_1909_04787
source	arXiv.org
subjects	Computer Science - Artificial Intelligence Computer Science - Learning Computer Science - Robotics
title	MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T17%3A29%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MAT:%20Multi-Fingered%20Adaptive%20Tactile%20Grasping%20via%20Deep%20Reinforcement%20Learning&rft.au=Wu,%20Bohan&rft.date=2019-09-10&rft_id=info:doi/10.48550/arxiv.1909.04787&rft_dat=%3Carxiv_GOX%3E1909_04787%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true