MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning
Vision-based grasping systems typically adopt an open-loop execution of a planned grasp. This policy can fail due to many reasons, including ubiquitous calibration error. Recovery from a failed grasp is further complicated by visual occlusion, as the hand is usually occluding the vision sensor as it...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Wu, Bohan Akinola, Iretiayo Varley, Jacob Allen, Peter |
description | Vision-based grasping systems typically adopt an open-loop execution of a
planned grasp. This policy can fail due to many reasons, including ubiquitous
calibration error. Recovery from a failed grasp is further complicated by
visual occlusion, as the hand is usually occluding the vision sensor as it
attempts another open-loop regrasp. This work presents MAT, a tactile
closed-loop method capable of realizing grasps provided by a coarse initial
positioning of the hand above an object. Our algorithm is a deep reinforcement
learning (RL) policy optimized through the clipped surrogate objective within a
maximum entropy RL framework to balance exploitation and exploration. The
method utilizes tactile and proprioceptive information to act through both fine
finger motions and larger regrasp movements to execute stable grasps. A novel
curriculum of action motion magnitude makes learning more tractable and helps
turn common failure cases into successes. Careful selection of features that
exhibit small sim-to-real gaps enables this tactile grasping policy, trained
purely in simulation, to transfer well to real world environments without the
need for additional learning. Experimentally, this methodology improves over a
vision-only grasp success rate substantially on a multi-fingered robot hand.
When this methodology is used to realize grasps from coarse initial positions
provided by a vision-only planner, the system is made dramatically more robust
to calibration errors in the camera-robot transform. |
doi_str_mv | 10.48550/arxiv.1909.04787 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1909_04787</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1909_04787</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-24ad0bf2593f8e4f7e8821f1ea9697bc2208c46cf3fa9362ae2d08a98cddf0a3</originalsourceid><addsrcrecordid>eNotz7FOwzAUhWEvDKjwAEz4BRIc24lttqjQgpQIiWaPbu17K0tpiNwQwdtTCtMZfulIH2N3hci1LUvxAOkrLnnhhMuFNtZcs7atu0fefg5zzDZxPGDCwOsA0xwX5B34OQ7ItwlO07nyJQJ_Qpz4O8aRPpLHI44zbxDSeO437IpgOOHt_67YbvPcrV-y5m37uq6bDCpjMqkhiD3J0imyqMmgtbKgAsFVzuy9lMJ6XXlSBE5VElAGYcFZHwIJUCt2__d64fRTikdI3_0vq7-w1A-PREg_</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</title><source>arXiv.org</source><creator>Wu, Bohan ; Akinola, Iretiayo ; Varley, Jacob ; Allen, Peter</creator><creatorcontrib>Wu, Bohan ; Akinola, Iretiayo ; Varley, Jacob ; Allen, Peter</creatorcontrib><description>Vision-based grasping systems typically adopt an open-loop execution of a
planned grasp. This policy can fail due to many reasons, including ubiquitous
calibration error. Recovery from a failed grasp is further complicated by
visual occlusion, as the hand is usually occluding the vision sensor as it
attempts another open-loop regrasp. This work presents MAT, a tactile
closed-loop method capable of realizing grasps provided by a coarse initial
positioning of the hand above an object. Our algorithm is a deep reinforcement
learning (RL) policy optimized through the clipped surrogate objective within a
maximum entropy RL framework to balance exploitation and exploration. The
method utilizes tactile and proprioceptive information to act through both fine
finger motions and larger regrasp movements to execute stable grasps. A novel
curriculum of action motion magnitude makes learning more tractable and helps
turn common failure cases into successes. Careful selection of features that
exhibit small sim-to-real gaps enables this tactile grasping policy, trained
purely in simulation, to transfer well to real world environments without the
need for additional learning. Experimentally, this methodology improves over a
vision-only grasp success rate substantially on a multi-fingered robot hand.
When this methodology is used to realize grasps from coarse initial positions
provided by a vision-only planner, the system is made dramatically more robust
to calibration errors in the camera-robot transform.</description><identifier>DOI: 10.48550/arxiv.1909.04787</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Learning ; Computer Science - Robotics</subject><creationdate>2019-09</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1909.04787$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1909.04787$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Wu, Bohan</creatorcontrib><creatorcontrib>Akinola, Iretiayo</creatorcontrib><creatorcontrib>Varley, Jacob</creatorcontrib><creatorcontrib>Allen, Peter</creatorcontrib><title>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</title><description>Vision-based grasping systems typically adopt an open-loop execution of a
planned grasp. This policy can fail due to many reasons, including ubiquitous
calibration error. Recovery from a failed grasp is further complicated by
visual occlusion, as the hand is usually occluding the vision sensor as it
attempts another open-loop regrasp. This work presents MAT, a tactile
closed-loop method capable of realizing grasps provided by a coarse initial
positioning of the hand above an object. Our algorithm is a deep reinforcement
learning (RL) policy optimized through the clipped surrogate objective within a
maximum entropy RL framework to balance exploitation and exploration. The
method utilizes tactile and proprioceptive information to act through both fine
finger motions and larger regrasp movements to execute stable grasps. A novel
curriculum of action motion magnitude makes learning more tractable and helps
turn common failure cases into successes. Careful selection of features that
exhibit small sim-to-real gaps enables this tactile grasping policy, trained
purely in simulation, to transfer well to real world environments without the
need for additional learning. Experimentally, this methodology improves over a
vision-only grasp success rate substantially on a multi-fingered robot hand.
When this methodology is used to realize grasps from coarse initial positions
provided by a vision-only planner, the system is made dramatically more robust
to calibration errors in the camera-robot transform.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Learning</subject><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz7FOwzAUhWEvDKjwAEz4BRIc24lttqjQgpQIiWaPbu17K0tpiNwQwdtTCtMZfulIH2N3hci1LUvxAOkrLnnhhMuFNtZcs7atu0fefg5zzDZxPGDCwOsA0xwX5B34OQ7ItwlO07nyJQJ_Qpz4O8aRPpLHI44zbxDSeO437IpgOOHt_67YbvPcrV-y5m37uq6bDCpjMqkhiD3J0imyqMmgtbKgAsFVzuy9lMJ6XXlSBE5VElAGYcFZHwIJUCt2__d64fRTikdI3_0vq7-w1A-PREg_</recordid><startdate>20190910</startdate><enddate>20190910</enddate><creator>Wu, Bohan</creator><creator>Akinola, Iretiayo</creator><creator>Varley, Jacob</creator><creator>Allen, Peter</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20190910</creationdate><title>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</title><author>Wu, Bohan ; Akinola, Iretiayo ; Varley, Jacob ; Allen, Peter</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-24ad0bf2593f8e4f7e8821f1ea9697bc2208c46cf3fa9362ae2d08a98cddf0a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Learning</topic><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Wu, Bohan</creatorcontrib><creatorcontrib>Akinola, Iretiayo</creatorcontrib><creatorcontrib>Varley, Jacob</creatorcontrib><creatorcontrib>Allen, Peter</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Wu, Bohan</au><au>Akinola, Iretiayo</au><au>Varley, Jacob</au><au>Allen, Peter</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning</atitle><date>2019-09-10</date><risdate>2019</risdate><abstract>Vision-based grasping systems typically adopt an open-loop execution of a
planned grasp. This policy can fail due to many reasons, including ubiquitous
calibration error. Recovery from a failed grasp is further complicated by
visual occlusion, as the hand is usually occluding the vision sensor as it
attempts another open-loop regrasp. This work presents MAT, a tactile
closed-loop method capable of realizing grasps provided by a coarse initial
positioning of the hand above an object. Our algorithm is a deep reinforcement
learning (RL) policy optimized through the clipped surrogate objective within a
maximum entropy RL framework to balance exploitation and exploration. The
method utilizes tactile and proprioceptive information to act through both fine
finger motions and larger regrasp movements to execute stable grasps. A novel
curriculum of action motion magnitude makes learning more tractable and helps
turn common failure cases into successes. Careful selection of features that
exhibit small sim-to-real gaps enables this tactile grasping policy, trained
purely in simulation, to transfer well to real world environments without the
need for additional learning. Experimentally, this methodology improves over a
vision-only grasp success rate substantially on a multi-fingered robot hand.
When this methodology is used to realize grasps from coarse initial positions
provided by a vision-only planner, the system is made dramatically more robust
to calibration errors in the camera-robot transform.</abstract><doi>10.48550/arxiv.1909.04787</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.1909.04787 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_1909_04787 |
source | arXiv.org |
subjects | Computer Science - Artificial Intelligence Computer Science - Learning Computer Science - Robotics |
title | MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T17%3A29%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MAT:%20Multi-Fingered%20Adaptive%20Tactile%20Grasping%20via%20Deep%20Reinforcement%20Learning&rft.au=Wu,%20Bohan&rft.date=2019-09-10&rft_id=info:doi/10.48550/arxiv.1909.04787&rft_dat=%3Carxiv_GOX%3E1909_04787%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |