X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions
For applications in e-commerce, warehouses, healthcare, and home service, robots are often required to search through heaps of objects to grasp a specific target object. For mechanical search, we introduce X-Ray, an algorithm based on learned occupancy distributions. We train a neural network using...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Danielczuk, Michael Angelova, Anelia Vanhoucke, Vincent Goldberg, Ken |
description | For applications in e-commerce, warehouses, healthcare, and home service,
robots are often required to search through heaps of objects to grasp a
specific target object. For mechanical search, we introduce X-Ray, an algorithm
based on learned occupancy distributions. We train a neural network using a
synthetic dataset of RGBD heap images labeled for a set of standard bounding
box targets with varying aspect ratios. X-Ray minimizes support of the learned
distribution as part of a mechanical search policy in both simulated and real
environments. We benchmark these policies against two baseline policies on
1,000 heaps of 15 objects in simulation where the target object is partially or
fully occluded. Results suggest that X-Ray is significantly more efficient, as
it succeeds in extracting the target object 82% of the time, 15% more often
than the best-performing baseline. Experiments on an ABB YuMi robot with 20
heaps of 25 household objects suggest that the learned policy transfers easily
to a physical system, where it outperforms baseline policies by 15% in success
rate with 17% fewer actions. Datasets, videos, and experiments are available at
https://sites.google.com/berkeley.edu/x-ray. |
doi_str_mv | 10.48550/arxiv.2004.09039 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2004_09039</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2004_09039</sourcerecordid><originalsourceid>FETCH-LOGICAL-a679-70d6408aa75609bf1222144f52eb13b1df47cec847d64bd17c4c7af9ddfcf2083</originalsourceid><addsrcrecordid>eNotj8tOhDAYRtm4MKMP4Mq-ANiWQsGdGa8JExJnFu7I37-t1DCFFDDi0zsXV9_mfCc5UXTDaCKKLKN3EH7cd8IpFQktaVpeRu1H_A7LPdkYbME7hI5sDQRsie0DAU9qxG7WRpNafRmciFrIxnm3d7_Of5LtPAx9mEhvSXW4-SOHOA_gcSGPbpyCU_Pkej9eRRcWutFc_-8q2j0_7davcVW_vK0fqhhyWcaS6lzQAkBmOS2VZZxzJoTNuFEsVUxbIdFgIeSBU5pJFCjBllpbtJwW6Sq6PWtPqc0Q3B7C0hyTm1Ny-gcyOlI3</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions</title><source>arXiv.org</source><creator>Danielczuk, Michael ; Angelova, Anelia ; Vanhoucke, Vincent ; Goldberg, Ken</creator><creatorcontrib>Danielczuk, Michael ; Angelova, Anelia ; Vanhoucke, Vincent ; Goldberg, Ken</creatorcontrib><description>For applications in e-commerce, warehouses, healthcare, and home service,
robots are often required to search through heaps of objects to grasp a
specific target object. For mechanical search, we introduce X-Ray, an algorithm
based on learned occupancy distributions. We train a neural network using a
synthetic dataset of RGBD heap images labeled for a set of standard bounding
box targets with varying aspect ratios. X-Ray minimizes support of the learned
distribution as part of a mechanical search policy in both simulated and real
environments. We benchmark these policies against two baseline policies on
1,000 heaps of 15 objects in simulation where the target object is partially or
fully occluded. Results suggest that X-Ray is significantly more efficient, as
it succeeds in extracting the target object 82% of the time, 15% more often
than the best-performing baseline. Experiments on an ABB YuMi robot with 20
heaps of 25 household objects suggest that the learned policy transfers easily
to a physical system, where it outperforms baseline policies by 15% in success
rate with 17% fewer actions. Datasets, videos, and experiments are available at
https://sites.google.com/berkeley.edu/x-ray.</description><identifier>DOI: 10.48550/arxiv.2004.09039</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Robotics</subject><creationdate>2020-04</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2004.09039$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2004.09039$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Danielczuk, Michael</creatorcontrib><creatorcontrib>Angelova, Anelia</creatorcontrib><creatorcontrib>Vanhoucke, Vincent</creatorcontrib><creatorcontrib>Goldberg, Ken</creatorcontrib><title>X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions</title><description>For applications in e-commerce, warehouses, healthcare, and home service,
robots are often required to search through heaps of objects to grasp a
specific target object. For mechanical search, we introduce X-Ray, an algorithm
based on learned occupancy distributions. We train a neural network using a
synthetic dataset of RGBD heap images labeled for a set of standard bounding
box targets with varying aspect ratios. X-Ray minimizes support of the learned
distribution as part of a mechanical search policy in both simulated and real
environments. We benchmark these policies against two baseline policies on
1,000 heaps of 15 objects in simulation where the target object is partially or
fully occluded. Results suggest that X-Ray is significantly more efficient, as
it succeeds in extracting the target object 82% of the time, 15% more often
than the best-performing baseline. Experiments on an ABB YuMi robot with 20
heaps of 25 household objects suggest that the learned policy transfers easily
to a physical system, where it outperforms baseline policies by 15% in success
rate with 17% fewer actions. Datasets, videos, and experiments are available at
https://sites.google.com/berkeley.edu/x-ray.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOhDAYRtm4MKMP4Mq-ANiWQsGdGa8JExJnFu7I37-t1DCFFDDi0zsXV9_mfCc5UXTDaCKKLKN3EH7cd8IpFQktaVpeRu1H_A7LPdkYbME7hI5sDQRsie0DAU9qxG7WRpNafRmciFrIxnm3d7_Of5LtPAx9mEhvSXW4-SOHOA_gcSGPbpyCU_Pkej9eRRcWutFc_-8q2j0_7davcVW_vK0fqhhyWcaS6lzQAkBmOS2VZZxzJoTNuFEsVUxbIdFgIeSBU5pJFCjBllpbtJwW6Sq6PWtPqc0Q3B7C0hyTm1Ny-gcyOlI3</recordid><startdate>20200419</startdate><enddate>20200419</enddate><creator>Danielczuk, Michael</creator><creator>Angelova, Anelia</creator><creator>Vanhoucke, Vincent</creator><creator>Goldberg, Ken</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20200419</creationdate><title>X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions</title><author>Danielczuk, Michael ; Angelova, Anelia ; Vanhoucke, Vincent ; Goldberg, Ken</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a679-70d6408aa75609bf1222144f52eb13b1df47cec847d64bd17c4c7af9ddfcf2083</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Danielczuk, Michael</creatorcontrib><creatorcontrib>Angelova, Anelia</creatorcontrib><creatorcontrib>Vanhoucke, Vincent</creatorcontrib><creatorcontrib>Goldberg, Ken</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Danielczuk, Michael</au><au>Angelova, Anelia</au><au>Vanhoucke, Vincent</au><au>Goldberg, Ken</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions</atitle><date>2020-04-19</date><risdate>2020</risdate><abstract>For applications in e-commerce, warehouses, healthcare, and home service,
robots are often required to search through heaps of objects to grasp a
specific target object. For mechanical search, we introduce X-Ray, an algorithm
based on learned occupancy distributions. We train a neural network using a
synthetic dataset of RGBD heap images labeled for a set of standard bounding
box targets with varying aspect ratios. X-Ray minimizes support of the learned
distribution as part of a mechanical search policy in both simulated and real
environments. We benchmark these policies against two baseline policies on
1,000 heaps of 15 objects in simulation where the target object is partially or
fully occluded. Results suggest that X-Ray is significantly more efficient, as
it succeeds in extracting the target object 82% of the time, 15% more often
than the best-performing baseline. Experiments on an ABB YuMi robot with 20
heaps of 25 household objects suggest that the learned policy transfers easily
to a physical system, where it outperforms baseline policies by 15% in success
rate with 17% fewer actions. Datasets, videos, and experiments are available at
https://sites.google.com/berkeley.edu/x-ray.</abstract><doi>10.48550/arxiv.2004.09039</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2004.09039 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2004_09039 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition Computer Science - Robotics |
title | X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T04%3A37%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=X-Ray:%20Mechanical%20Search%20for%20an%20Occluded%20Object%20by%20Minimizing%20Support%20of%20Learned%20Occupancy%20Distributions&rft.au=Danielczuk,%20Michael&rft.date=2020-04-19&rft_id=info:doi/10.48550/arxiv.2004.09039&rft_dat=%3Carxiv_GOX%3E2004_09039%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |