X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions

For applications in e-commerce, warehouses, healthcare, and home service, robots are often required to search through heaps of objects to grasp a specific target object. For mechanical search, we introduce X-Ray, an algorithm based on learned occupancy distributions. We train a neural network using...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Danielczuk, Michael, Angelova, Anelia, Vanhoucke, Vincent, Goldberg, Ken
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Danielczuk, Michael
Angelova, Anelia
Vanhoucke, Vincent
Goldberg, Ken
description For applications in e-commerce, warehouses, healthcare, and home service, robots are often required to search through heaps of objects to grasp a specific target object. For mechanical search, we introduce X-Ray, an algorithm based on learned occupancy distributions. We train a neural network using a synthetic dataset of RGBD heap images labeled for a set of standard bounding box targets with varying aspect ratios. X-Ray minimizes support of the learned distribution as part of a mechanical search policy in both simulated and real environments. We benchmark these policies against two baseline policies on 1,000 heaps of 15 objects in simulation where the target object is partially or fully occluded. Results suggest that X-Ray is significantly more efficient, as it succeeds in extracting the target object 82% of the time, 15% more often than the best-performing baseline. Experiments on an ABB YuMi robot with 20 heaps of 25 household objects suggest that the learned policy transfers easily to a physical system, where it outperforms baseline policies by 15% in success rate with 17% fewer actions. Datasets, videos, and experiments are available at https://sites.google.com/berkeley.edu/x-ray.
doi_str_mv 10.48550/arxiv.2004.09039
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2004_09039</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2004_09039</sourcerecordid><originalsourceid>FETCH-LOGICAL-a679-70d6408aa75609bf1222144f52eb13b1df47cec847d64bd17c4c7af9ddfcf2083</originalsourceid><addsrcrecordid>eNotj8tOhDAYRtm4MKMP4Mq-ANiWQsGdGa8JExJnFu7I37-t1DCFFDDi0zsXV9_mfCc5UXTDaCKKLKN3EH7cd8IpFQktaVpeRu1H_A7LPdkYbME7hI5sDQRsie0DAU9qxG7WRpNafRmciFrIxnm3d7_Of5LtPAx9mEhvSXW4-SOHOA_gcSGPbpyCU_Pkej9eRRcWutFc_-8q2j0_7davcVW_vK0fqhhyWcaS6lzQAkBmOS2VZZxzJoTNuFEsVUxbIdFgIeSBU5pJFCjBllpbtJwW6Sq6PWtPqc0Q3B7C0hyTm1Ny-gcyOlI3</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions</title><source>arXiv.org</source><creator>Danielczuk, Michael ; Angelova, Anelia ; Vanhoucke, Vincent ; Goldberg, Ken</creator><creatorcontrib>Danielczuk, Michael ; Angelova, Anelia ; Vanhoucke, Vincent ; Goldberg, Ken</creatorcontrib><description>For applications in e-commerce, warehouses, healthcare, and home service, robots are often required to search through heaps of objects to grasp a specific target object. For mechanical search, we introduce X-Ray, an algorithm based on learned occupancy distributions. We train a neural network using a synthetic dataset of RGBD heap images labeled for a set of standard bounding box targets with varying aspect ratios. X-Ray minimizes support of the learned distribution as part of a mechanical search policy in both simulated and real environments. We benchmark these policies against two baseline policies on 1,000 heaps of 15 objects in simulation where the target object is partially or fully occluded. Results suggest that X-Ray is significantly more efficient, as it succeeds in extracting the target object 82% of the time, 15% more often than the best-performing baseline. Experiments on an ABB YuMi robot with 20 heaps of 25 household objects suggest that the learned policy transfers easily to a physical system, where it outperforms baseline policies by 15% in success rate with 17% fewer actions. Datasets, videos, and experiments are available at https://sites.google.com/berkeley.edu/x-ray.</description><identifier>DOI: 10.48550/arxiv.2004.09039</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Robotics</subject><creationdate>2020-04</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2004.09039$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2004.09039$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Danielczuk, Michael</creatorcontrib><creatorcontrib>Angelova, Anelia</creatorcontrib><creatorcontrib>Vanhoucke, Vincent</creatorcontrib><creatorcontrib>Goldberg, Ken</creatorcontrib><title>X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions</title><description>For applications in e-commerce, warehouses, healthcare, and home service, robots are often required to search through heaps of objects to grasp a specific target object. For mechanical search, we introduce X-Ray, an algorithm based on learned occupancy distributions. We train a neural network using a synthetic dataset of RGBD heap images labeled for a set of standard bounding box targets with varying aspect ratios. X-Ray minimizes support of the learned distribution as part of a mechanical search policy in both simulated and real environments. We benchmark these policies against two baseline policies on 1,000 heaps of 15 objects in simulation where the target object is partially or fully occluded. Results suggest that X-Ray is significantly more efficient, as it succeeds in extracting the target object 82% of the time, 15% more often than the best-performing baseline. Experiments on an ABB YuMi robot with 20 heaps of 25 household objects suggest that the learned policy transfers easily to a physical system, where it outperforms baseline policies by 15% in success rate with 17% fewer actions. Datasets, videos, and experiments are available at https://sites.google.com/berkeley.edu/x-ray.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOhDAYRtm4MKMP4Mq-ANiWQsGdGa8JExJnFu7I37-t1DCFFDDi0zsXV9_mfCc5UXTDaCKKLKN3EH7cd8IpFQktaVpeRu1H_A7LPdkYbME7hI5sDQRsie0DAU9qxG7WRpNafRmciFrIxnm3d7_Of5LtPAx9mEhvSXW4-SOHOA_gcSGPbpyCU_Pkej9eRRcWutFc_-8q2j0_7davcVW_vK0fqhhyWcaS6lzQAkBmOS2VZZxzJoTNuFEsVUxbIdFgIeSBU5pJFCjBllpbtJwW6Sq6PWtPqc0Q3B7C0hyTm1Ny-gcyOlI3</recordid><startdate>20200419</startdate><enddate>20200419</enddate><creator>Danielczuk, Michael</creator><creator>Angelova, Anelia</creator><creator>Vanhoucke, Vincent</creator><creator>Goldberg, Ken</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20200419</creationdate><title>X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions</title><author>Danielczuk, Michael ; Angelova, Anelia ; Vanhoucke, Vincent ; Goldberg, Ken</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a679-70d6408aa75609bf1222144f52eb13b1df47cec847d64bd17c4c7af9ddfcf2083</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Danielczuk, Michael</creatorcontrib><creatorcontrib>Angelova, Anelia</creatorcontrib><creatorcontrib>Vanhoucke, Vincent</creatorcontrib><creatorcontrib>Goldberg, Ken</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Danielczuk, Michael</au><au>Angelova, Anelia</au><au>Vanhoucke, Vincent</au><au>Goldberg, Ken</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions</atitle><date>2020-04-19</date><risdate>2020</risdate><abstract>For applications in e-commerce, warehouses, healthcare, and home service, robots are often required to search through heaps of objects to grasp a specific target object. For mechanical search, we introduce X-Ray, an algorithm based on learned occupancy distributions. We train a neural network using a synthetic dataset of RGBD heap images labeled for a set of standard bounding box targets with varying aspect ratios. X-Ray minimizes support of the learned distribution as part of a mechanical search policy in both simulated and real environments. We benchmark these policies against two baseline policies on 1,000 heaps of 15 objects in simulation where the target object is partially or fully occluded. Results suggest that X-Ray is significantly more efficient, as it succeeds in extracting the target object 82% of the time, 15% more often than the best-performing baseline. Experiments on an ABB YuMi robot with 20 heaps of 25 household objects suggest that the learned policy transfers easily to a physical system, where it outperforms baseline policies by 15% in success rate with 17% fewer actions. Datasets, videos, and experiments are available at https://sites.google.com/berkeley.edu/x-ray.</abstract><doi>10.48550/arxiv.2004.09039</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2004.09039
ispartof
issn
language eng
recordid cdi_arxiv_primary_2004_09039
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
Computer Science - Robotics
title X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T04%3A37%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=X-Ray:%20Mechanical%20Search%20for%20an%20Occluded%20Object%20by%20Minimizing%20Support%20of%20Learned%20Occupancy%20Distributions&rft.au=Danielczuk,%20Michael&rft.date=2020-04-19&rft_id=info:doi/10.48550/arxiv.2004.09039&rft_dat=%3Carxiv_GOX%3E2004_09039%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true