Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes

Viewpoint independent recognition of free-form objects and their segmentation in the presence of clutter and occlusions is a challenging task. We present a novel 3D model-based algorithm which performs this task automatically and efficiently. A 3D model of an object is automatically constructed offl...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence 2006-10, Vol.28 (10), p.1584-1601
Hauptverfasser: Mian, A.S., Bennamoun, M., Owens, R.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1601
container_issue 10
container_start_page 1584
container_title IEEE transactions on pattern analysis and machine intelligence
container_volume 28
creator Mian, A.S.
Bennamoun, M.
Owens, R.
description Viewpoint independent recognition of free-form objects and their segmentation in the presence of clutter and occlusions is a challenging task. We present a novel 3D model-based algorithm which performs this task automatically and efficiently. A 3D model of an object is automatically constructed offline from its multiple unordered range images (views). These views are converted into multidimensional table representations (which we refer to as tensors). Correspondences are automatically established between these views by simultaneously matching the tensors of a view with those of the remaining views using a hash table-based voting scheme. This results in a graph of relative transformations used to register the views before they are integrated into a seamless 3D model. These models and their tensor representations constitute the model library. During online recognition, a tensor from the scene is simultaneously matched with those in the library by casting votes. Similarity measures are calculated for the model tensors which receive the most votes. The model with the highest similarity is transformed to the scene and, if it aligns accurately with an object in the scene, that object is declared as recognized and is segmented. This process is repeated until the scene is completely segmented. Experiments were performed on real and synthetic data comprised of 55 models and 610 scenes and an overall recognition rate of 95 percent was achieved. Comparison with the spin images revealed that our algorithm is superior in terms of recognition rate and efficiency
doi_str_mv 10.1109/TPAMI.2006.213
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_pubmed_primary_16986541</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1677516</ieee_id><sourcerecordid>896193175</sourcerecordid><originalsourceid>FETCH-LOGICAL-c447t-91d0cc4d9dc2bd579a8d60464fb90f268b946e017504b96cf45cc0e87a0718eb3</originalsourceid><addsrcrecordid>eNp90ctLxDAQBvAgiq6PqxdBiqCeuk7aPI-6PkFRdD2XNJlql26rSXvwvze6C4oHT4HhNx9hPkJ2KYwpBX0yfTi9uxlnAGKc0XyFjKjOdZrzXK-SEVCRpUplaoNshjADoIxDvk42qNBKcEZH5Hn66hHT83qObai71jTJXeewSc9MQJfclzO0ffKItntp6z6CxLQuecKX6HvzPajbZNIMfY8-LjxZbDFsk7XKNAF3lu8Web68mE6u09v7q5vJ6W1qGZN9qqkDa5nTzmal41Ib5QQwwapSQ5UJVWomEKjkwEotbMW4tYBKGpBUYZlvkeNF7pvv3gcMfTGvg8WmMS12QyiUFvEecT_Ko3-lUEponrMID_7AWTf4eJeYJrikVEoV0XiBrO9C8FgVb76eG_9RUCi-eim-eym-eiliL3Fhf5k6lHN0P3xZRASHS2CCNU3lTWvr8OMUSBb_GN3ewtWI-CtGSk5F_gly_p0W</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>865711778</pqid></control><display><type>article</type><title>Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes</title><source>IEEE Electronic Library (IEL)</source><creator>Mian, A.S. ; Bennamoun, M. ; Owens, R.</creator><creatorcontrib>Mian, A.S. ; Bennamoun, M. ; Owens, R.</creatorcontrib><description>Viewpoint independent recognition of free-form objects and their segmentation in the presence of clutter and occlusions is a challenging task. We present a novel 3D model-based algorithm which performs this task automatically and efficiently. A 3D model of an object is automatically constructed offline from its multiple unordered range images (views). These views are converted into multidimensional table representations (which we refer to as tensors). Correspondences are automatically established between these views by simultaneously matching the tensors of a view with those of the remaining views using a hash table-based voting scheme. This results in a graph of relative transformations used to register the views before they are integrated into a seamless 3D model. These models and their tensor representations constitute the model library. During online recognition, a tensor from the scene is simultaneously matched with those in the library by casting votes. Similarity measures are calculated for the model tensors which receive the most votes. The model with the highest similarity is transformed to the scene and, if it aligns accurately with an object in the scene, that object is declared as recognized and is segmented. This process is repeated until the scene is completely segmented. Experiments were performed on real and synthetic data comprised of 55 models and 610 scenes and an overall recognition rate of 95 percent was achieved. Comparison with the spin images revealed that our algorithm is superior in terms of recognition rate and efficiency</description><identifier>ISSN: 0162-8828</identifier><identifier>EISSN: 1939-3539</identifier><identifier>EISSN: 2160-9292</identifier><identifier>DOI: 10.1109/TPAMI.2006.213</identifier><identifier>PMID: 16986541</identifier><identifier>CODEN: ITPIDJ</identifier><language>eng</language><publisher>Los Alamitos, CA: IEEE</publisher><subject>3D object recognition ; 3D representation ; Algorithms ; Applied sciences ; Artificial Intelligence ; Casting ; Cluster Analysis ; Computer science; control theory; systems ; Exact sciences and technology ; geometric hashing ; Image converters ; Image Enhancement - methods ; Image Interpretation, Computer-Assisted - methods ; Image recognition ; Image segmentation ; Imaging, Three-Dimensional - methods ; Information Storage and Retrieval - methods ; Layout ; Libraries ; Mathematical analysis ; Multidimensional systems ; Multiview correspondence ; Object recognition ; Pattern Recognition, Automated - methods ; Pattern recognition. Digital image processing. Computational geometry ; Recognition ; registration ; Segmentation ; shape descriptor ; Tensile stress ; Tensors ; Three dimensional ; Voting</subject><ispartof>IEEE transactions on pattern analysis and machine intelligence, 2006-10, Vol.28 (10), p.1584-1601</ispartof><rights>2006 INIST-CNRS</rights><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2006</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c447t-91d0cc4d9dc2bd579a8d60464fb90f268b946e017504b96cf45cc0e87a0718eb3</citedby><cites>FETCH-LOGICAL-c447t-91d0cc4d9dc2bd579a8d60464fb90f268b946e017504b96cf45cc0e87a0718eb3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1677516$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>315,782,786,798,27933,27934,54767</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1677516$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=18074688$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/16986541$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Mian, A.S.</creatorcontrib><creatorcontrib>Bennamoun, M.</creatorcontrib><creatorcontrib>Owens, R.</creatorcontrib><title>Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes</title><title>IEEE transactions on pattern analysis and machine intelligence</title><addtitle>TPAMI</addtitle><addtitle>IEEE Trans Pattern Anal Mach Intell</addtitle><description>Viewpoint independent recognition of free-form objects and their segmentation in the presence of clutter and occlusions is a challenging task. We present a novel 3D model-based algorithm which performs this task automatically and efficiently. A 3D model of an object is automatically constructed offline from its multiple unordered range images (views). These views are converted into multidimensional table representations (which we refer to as tensors). Correspondences are automatically established between these views by simultaneously matching the tensors of a view with those of the remaining views using a hash table-based voting scheme. This results in a graph of relative transformations used to register the views before they are integrated into a seamless 3D model. These models and their tensor representations constitute the model library. During online recognition, a tensor from the scene is simultaneously matched with those in the library by casting votes. Similarity measures are calculated for the model tensors which receive the most votes. The model with the highest similarity is transformed to the scene and, if it aligns accurately with an object in the scene, that object is declared as recognized and is segmented. This process is repeated until the scene is completely segmented. Experiments were performed on real and synthetic data comprised of 55 models and 610 scenes and an overall recognition rate of 95 percent was achieved. Comparison with the spin images revealed that our algorithm is superior in terms of recognition rate and efficiency</description><subject>3D object recognition</subject><subject>3D representation</subject><subject>Algorithms</subject><subject>Applied sciences</subject><subject>Artificial Intelligence</subject><subject>Casting</subject><subject>Cluster Analysis</subject><subject>Computer science; control theory; systems</subject><subject>Exact sciences and technology</subject><subject>geometric hashing</subject><subject>Image converters</subject><subject>Image Enhancement - methods</subject><subject>Image Interpretation, Computer-Assisted - methods</subject><subject>Image recognition</subject><subject>Image segmentation</subject><subject>Imaging, Three-Dimensional - methods</subject><subject>Information Storage and Retrieval - methods</subject><subject>Layout</subject><subject>Libraries</subject><subject>Mathematical analysis</subject><subject>Multidimensional systems</subject><subject>Multiview correspondence</subject><subject>Object recognition</subject><subject>Pattern Recognition, Automated - methods</subject><subject>Pattern recognition. Digital image processing. Computational geometry</subject><subject>Recognition</subject><subject>registration</subject><subject>Segmentation</subject><subject>shape descriptor</subject><subject>Tensile stress</subject><subject>Tensors</subject><subject>Three dimensional</subject><subject>Voting</subject><issn>0162-8828</issn><issn>1939-3539</issn><issn>2160-9292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2006</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><sourceid>EIF</sourceid><recordid>eNp90ctLxDAQBvAgiq6PqxdBiqCeuk7aPI-6PkFRdD2XNJlql26rSXvwvze6C4oHT4HhNx9hPkJ2KYwpBX0yfTi9uxlnAGKc0XyFjKjOdZrzXK-SEVCRpUplaoNshjADoIxDvk42qNBKcEZH5Hn66hHT83qObai71jTJXeewSc9MQJfclzO0ffKItntp6z6CxLQuecKX6HvzPajbZNIMfY8-LjxZbDFsk7XKNAF3lu8Web68mE6u09v7q5vJ6W1qGZN9qqkDa5nTzmal41Ib5QQwwapSQ5UJVWomEKjkwEotbMW4tYBKGpBUYZlvkeNF7pvv3gcMfTGvg8WmMS12QyiUFvEecT_Ko3-lUEponrMID_7AWTf4eJeYJrikVEoV0XiBrO9C8FgVb76eG_9RUCi-eim-eym-eiliL3Fhf5k6lHN0P3xZRASHS2CCNU3lTWvr8OMUSBb_GN3ewtWI-CtGSk5F_gly_p0W</recordid><startdate>20061001</startdate><enddate>20061001</enddate><creator>Mian, A.S.</creator><creator>Bennamoun, M.</creator><creator>Owens, R.</creator><general>IEEE</general><general>IEEE Computer Society</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><scope>F28</scope><scope>FR3</scope></search><sort><creationdate>20061001</creationdate><title>Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes</title><author>Mian, A.S. ; Bennamoun, M. ; Owens, R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c447t-91d0cc4d9dc2bd579a8d60464fb90f268b946e017504b96cf45cc0e87a0718eb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2006</creationdate><topic>3D object recognition</topic><topic>3D representation</topic><topic>Algorithms</topic><topic>Applied sciences</topic><topic>Artificial Intelligence</topic><topic>Casting</topic><topic>Cluster Analysis</topic><topic>Computer science; control theory; systems</topic><topic>Exact sciences and technology</topic><topic>geometric hashing</topic><topic>Image converters</topic><topic>Image Enhancement - methods</topic><topic>Image Interpretation, Computer-Assisted - methods</topic><topic>Image recognition</topic><topic>Image segmentation</topic><topic>Imaging, Three-Dimensional - methods</topic><topic>Information Storage and Retrieval - methods</topic><topic>Layout</topic><topic>Libraries</topic><topic>Mathematical analysis</topic><topic>Multidimensional systems</topic><topic>Multiview correspondence</topic><topic>Object recognition</topic><topic>Pattern Recognition, Automated - methods</topic><topic>Pattern recognition. Digital image processing. Computational geometry</topic><topic>Recognition</topic><topic>registration</topic><topic>Segmentation</topic><topic>shape descriptor</topic><topic>Tensile stress</topic><topic>Tensors</topic><topic>Three dimensional</topic><topic>Voting</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mian, A.S.</creatorcontrib><creatorcontrib>Bennamoun, M.</creatorcontrib><creatorcontrib>Owens, R.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><jtitle>IEEE transactions on pattern analysis and machine intelligence</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Mian, A.S.</au><au>Bennamoun, M.</au><au>Owens, R.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes</atitle><jtitle>IEEE transactions on pattern analysis and machine intelligence</jtitle><stitle>TPAMI</stitle><addtitle>IEEE Trans Pattern Anal Mach Intell</addtitle><date>2006-10-01</date><risdate>2006</risdate><volume>28</volume><issue>10</issue><spage>1584</spage><epage>1601</epage><pages>1584-1601</pages><issn>0162-8828</issn><eissn>1939-3539</eissn><eissn>2160-9292</eissn><coden>ITPIDJ</coden><abstract>Viewpoint independent recognition of free-form objects and their segmentation in the presence of clutter and occlusions is a challenging task. We present a novel 3D model-based algorithm which performs this task automatically and efficiently. A 3D model of an object is automatically constructed offline from its multiple unordered range images (views). These views are converted into multidimensional table representations (which we refer to as tensors). Correspondences are automatically established between these views by simultaneously matching the tensors of a view with those of the remaining views using a hash table-based voting scheme. This results in a graph of relative transformations used to register the views before they are integrated into a seamless 3D model. These models and their tensor representations constitute the model library. During online recognition, a tensor from the scene is simultaneously matched with those in the library by casting votes. Similarity measures are calculated for the model tensors which receive the most votes. The model with the highest similarity is transformed to the scene and, if it aligns accurately with an object in the scene, that object is declared as recognized and is segmented. This process is repeated until the scene is completely segmented. Experiments were performed on real and synthetic data comprised of 55 models and 610 scenes and an overall recognition rate of 95 percent was achieved. Comparison with the spin images revealed that our algorithm is superior in terms of recognition rate and efficiency</abstract><cop>Los Alamitos, CA</cop><pub>IEEE</pub><pmid>16986541</pmid><doi>10.1109/TPAMI.2006.213</doi><tpages>18</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 0162-8828
ispartof IEEE transactions on pattern analysis and machine intelligence, 2006-10, Vol.28 (10), p.1584-1601
issn 0162-8828
1939-3539
2160-9292
language eng
recordid cdi_pubmed_primary_16986541
source IEEE Electronic Library (IEL)
subjects 3D object recognition
3D representation
Algorithms
Applied sciences
Artificial Intelligence
Casting
Cluster Analysis
Computer science
control theory
systems
Exact sciences and technology
geometric hashing
Image converters
Image Enhancement - methods
Image Interpretation, Computer-Assisted - methods
Image recognition
Image segmentation
Imaging, Three-Dimensional - methods
Information Storage and Retrieval - methods
Layout
Libraries
Mathematical analysis
Multidimensional systems
Multiview correspondence
Object recognition
Pattern Recognition, Automated - methods
Pattern recognition. Digital image processing. Computational geometry
Recognition
registration
Segmentation
shape descriptor
Tensile stress
Tensors
Three dimensional
Voting
title Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-11-29T10%3A47%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Three-Dimensional%20Model-Based%20Object%20Recognition%20and%20Segmentation%20in%20Cluttered%20Scenes&rft.jtitle=IEEE%20transactions%20on%20pattern%20analysis%20and%20machine%20intelligence&rft.au=Mian,%20A.S.&rft.date=2006-10-01&rft.volume=28&rft.issue=10&rft.spage=1584&rft.epage=1601&rft.pages=1584-1601&rft.issn=0162-8828&rft.eissn=1939-3539&rft.coden=ITPIDJ&rft_id=info:doi/10.1109/TPAMI.2006.213&rft_dat=%3Cproquest_RIE%3E896193175%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=865711778&rft_id=info:pmid/16986541&rft_ieee_id=1677516&rfr_iscdi=true