FAST GDRNPP: Improving the Speed of State-of-the-Art 6D Object Pose Estimation

6D object pose estimation involves determining the three-dimensional translation and rotation of an object within a scene and relative to a chosen coordinate system. This problem is of particular interest for many practical applications in industrial tasks such as quality control, bin picking, and r...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2024-09
Hauptverfasser:	Pöllabauer, Thomas, Ashwin Pramod, Knauthe, Volker, Wahl, Michael
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Configuration management Coordinates Deep learning Industrial applications Inference Knowledge management Pose estimation Quality control Robot control
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Pöllabauer, Thomas Ashwin Pramod Knauthe, Volker Wahl, Michael
description	6D object pose estimation involves determining the three-dimensional translation and rotation of an object within a scene and relative to a chosen coordinate system. This problem is of particular interest for many practical applications in industrial tasks such as quality control, bin picking, and robotic manipulation, where both speed and accuracy are critical for real-world deployment. Current models, both classical and deep-learning-based, often struggle with the trade-off between accuracy and latency. Our research focuses on enhancing the speed of a prominent state-of-the-art deep learning model, GDRNPP, while keeping its high accuracy. We employ several techniques to reduce the model size and improve inference time. These techniques include using smaller and quicker backbones, pruning unnecessary parameters, and distillation to transfer knowledge from a large, high-performing model to a smaller, more efficient student model. Our findings demonstrate that the proposed configuration maintains accuracy comparable to the state-of-the-art while significantly improving inference time. This advancement could lead to more efficient and practical applications in various industrial scenarios, thereby enhancing the overall applicability of 6D Object Pose Estimation models in real-world settings.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3107311742</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3107311742</sourcerecordid><originalsourceid>FETCH-proquest_journals_31073117423</originalsourceid><addsrcrecordid>eNqNjM0KgkAURocgSMp3uNB6YJzxJ9pJarUpSfdidS0lHXOuPX8ueoBWH5xz-GbMkko5fONKuWC2MY0QQvqB9DxlsVMSZjnso8spTbdwbPtBf-ruAfREyHrEO-gKMioJua74RHk4EPgRnK8N3ghSbRBiQ3VbUq27FZtX5cug_dslWydxvjvw6fc9oqGi0ePQTapQjgiU4wSuVP9VX19hO4w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3107311742</pqid></control><display><type>article</type><title>FAST GDRNPP: Improving the Speed of State-of-the-Art 6D Object Pose Estimation</title><source>Free E- Journals</source><creator>Pöllabauer, Thomas ; Ashwin Pramod ; Knauthe, Volker ; Wahl, Michael</creator><creatorcontrib>Pöllabauer, Thomas ; Ashwin Pramod ; Knauthe, Volker ; Wahl, Michael</creatorcontrib><description>6D object pose estimation involves determining the three-dimensional translation and rotation of an object within a scene and relative to a chosen coordinate system. This problem is of particular interest for many practical applications in industrial tasks such as quality control, bin picking, and robotic manipulation, where both speed and accuracy are critical for real-world deployment. Current models, both classical and deep-learning-based, often struggle with the trade-off between accuracy and latency. Our research focuses on enhancing the speed of a prominent state-of-the-art deep learning model, GDRNPP, while keeping its high accuracy. We employ several techniques to reduce the model size and improve inference time. These techniques include using smaller and quicker backbones, pruning unnecessary parameters, and distillation to transfer knowledge from a large, high-performing model to a smaller, more efficient student model. Our findings demonstrate that the proposed configuration maintains accuracy comparable to the state-of-the-art while significantly improving inference time. This advancement could lead to more efficient and practical applications in various industrial scenarios, thereby enhancing the overall applicability of 6D Object Pose Estimation models in real-world settings.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Accuracy ; Configuration management ; Coordinates ; Deep learning ; Industrial applications ; Inference ; Knowledge management ; Pose estimation ; Quality control ; Robot control</subject><ispartof>arXiv.org, 2024-09</ispartof><rights>2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,784</link.rule.ids></links><search><creatorcontrib>Pöllabauer, Thomas</creatorcontrib><creatorcontrib>Ashwin Pramod</creatorcontrib><creatorcontrib>Knauthe, Volker</creatorcontrib><creatorcontrib>Wahl, Michael</creatorcontrib><title>FAST GDRNPP: Improving the Speed of State-of-the-Art 6D Object Pose Estimation</title><title>arXiv.org</title><description>6D object pose estimation involves determining the three-dimensional translation and rotation of an object within a scene and relative to a chosen coordinate system. This problem is of particular interest for many practical applications in industrial tasks such as quality control, bin picking, and robotic manipulation, where both speed and accuracy are critical for real-world deployment. Current models, both classical and deep-learning-based, often struggle with the trade-off between accuracy and latency. Our research focuses on enhancing the speed of a prominent state-of-the-art deep learning model, GDRNPP, while keeping its high accuracy. We employ several techniques to reduce the model size and improve inference time. These techniques include using smaller and quicker backbones, pruning unnecessary parameters, and distillation to transfer knowledge from a large, high-performing model to a smaller, more efficient student model. Our findings demonstrate that the proposed configuration maintains accuracy comparable to the state-of-the-art while significantly improving inference time. This advancement could lead to more efficient and practical applications in various industrial scenarios, thereby enhancing the overall applicability of 6D Object Pose Estimation models in real-world settings.</description><subject>Accuracy</subject><subject>Configuration management</subject><subject>Coordinates</subject><subject>Deep learning</subject><subject>Industrial applications</subject><subject>Inference</subject><subject>Knowledge management</subject><subject>Pose estimation</subject><subject>Quality control</subject><subject>Robot control</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjM0KgkAURocgSMp3uNB6YJzxJ9pJarUpSfdidS0lHXOuPX8ueoBWH5xz-GbMkko5fONKuWC2MY0QQvqB9DxlsVMSZjnso8spTbdwbPtBf-ruAfREyHrEO-gKMioJua74RHk4EPgRnK8N3ghSbRBiQ3VbUq27FZtX5cug_dslWydxvjvw6fc9oqGi0ePQTapQjgiU4wSuVP9VX19hO4w</recordid><startdate>20240918</startdate><enddate>20240918</enddate><creator>Pöllabauer, Thomas</creator><creator>Ashwin Pramod</creator><creator>Knauthe, Volker</creator><creator>Wahl, Michael</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240918</creationdate><title>FAST GDRNPP: Improving the Speed of State-of-the-Art 6D Object Pose Estimation</title><author>Pöllabauer, Thomas ; Ashwin Pramod ; Knauthe, Volker ; Wahl, Michael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_31073117423</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Configuration management</topic><topic>Coordinates</topic><topic>Deep learning</topic><topic>Industrial applications</topic><topic>Inference</topic><topic>Knowledge management</topic><topic>Pose estimation</topic><topic>Quality control</topic><topic>Robot control</topic><toplevel>online_resources</toplevel><creatorcontrib>Pöllabauer, Thomas</creatorcontrib><creatorcontrib>Ashwin Pramod</creatorcontrib><creatorcontrib>Knauthe, Volker</creatorcontrib><creatorcontrib>Wahl, Michael</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pöllabauer, Thomas</au><au>Ashwin Pramod</au><au>Knauthe, Volker</au><au>Wahl, Michael</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>FAST GDRNPP: Improving the Speed of State-of-the-Art 6D Object Pose Estimation</atitle><jtitle>arXiv.org</jtitle><date>2024-09-18</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>6D object pose estimation involves determining the three-dimensional translation and rotation of an object within a scene and relative to a chosen coordinate system. This problem is of particular interest for many practical applications in industrial tasks such as quality control, bin picking, and robotic manipulation, where both speed and accuracy are critical for real-world deployment. Current models, both classical and deep-learning-based, often struggle with the trade-off between accuracy and latency. Our research focuses on enhancing the speed of a prominent state-of-the-art deep learning model, GDRNPP, while keeping its high accuracy. We employ several techniques to reduce the model size and improve inference time. These techniques include using smaller and quicker backbones, pruning unnecessary parameters, and distillation to transfer knowledge from a large, high-performing model to a smaller, more efficient student model. Our findings demonstrate that the proposed configuration maintains accuracy comparable to the state-of-the-art while significantly improving inference time. This advancement could lead to more efficient and practical applications in various industrial scenarios, thereby enhancing the overall applicability of 6D Object Pose Estimation models in real-world settings.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2024-09
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_3107311742
source	Free E- Journals
subjects	Accuracy Configuration management Coordinates Deep learning Industrial applications Inference Knowledge management Pose estimation Quality control Robot control
title	FAST GDRNPP: Improving the Speed of State-of-the-Art 6D Object Pose Estimation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T18%3A29%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=FAST%20GDRNPP:%20Improving%20the%20Speed%20of%20State-of-the-Art%206D%20Object%20Pose%20Estimation&rft.jtitle=arXiv.org&rft.au=P%C3%B6llabauer,%20Thomas&rft.date=2024-09-18&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3107311742%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3107311742&rft_id=info:pmid/&rfr_iscdi=true