Graspability-Aware Object Pose Estimation in Cluttered Scenes
Published in: | IEEE Robotics and Automation Letters, 2024-04, Vol. 9 (4), p. 3124-3130 |
---|---|
Authors: | , , , , , , , , , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Online access: | Order full text |
Abstract: | Object recognition and pose estimation are critical components in autonomous robot manipulation systems, enabling robots to interact effectively with their environment. During actual execution, the robot must recognize the object in the current scene, estimate its pose, and then select a feasible grasp pose from the pre-defined grasp configurations. While most existing methods primarily focus on pose estimation, they often neglect graspability and reachability. This oversight can lead to inefficiencies and failures during execution. In this study, we introduce an innovative graspability-aware object pose estimation framework. Our proposed approach not only estimates the poses of multiple objects in cluttered scenes but also identifies graspable areas. This enables the system to concentrate its efforts on specific points or regions of an object that are suitable for grasping. It leverages both depth and color images to extract geometric and appearance features. To effectively combine these diverse features, we have developed an adaptive fusion module. In addition, the fused features are further enhanced through a graspability-aware feature enhancement module. The key innovation of our method lies in improving the discriminability and robustness of the features used for object pose estimation. We have achieved state-of-the-art results on public datasets when compared to several baseline methods. In real robot experiments conducted on a Franka Emika robot arm equipped with an Intel RealSense camera and a two-finger gripper, we consistently achieved high success rates, even in cluttered scenes. |
---|---|
ISSN: | 2377-3766 |
DOI: | 10.1109/LRA.2024.3364451 |