XBG: End-to-End Imitation Learning for Autonomous Behaviour in Human-Robot Interaction and Collaboration

This letter presents XBG (eXteroceptive Behaviour Generation), a multimodal end-to-end Imitation Learning (IL) system for whole-body autonomous humanoid robots used in real-world Human-Robot Interaction (HRI) scenarios. The main contribution is an architecture for learning HRI behaviours using a dat...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE robotics and automation letters 2024-12, Vol.9 (12), p.11617-11624
Hauptverfasser:	Cardenas-Perez, Carlos, Romualdi, Giulio, Elobaid, Mohamed, Dafarra, Stefano, L'Erario, Giuseppe, Traversaro, Silvio, Morerio, Pietro, Bue, Alessio Del, Pucci, Daniele
Format:	Artikel
Sprache:	eng
Schlagworte:	AI-enabled robotics Artificial neural networks Feature extraction Human engineering Human-robot interaction Humanoid humanoid robot systems Humanoid robots Imitation learning learning from demonstration Legged locomotion Machine learning Robot learning Robot sensing systems Robots Sensors Solid modeling Synchronism Transformers Walking
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This letter presents XBG (eXteroceptive Behaviour Generation), a multimodal end-to-end Imitation Learning (IL) system for whole-body autonomous humanoid robots used in real-world Human-Robot Interaction (HRI) scenarios. The main contribution is an architecture for learning HRI behaviours using a data-driven approach. A diverse dataset is collected via teleoperation, covering multiple HRI scenarios, such as handshaking, handwaving, payload reception, walking, and walking with a payload. After synchronizing, filtering, and transforming the data, we show how to train the presented Deep Neural Networks (DNN), integrating exteroceptive and proprioceptive information to help the robot understand both its environment and its actions. The robot takes in sequences of images (RGB and depth) and joints state information to react accordingly. By fusing multimodal signals over time, the model enables autonomous capabilities in a robotic platform. The models are evaluated based on the success rates in the mentioned HRI scenarios and they are deployed on the ergoCub humanoid robot. XBG achieves success rates between 60% and 100% even when tested in unseen environments.
ISSN:	2377-3766 2377-3766
DOI:	10.1109/LRA.2024.3495577