M2Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes
Recent advances in diffusion models have opened new avenues for research into embodied AI agents and robotics. Despite significant achievements in complex robotic locomotion and skills, mobile manipulation-a capability that requires the coordination of navigation and manipulation-remains a challenge...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Recent advances in diffusion models have opened new avenues for research into
embodied AI agents and robotics. Despite significant achievements in complex
robotic locomotion and skills, mobile manipulation-a capability that requires
the coordination of navigation and manipulation-remains a challenge for
generative AI techniques. This is primarily due to the high-dimensional action
space, extended motion trajectories, and interactions with the surrounding
environment. In this paper, we introduce M2Diffuser, a diffusion-based,
scene-conditioned generative model that directly generates coordinated and
efficient whole-body motion trajectories for mobile manipulation based on
robot-centric 3D scans. M2Diffuser first learns trajectory-level distributions
from mobile manipulation trajectories provided by an expert planner. Crucially,
it incorporates an optimization module that can flexibly accommodate physical
constraints and task objectives, modeled as cost and energy functions, during
the inference process. This enables the reduction of physical violations and
execution errors at each denoising step in a fully differentiable manner.
Through benchmarking on three types of mobile manipulation tasks across over 20
scenes, we demonstrate that M2Diffuser outperforms state-of-the-art neural
planners and successfully transfers the generated trajectories to a real-world
robot. Our evaluations underscore the potential of generative AI to enhance the
generalization of traditional planning and learning-based robotic methods,
while also highlighting the critical role of enforcing physical constraints for
safe and robust execution. |
---|---|
DOI: | 10.48550/arxiv.2410.11402 |