AffordDP: Generalizable Diffusion Policy with Transferable Affordance
Diffusion-based policies have shown impressive performance in robotic manipulation tasks while struggling with out-of-domain distributions. Recent efforts attempted to enhance generalization by improving the visual feature encoding for diffusion policy. However, their generalization is typically lim...
Gespeichert in:
Hauptverfasser: | , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Diffusion-based policies have shown impressive performance in robotic
manipulation tasks while struggling with out-of-domain distributions. Recent
efforts attempted to enhance generalization by improving the visual feature
encoding for diffusion policy. However, their generalization is typically
limited to the same category with similar appearances. Our key insight is that
leveraging affordances--manipulation priors that define "where" and "how" an
agent interacts with an object--can substantially enhance generalization to
entirely unseen object instances and categories. We introduce the Diffusion
Policy with transferable Affordance (AffordDP), designed for generalizable
manipulation across novel categories. AffordDP models affordances through 3D
contact points and post-contact trajectories, capturing the essential static
and dynamic information for complex tasks. The transferable affordance from
in-domain data to unseen objects is achieved by estimating a 6D transformation
matrix using foundational vision models and point cloud registration
techniques. More importantly, we incorporate affordance guidance during
diffusion sampling that can refine action sequence generation. This guidance
directs the generated action to gradually move towards the desired manipulation
for unseen objects while keeping the generated action within the manifold of
action space. Experimental results from both simulated and real-world
environments demonstrate that AffordDP consistently outperforms previous
diffusion-based methods, successfully generalizing to unseen instances and
categories where others fail. |
---|---|
DOI: | 10.48550/arxiv.2412.03142 |