xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing
Reusing pre-collected data from different domains is an appealing solution for decision-making tasks that have insufficient data in the target domain but are relatively abundant in other related domains. Existing cross-domain policy transfer methods mostly aim at learning domain correspondences or c...
Gespeichert in:
Hauptverfasser: | , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Reusing pre-collected data from different domains is an appealing solution
for decision-making tasks that have insufficient data in the target domain but
are relatively abundant in other related domains. Existing cross-domain policy
transfer methods mostly aim at learning domain correspondences or corrections
to facilitate policy learning, such as learning domain/task-specific
discriminators, representations, or policies. This design philosophy often
results in heavy model architectures or task/domain-specific modeling, lacking
flexibility. This reality makes us wonder: can we directly bridge the domain
gaps universally at the data level, instead of relying on complex downstream
cross-domain policy transfer models? In this study, we propose the Cross-Domain
Trajectory EDiting (xTED) framework that employs a specially designed diffusion
model for cross-domain trajectory adaptation. Our proposed model architecture
effectively captures the intricate dependencies among states, actions, and
rewards, as well as the dynamics patterns within target data. By utilizing the
pre-trained diffusion as a prior, source domain trajectories can be transformed
to match with target domain properties while preserving original semantic
information. This process implicitly corrects underlying domain gaps, enhancing
state realism and dynamics reliability in the source data, and allowing
flexible incorporation with various downstream policy learning methods. Despite
its simplicity, xTED demonstrates superior performance in extensive simulation
and real-robot experiments. |
---|---|
DOI: | 10.48550/arxiv.2409.08687 |