xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing

Reusing pre-collected data from different domains is an appealing solution for decision-making tasks that have insufficient data in the target domain but are relatively abundant in other related domains. Existing cross-domain policy transfer methods mostly aim at learning domain correspondences or c...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Niu, Haoyi, Chen, Qimao, Liu, Tenglong, Li, Jianxiong, Zhou, Guyue, Zhang, Yi, Hu, Jianming, Zhan, Xianyuan
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Learning Computer Science - Robotics
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Niu, Haoyi Chen, Qimao Liu, Tenglong Li, Jianxiong Zhou, Guyue Zhang, Yi Hu, Jianming Zhan, Xianyuan
description	Reusing pre-collected data from different domains is an appealing solution for decision-making tasks that have insufficient data in the target domain but are relatively abundant in other related domains. Existing cross-domain policy transfer methods mostly aim at learning domain correspondences or corrections to facilitate policy learning, such as learning domain/task-specific discriminators, representations, or policies. This design philosophy often results in heavy model architectures or task/domain-specific modeling, lacking flexibility. This reality makes us wonder: can we directly bridge the domain gaps universally at the data level, instead of relying on complex downstream cross-domain policy transfer models? In this study, we propose the Cross-Domain Trajectory EDiting (xTED) framework that employs a specially designed diffusion model for cross-domain trajectory adaptation. Our proposed model architecture effectively captures the intricate dependencies among states, actions, and rewards, as well as the dynamics patterns within target data. By utilizing the pre-trained diffusion as a prior, source domain trajectories can be transformed to match with target domain properties while preserving original semantic information. This process implicitly corrects underlying domain gaps, enhancing state realism and dynamics reliability in the source data, and allowing flexible incorporation with various downstream policy learning methods. Despite its simplicity, xTED demonstrates superior performance in extensive simulation and real-robot experiments.
doi_str_mv	10.48550/arxiv.2409.08687
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2409_08687</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2409_08687</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2409_086873</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjGw1DOwMLMw52RwqQhxdbFScC7KLy7WdcnPTczMU3BMSSwoSSzJzM9TKMtMVHDJTEsrLQbydJ0Si1NTFEKKErNSk0vyiyoVXFMySzLz0nkYWNMSc4pTeaE0N4O8m2uIs4cu2L74gqLM3MSiyniQvfFge40JqwAAnIM3vQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing</title><source>arXiv.org</source><creator>Niu, Haoyi ; Chen, Qimao ; Liu, Tenglong ; Li, Jianxiong ; Zhou, Guyue ; Zhang, Yi ; Hu, Jianming ; Zhan, Xianyuan</creator><creatorcontrib>Niu, Haoyi ; Chen, Qimao ; Liu, Tenglong ; Li, Jianxiong ; Zhou, Guyue ; Zhang, Yi ; Hu, Jianming ; Zhan, Xianyuan</creatorcontrib><description>Reusing pre-collected data from different domains is an appealing solution for decision-making tasks that have insufficient data in the target domain but are relatively abundant in other related domains. Existing cross-domain policy transfer methods mostly aim at learning domain correspondences or corrections to facilitate policy learning, such as learning domain/task-specific discriminators, representations, or policies. This design philosophy often results in heavy model architectures or task/domain-specific modeling, lacking flexibility. This reality makes us wonder: can we directly bridge the domain gaps universally at the data level, instead of relying on complex downstream cross-domain policy transfer models? In this study, we propose the Cross-Domain Trajectory EDiting (xTED) framework that employs a specially designed diffusion model for cross-domain trajectory adaptation. Our proposed model architecture effectively captures the intricate dependencies among states, actions, and rewards, as well as the dynamics patterns within target data. By utilizing the pre-trained diffusion as a prior, source domain trajectories can be transformed to match with target domain properties while preserving original semantic information. This process implicitly corrects underlying domain gaps, enhancing state realism and dynamics reliability in the source data, and allowing flexible incorporation with various downstream policy learning methods. Despite its simplicity, xTED demonstrates superior performance in extensive simulation and real-robot experiments.</description><identifier>DOI: 10.48550/arxiv.2409.08687</identifier><language>eng</language><subject>Computer Science - Learning ; Computer Science - Robotics</subject><creationdate>2024-09</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2409.08687$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2409.08687$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Niu, Haoyi</creatorcontrib><creatorcontrib>Chen, Qimao</creatorcontrib><creatorcontrib>Liu, Tenglong</creatorcontrib><creatorcontrib>Li, Jianxiong</creatorcontrib><creatorcontrib>Zhou, Guyue</creatorcontrib><creatorcontrib>Zhang, Yi</creatorcontrib><creatorcontrib>Hu, Jianming</creatorcontrib><creatorcontrib>Zhan, Xianyuan</creatorcontrib><title>xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing</title><description>Reusing pre-collected data from different domains is an appealing solution for decision-making tasks that have insufficient data in the target domain but are relatively abundant in other related domains. Existing cross-domain policy transfer methods mostly aim at learning domain correspondences or corrections to facilitate policy learning, such as learning domain/task-specific discriminators, representations, or policies. This design philosophy often results in heavy model architectures or task/domain-specific modeling, lacking flexibility. This reality makes us wonder: can we directly bridge the domain gaps universally at the data level, instead of relying on complex downstream cross-domain policy transfer models? In this study, we propose the Cross-Domain Trajectory EDiting (xTED) framework that employs a specially designed diffusion model for cross-domain trajectory adaptation. Our proposed model architecture effectively captures the intricate dependencies among states, actions, and rewards, as well as the dynamics patterns within target data. By utilizing the pre-trained diffusion as a prior, source domain trajectories can be transformed to match with target domain properties while preserving original semantic information. This process implicitly corrects underlying domain gaps, enhancing state realism and dynamics reliability in the source data, and allowing flexible incorporation with various downstream policy learning methods. Despite its simplicity, xTED demonstrates superior performance in extensive simulation and real-robot experiments.</description><subject>Computer Science - Learning</subject><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjGw1DOwMLMw52RwqQhxdbFScC7KLy7WdcnPTczMU3BMSSwoSSzJzM9TKMtMVHDJTEsrLQbydJ0Si1NTFEKKErNSk0vyiyoVXFMySzLz0nkYWNMSc4pTeaE0N4O8m2uIs4cu2L74gqLM3MSiyniQvfFge40JqwAAnIM3vQ</recordid><startdate>20240913</startdate><enddate>20240913</enddate><creator>Niu, Haoyi</creator><creator>Chen, Qimao</creator><creator>Liu, Tenglong</creator><creator>Li, Jianxiong</creator><creator>Zhou, Guyue</creator><creator>Zhang, Yi</creator><creator>Hu, Jianming</creator><creator>Zhan, Xianyuan</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240913</creationdate><title>xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing</title><author>Niu, Haoyi ; Chen, Qimao ; Liu, Tenglong ; Li, Jianxiong ; Zhou, Guyue ; Zhang, Yi ; Hu, Jianming ; Zhan, Xianyuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2409_086873</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Learning</topic><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Niu, Haoyi</creatorcontrib><creatorcontrib>Chen, Qimao</creatorcontrib><creatorcontrib>Liu, Tenglong</creatorcontrib><creatorcontrib>Li, Jianxiong</creatorcontrib><creatorcontrib>Zhou, Guyue</creatorcontrib><creatorcontrib>Zhang, Yi</creatorcontrib><creatorcontrib>Hu, Jianming</creatorcontrib><creatorcontrib>Zhan, Xianyuan</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Niu, Haoyi</au><au>Chen, Qimao</au><au>Liu, Tenglong</au><au>Li, Jianxiong</au><au>Zhou, Guyue</au><au>Zhang, Yi</au><au>Hu, Jianming</au><au>Zhan, Xianyuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing</atitle><date>2024-09-13</date><risdate>2024</risdate><abstract>Reusing pre-collected data from different domains is an appealing solution for decision-making tasks that have insufficient data in the target domain but are relatively abundant in other related domains. Existing cross-domain policy transfer methods mostly aim at learning domain correspondences or corrections to facilitate policy learning, such as learning domain/task-specific discriminators, representations, or policies. This design philosophy often results in heavy model architectures or task/domain-specific modeling, lacking flexibility. This reality makes us wonder: can we directly bridge the domain gaps universally at the data level, instead of relying on complex downstream cross-domain policy transfer models? In this study, we propose the Cross-Domain Trajectory EDiting (xTED) framework that employs a specially designed diffusion model for cross-domain trajectory adaptation. Our proposed model architecture effectively captures the intricate dependencies among states, actions, and rewards, as well as the dynamics patterns within target data. By utilizing the pre-trained diffusion as a prior, source domain trajectories can be transformed to match with target domain properties while preserving original semantic information. This process implicitly corrects underlying domain gaps, enhancing state realism and dynamics reliability in the source data, and allowing flexible incorporation with various downstream policy learning methods. Despite its simplicity, xTED demonstrates superior performance in extensive simulation and real-robot experiments.</abstract><doi>10.48550/arxiv.2409.08687</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2409.08687
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2409_08687
source	arXiv.org
subjects	Computer Science - Learning Computer Science - Robotics
title	xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T06%3A19%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=xTED:%20Cross-Domain%20Adaptation%20via%20Diffusion-Based%20Trajectory%20Editing&rft.au=Niu,%20Haoyi&rft.date=2024-09-13&rft_id=info:doi/10.48550/arxiv.2409.08687&rft_dat=%3Carxiv_GOX%3E2409_08687%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true