VDG: Vision-Only Dynamic Gaussian for Driving Simulation

Dynamic Gaussian splatting has led to impressive scene reconstruction and image synthesis advances in novel views. Existing methods, however, heavily rely on pre-computed poses and Gaussian initialization by Structure from Motion (SfM) algorithms or expensive sensors. For the first time, this paper...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Li, Hao, Li, Jingfeng, Zhang, Dingwen, Wu, Chenming, Shi, Jieqi, Zhao, Chen, Feng, Haocheng, Ding, Errui, Wang, Jingdong, Han, Junwei
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Li, Hao Li, Jingfeng Zhang, Dingwen Wu, Chenming Shi, Jieqi Zhao, Chen Feng, Haocheng Ding, Errui Wang, Jingdong Han, Junwei
description	Dynamic Gaussian splatting has led to impressive scene reconstruction and image synthesis advances in novel views. Existing methods, however, heavily rely on pre-computed poses and Gaussian initialization by Structure from Motion (SfM) algorithms or expensive sensors. For the first time, this paper addresses this issue by integrating self-supervised VO into our pose-free dynamic Gaussian method (VDG) to boost pose and depth initialization and static-dynamic decomposition. Moreover, VDG can work with only RGB image input and construct dynamic scenes at a faster speed and larger scenes compared with the pose-free dynamic view-synthesis method. We demonstrate the robustness of our approach via extensive quantitative and qualitative experiments. Our results show favorable performance over the state-of-the-art dynamic view synthesis methods. Additional video and source code will be posted on our project page at https://3d-aigc.github.io/VDG.
doi_str_mv	10.48550/arxiv.2406.18198
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2406_18198</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2406_18198</sourcerecordid><originalsourceid>FETCH-LOGICAL-a678-4b3acecbc24f837eb5135b5f62082d6aecbcedf0c8114643fef0e3e252727e063</originalsourceid><addsrcrecordid>eNotj8tuwjAURL1hUQEf0BX-gaR-x-0OEZoiIbEAsY1ujI2ulBjkFNT8PY-ymsWMjuYQ8s5ZrqzW7APSH15zoZjJueWf9o3YfVl90T32eIrZJrYDLYcIHTpawaXvESINp0TLhFeMR7rF7tLC7308IaMAbe-nrxyT3fdyt_jJ1ptqtZivMzCFzVQjwXnXOKGClYVvNJe60cEIZsXBwKPyh8Cc5VwZJYMPzEsvtChE4ZmRYzL7xz6v1-eEHaShfijUTwV5A1wSQOM</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>VDG: Vision-Only Dynamic Gaussian for Driving Simulation</title><source>arXiv.org</source><creator>Li, Hao ; Li, Jingfeng ; Zhang, Dingwen ; Wu, Chenming ; Shi, Jieqi ; Zhao, Chen ; Feng, Haocheng ; Ding, Errui ; Wang, Jingdong ; Han, Junwei</creator><creatorcontrib>Li, Hao ; Li, Jingfeng ; Zhang, Dingwen ; Wu, Chenming ; Shi, Jieqi ; Zhao, Chen ; Feng, Haocheng ; Ding, Errui ; Wang, Jingdong ; Han, Junwei</creatorcontrib><description>Dynamic Gaussian splatting has led to impressive scene reconstruction and image synthesis advances in novel views. Existing methods, however, heavily rely on pre-computed poses and Gaussian initialization by Structure from Motion (SfM) algorithms or expensive sensors. For the first time, this paper addresses this issue by integrating self-supervised VO into our pose-free dynamic Gaussian method (VDG) to boost pose and depth initialization and static-dynamic decomposition. Moreover, VDG can work with only RGB image input and construct dynamic scenes at a faster speed and larger scenes compared with the pose-free dynamic view-synthesis method. We demonstrate the robustness of our approach via extensive quantitative and qualitative experiments. Our results show favorable performance over the state-of-the-art dynamic view synthesis methods. Additional video and source code will be posted on our project page at https://3d-aigc.github.io/VDG.</description><identifier>DOI: 10.48550/arxiv.2406.18198</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-06</creationdate><rights>http://creativecommons.org/licenses/by-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2406.18198$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2406.18198$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Li, Hao</creatorcontrib><creatorcontrib>Li, Jingfeng</creatorcontrib><creatorcontrib>Zhang, Dingwen</creatorcontrib><creatorcontrib>Wu, Chenming</creatorcontrib><creatorcontrib>Shi, Jieqi</creatorcontrib><creatorcontrib>Zhao, Chen</creatorcontrib><creatorcontrib>Feng, Haocheng</creatorcontrib><creatorcontrib>Ding, Errui</creatorcontrib><creatorcontrib>Wang, Jingdong</creatorcontrib><creatorcontrib>Han, Junwei</creatorcontrib><title>VDG: Vision-Only Dynamic Gaussian for Driving Simulation</title><description>Dynamic Gaussian splatting has led to impressive scene reconstruction and image synthesis advances in novel views. Existing methods, however, heavily rely on pre-computed poses and Gaussian initialization by Structure from Motion (SfM) algorithms or expensive sensors. For the first time, this paper addresses this issue by integrating self-supervised VO into our pose-free dynamic Gaussian method (VDG) to boost pose and depth initialization and static-dynamic decomposition. Moreover, VDG can work with only RGB image input and construct dynamic scenes at a faster speed and larger scenes compared with the pose-free dynamic view-synthesis method. We demonstrate the robustness of our approach via extensive quantitative and qualitative experiments. Our results show favorable performance over the state-of-the-art dynamic view synthesis methods. Additional video and source code will be posted on our project page at https://3d-aigc.github.io/VDG.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tuwjAURL1hUQEf0BX-gaR-x-0OEZoiIbEAsY1ujI2ulBjkFNT8PY-ymsWMjuYQ8s5ZrqzW7APSH15zoZjJueWf9o3YfVl90T32eIrZJrYDLYcIHTpawaXvESINp0TLhFeMR7rF7tLC7308IaMAbe-nrxyT3fdyt_jJ1ptqtZivMzCFzVQjwXnXOKGClYVvNJe60cEIZsXBwKPyh8Cc5VwZJYMPzEsvtChE4ZmRYzL7xz6v1-eEHaShfijUTwV5A1wSQOM</recordid><startdate>20240626</startdate><enddate>20240626</enddate><creator>Li, Hao</creator><creator>Li, Jingfeng</creator><creator>Zhang, Dingwen</creator><creator>Wu, Chenming</creator><creator>Shi, Jieqi</creator><creator>Zhao, Chen</creator><creator>Feng, Haocheng</creator><creator>Ding, Errui</creator><creator>Wang, Jingdong</creator><creator>Han, Junwei</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240626</creationdate><title>VDG: Vision-Only Dynamic Gaussian for Driving Simulation</title><author>Li, Hao ; Li, Jingfeng ; Zhang, Dingwen ; Wu, Chenming ; Shi, Jieqi ; Zhao, Chen ; Feng, Haocheng ; Ding, Errui ; Wang, Jingdong ; Han, Junwei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a678-4b3acecbc24f837eb5135b5f62082d6aecbcedf0c8114643fef0e3e252727e063</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Li, Hao</creatorcontrib><creatorcontrib>Li, Jingfeng</creatorcontrib><creatorcontrib>Zhang, Dingwen</creatorcontrib><creatorcontrib>Wu, Chenming</creatorcontrib><creatorcontrib>Shi, Jieqi</creatorcontrib><creatorcontrib>Zhao, Chen</creatorcontrib><creatorcontrib>Feng, Haocheng</creatorcontrib><creatorcontrib>Ding, Errui</creatorcontrib><creatorcontrib>Wang, Jingdong</creatorcontrib><creatorcontrib>Han, Junwei</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Li, Hao</au><au>Li, Jingfeng</au><au>Zhang, Dingwen</au><au>Wu, Chenming</au><au>Shi, Jieqi</au><au>Zhao, Chen</au><au>Feng, Haocheng</au><au>Ding, Errui</au><au>Wang, Jingdong</au><au>Han, Junwei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>VDG: Vision-Only Dynamic Gaussian for Driving Simulation</atitle><date>2024-06-26</date><risdate>2024</risdate><abstract>Dynamic Gaussian splatting has led to impressive scene reconstruction and image synthesis advances in novel views. Existing methods, however, heavily rely on pre-computed poses and Gaussian initialization by Structure from Motion (SfM) algorithms or expensive sensors. For the first time, this paper addresses this issue by integrating self-supervised VO into our pose-free dynamic Gaussian method (VDG) to boost pose and depth initialization and static-dynamic decomposition. Moreover, VDG can work with only RGB image input and construct dynamic scenes at a faster speed and larger scenes compared with the pose-free dynamic view-synthesis method. We demonstrate the robustness of our approach via extensive quantitative and qualitative experiments. Our results show favorable performance over the state-of-the-art dynamic view synthesis methods. Additional video and source code will be posted on our project page at https://3d-aigc.github.io/VDG.</abstract><doi>10.48550/arxiv.2406.18198</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2406.18198
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2406_18198
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition
title	VDG: Vision-Only Dynamic Gaussian for Driving Simulation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T05%3A51%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=VDG:%20Vision-Only%20Dynamic%20Gaussian%20for%20Driving%20Simulation&rft.au=Li,%20Hao&rft.date=2024-06-26&rft_id=info:doi/10.48550/arxiv.2406.18198&rft_dat=%3Carxiv_GOX%3E2406_18198%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true