GauU-Scene V2: Assessing the Reliability of Image-Based Metrics with Expansive Lidar Image Dataset Using 3DGS and NeRF

We introduce a novel, multimodal large-scale scene reconstruction benchmark that utilizes newly developed 3D representation approaches: Gaussian Splatting and Neural Radiance Fields (NeRF). Our expansive U-Scene dataset surpasses any previously existing real large-scale outdoor LiDAR and image datas...

Bibliographic details
Main authors: Xiong, Butian; Zheng, Nanjun; Liu, Junhua; Li, Zhen
Format: Article
Language: English
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Xiong, Butian
Zheng, Nanjun
Liu, Junhua
Li, Zhen
description We introduce a novel, multimodal large-scale scene reconstruction benchmark that utilizes newly developed 3D representation approaches: Gaussian Splatting and Neural Radiance Fields (NeRF). Our expansive U-Scene dataset surpasses any previously existing real large-scale outdoor LiDAR and image dataset in both area and point count. GauU-Scene encompasses over 6.5 square kilometers and features a comprehensive RGB dataset coupled with LiDAR ground truth. Additionally, we are the first to propose a LiDAR and image alignment method for a drone-based dataset. Our assessment of GauU-Scene includes a detailed analysis across various novel viewpoints, employing image-based metrics such as SSIM, LPIPS, and PSNR on NeRF and Gaussian Splatting based methods. This analysis reveals contradictory results when applying geometric-based metrics like Chamfer distance. The experimental results on our multimodal dataset highlight the unreliability of current image-based metrics and reveal significant drawbacks in geometric reconstruction using the current Gaussian Splatting-based method, further illustrating the necessity of our dataset for assessing geometry reconstruction tasks. We also provide detailed supplementary information on data collection protocols and make the dataset available on the following anonymous project page
doi_str_mv 10.48550/arxiv.2404.04880
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2404_04880</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2404_04880</sourcerecordid><originalsourceid>FETCH-LOGICAL-a670-1cf0116348feb1efc6b4092c17dfcc1416184cc6c2670c8ac1337bb7a54044423</originalsourceid><addsrcrecordid>eNotkMtOwzAQRb1hgQofwIr5gQQ7dhLDrvQRKgWQ-mAbTZxxaykNVRxK-_cNKau7ObrSOYw9CB4qHcf8CduTO4aR4irkSmt-y44Z_myClaGG4Ct6gbH35L1rttDtCJZUOyxd7bozfFtY7HFLwSt6quCdutYZD7-u28HsdMDGuyNB7ipsryBMsevRDjbDn5xmK8Cmgg9azu_YjcXa0_3_jth6PltP3oL8M1tMxnmAScoDYSwXIpFKWyoFWZOUij9HRqSVNUYokQitjElM1NNGoxFSpmWZYtwbKhXJEXu83g7ixaF1e2zPxV-AYgggL0VoVJc</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>GauU-Scene V2: Assessing the Reliability of Image-Based Metrics with Expansive Lidar Image Dataset Using 3DGS and NeRF</title><source>arXiv.org</source><creator>Xiong, Butian ; Zheng, Nanjun ; Liu, Junhua ; Li, Zhen</creator><creatorcontrib>Xiong, Butian ; Zheng, Nanjun ; Liu, Junhua ; Li, Zhen</creatorcontrib><description>We introduce a novel, multimodal large-scale scene reconstruction benchmark that utilizes newly developed 3D representation approaches: Gaussian Splatting and Neural Radiance Fields (NeRF). Our expansive U-Scene dataset surpasses any previously existing real large-scale outdoor LiDAR and image dataset in both area and point count. GauU-Scene encompasses over 6.5 square kilometers and features a comprehensive RGB dataset coupled with LiDAR ground truth. Additionally, we are the first to propose a LiDAR and image alignment method for a drone-based dataset. Our assessment of GauU-Scene includes a detailed analysis across various novel viewpoints, employing image-based metrics such as SSIM, LPIPS, and PSNR on NeRF and Gaussian Splatting based methods. 
This analysis reveals contradictory results when applying geometric-based metrics like Chamfer distance. The experimental results on our multimodal dataset highlight the unreliability of current image-based metrics and reveal significant drawbacks in geometric reconstruction using the current Gaussian Splatting-based method, further illustrating the necessity of our dataset for assessing geometry reconstruction tasks. We also provide detailed supplementary information on data collection protocols and make the dataset available on the following anonymous project page</description><identifier>DOI: 10.48550/arxiv.2404.04880</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-04</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2404.04880$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2404.04880$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Xiong, Butian</creatorcontrib><creatorcontrib>Zheng, Nanjun</creatorcontrib><creatorcontrib>Liu, Junhua</creatorcontrib><creatorcontrib>Li, Zhen</creatorcontrib><title>GauU-Scene V2: Assessing the Reliability of Image-Based Metrics with Expansive Lidar Image Dataset Using 3DGS and NeRF</title><description>We introduce a novel, multimodal large-scale scene reconstruction benchmark that utilizes newly developed 3D representation approaches: Gaussian Splatting and Neural Radiance Fields (NeRF). 
Our expansive U-Scene dataset surpasses any previously existing real large-scale outdoor LiDAR and image dataset in both area and point count. GauU-Scene encompasses over 6.5 square kilometers and features a comprehensive RGB dataset coupled with LiDAR ground truth. Additionally, we are the first to propose a LiDAR and image alignment method for a drone-based dataset. Our assessment of GauU-Scene includes a detailed analysis across various novel viewpoints, employing image-based metrics such as SSIM, LPIPS, and PSNR on NeRF and Gaussian Splatting based methods. This analysis reveals contradictory results when applying geometric-based metrics like Chamfer distance. The experimental results on our multimodal dataset highlight the unreliability of current image-based metrics and reveal significant drawbacks in geometric reconstruction using the current Gaussian Splatting-based method, further illustrating the necessity of our dataset for assessing geometry reconstruction tasks. We also provide detailed supplementary information on data collection protocols and make the dataset available on the following anonymous project page</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotkMtOwzAQRb1hgQofwIr5gQQ7dhLDrvQRKgWQ-mAbTZxxaykNVRxK-_cNKau7ObrSOYw9CB4qHcf8CduTO4aR4irkSmt-y44Z_myClaGG4Ct6gbH35L1rttDtCJZUOyxd7bozfFtY7HFLwSt6quCdutYZD7-u28HsdMDGuyNB7ipsryBMsevRDjbDn5xmK8Cmgg9azu_YjcXa0_3_jth6PltP3oL8M1tMxnmAScoDYSwXIpFKWyoFWZOUij9HRqSVNUYokQitjElM1NNGoxFSpmWZYtwbKhXJEXu83g7ixaF1e2zPxV-AYgggL0VoVJc</recordid><startdate>20240407</startdate><enddate>20240407</enddate><creator>Xiong, Butian</creator><creator>Zheng, Nanjun</creator><creator>Liu, Junhua</creator><creator>Li, 
Zhen</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240407</creationdate><title>GauU-Scene V2: Assessing the Reliability of Image-Based Metrics with Expansive Lidar Image Dataset Using 3DGS and NeRF</title><author>Xiong, Butian ; Zheng, Nanjun ; Liu, Junhua ; Li, Zhen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a670-1cf0116348feb1efc6b4092c17dfcc1416184cc6c2670c8ac1337bb7a54044423</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Xiong, Butian</creatorcontrib><creatorcontrib>Zheng, Nanjun</creatorcontrib><creatorcontrib>Liu, Junhua</creatorcontrib><creatorcontrib>Li, Zhen</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Xiong, Butian</au><au>Zheng, Nanjun</au><au>Liu, Junhua</au><au>Li, Zhen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>GauU-Scene V2: Assessing the Reliability of Image-Based Metrics with Expansive Lidar Image Dataset Using 3DGS and NeRF</atitle><date>2024-04-07</date><risdate>2024</risdate><abstract>We introduce a novel, multimodal large-scale scene reconstruction benchmark that utilizes newly developed 3D representation approaches: Gaussian Splatting and Neural Radiance Fields (NeRF). Our expansive U-Scene dataset surpasses any previously existing real large-scale outdoor LiDAR and image dataset in both area and point count. GauU-Scene encompasses over 6.5 square kilometers and features a comprehensive RGB dataset coupled with LiDAR ground truth. Additionally, we are the first to propose a LiDAR and image alignment method for a drone-based dataset. 
Our assessment of GauU-Scene includes a detailed analysis across various novel viewpoints, employing image-based metrics such as SSIM, LPIPS, and PSNR on NeRF and Gaussian Splatting based methods. This analysis reveals contradictory results when applying geometric-based metrics like Chamfer distance. The experimental results on our multimodal dataset highlight the unreliability of current image-based metrics and reveal significant drawbacks in geometric reconstruction using the current Gaussian Splatting-based method, further illustrating the necessity of our dataset for assessing geometry reconstruction tasks. We also provide detailed supplementary information on data collection protocols and make the dataset available on the following anonymous project page</abstract><doi>10.48550/arxiv.2404.04880</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2404.04880
ispartof
issn
language eng
recordid cdi_arxiv_primary_2404_04880
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
title GauU-Scene V2: Assessing the Reliability of Image-Based Metrics with Expansive Lidar Image Dataset Using 3DGS and NeRF
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T03%3A30%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=GauU-Scene%20V2:%20Assessing%20the%20Reliability%20of%20Image-Based%20Metrics%20with%20Expansive%20Lidar%20Image%20Dataset%20Using%203DGS%20and%20NeRF&rft.au=Xiong,%20Butian&rft.date=2024-04-07&rft_id=info:doi/10.48550/arxiv.2404.04880&rft_dat=%3Carxiv_GOX%3E2404_04880%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true