Generating LOD3 building models from structure-from-motion and semantic segmentation

•Novel pipeline for automatically reconstruct LOD3 models of free-standing buildings.•New metrics for evaluating the performance of a LOD reconstruction.•Benchmark dataset for evaluating the reconstruction of SfM as well as LOD3 models.•Deep learning models for facade and opening segmentation of fre...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Automation in construction 2022-09, Vol.141, p.104430, Article 104430
Hauptverfasser:	Pantoja-Rosero, B.G., Achanta, R., Kozinski, M., Fua, P., Perez-Cruz, F., Beyer, K.
Format:	Artikel
Sprache:	eng
Schlagworte:	3D building models Deep learning Digital twin LOD models Masonry buildings Structure from motion
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•Novel pipeline for automatically reconstruct LOD3 models of free-standing buildings.•New metrics for evaluating the performance of a LOD reconstruction.•Benchmark dataset for evaluating the reconstruction of SfM as well as LOD3 models.•Deep learning models for facade and opening segmentation of free-standing masonry buildings. This paper describes a pipeline for automatically generating level of detail (LOD) models (digital twins), specifically LOD2 and LOD3, from free-standing buildings. Our approach combines structure from motion (SfM) with deep-learning-based segmentation techniques. Given multiple-view images of a building, we compute a three-dimensional (3D) planar abstraction (LOD2 model) of its point cloud using SfM techniques. To obtain LOD3 models, we use deep learning to perform semantic segmentation of the openings in the two-dimensional (2D) images. Unlike existing approaches, we do not rely on complex input, pre-defined 3D shapes or manual intervention. To demonstrate the robustness of our method, we show that it can generate 3D building shapes from a collection of building images with no further input. For evaluating reconstructions, we also propose two novel metrics. The first is a Euclidean–distance-based correlation of the 3D building model with the point cloud. The second involves re-projecting 3D model facades onto source photos to determine dice scores with respect to the ground-truth masks. Finally, we make the code, the image datasets, SfM outputs, and digital twins reported in this work publicly available in github.com/eesd-epfl/LOD3_buildings and doi.org/10.5281/zenodo.6651663. With this work we aim to contribute research in applications such as construction management, city planning, and mechanical analysis, among others.
ISSN:	0926-5805 1872-7891
DOI:	10.1016/j.autcon.2022.104430