Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction
Indoor scenes exhibit rich hierarchical structure in 3D object layouts. Many tasks in 3D scene understanding can benefit from reasoning jointly about the hierarchical context of a scene, and the identities of objects. We present a variational denoising recursive autoencoder (VDRAE) that generates an...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Indoor scenes exhibit rich hierarchical structure in 3D object layouts. Many
tasks in 3D scene understanding can benefit from reasoning jointly about the
hierarchical context of a scene, and the identities of objects. We present a
variational denoising recursive autoencoder (VDRAE) that generates and
iteratively refines a hierarchical representation of 3D object layouts,
interleaving bottom-up encoding for context aggregation and top-down decoding
for propagation. We train our VDRAE on large-scale 3D scene datasets to predict
both instance-level segmentations and a 3D object detections from an
over-segmentation of an input point cloud. We show that our VDRAE improves
object detection performance on real-world 3D point cloud datasets compared to
baselines from prior work. |
---|---|
DOI: | 10.48550/arxiv.1903.03757 |