Depth map prediction from a single image with generative adversarial nets

A depth map is a fundamental component of 3D construction. Depth map prediction from a single image is a challenging task in computer vision. In this paper, we consider the depth prediction as an image-to-image task and propose an adversarial convolutional architecture called the Depth Generative Ad...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Multimedia tools and applications 2020-06, Vol.79 (21-22), p.14357-14374
Hauptverfasser:	Zhang, Shaoyong, Li, Na, Qiu, Chenchen, Yu, Zhibin, Zheng, Haiyong, Zheng, Bing
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Computer Communication Networks Computer Science Computer Science, Information Systems Computer Science, Software Engineering Computer Science, Theory & Methods Computer vision Data Structures and Information Theory Datasets Engineering Engineering, Electrical & Electronic Image enhancement Methods Multimedia Multimedia Information Systems Neural networks Researchers Science & Technology Semantics Special Purpose and Application-Based Systems Technology
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	A depth map is a fundamental component of 3D construction. Depth map prediction from a single image is a challenging task in computer vision. In this paper, we consider the depth prediction as an image-to-image task and propose an adversarial convolutional architecture called the Depth Generative Adversarial Network (DepthGAN) for depth prediction. To enhance the image translation ability, we take advantage of a Fully Convolutional Residual Network (FCRN) and combine it with a generative adversarial network, which has shown remarkable achievements in image-to-image tasks. We also present a new loss function including the scale-invariant (SI) error and the structural similarity (SSIM) loss function to improve our model and to output a high-quality depth map. Experiments show that the DepthGAN performs better in monocular depth prediction than the current best method on the NYU Depth v2 dataset.
ISSN:	1380-7501 1573-7721
DOI:	10.1007/s11042-018-6694-x