MannequinChallenge: Learning the Depths of Moving People by Watching Frozen People

We present a method for predicting dense depth in scenarios where both a monocular camera and people in the scene are freely moving (right). Existing methods for recovering depth for dynamic, non-rigid objects from monocular video impose strong assumptions on the objects' motion and may only re...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on pattern analysis and machine intelligence 2021-12, Vol.43 (12), p.4229-4241
Hauptverfasser:	Li, Zhengqi, Dekel, Tali, Cole, Forrester, Tucker, Richard, Snavely, Noah, Liu, Ce, Freeman, William T.
Format:	Artikel
Sprache:	eng
Schlagworte:	Cameras Cleaning Depth prediction dynamic scene reconstruction Freezing Geometric constraints Geometry Image reconstruction Internet mannequin challenge Mannequins Parallax Predictions Three-dimensional displays Training
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	We present a method for predicting dense depth in scenarios where both a monocular camera and people in the scene are freely moving (right). Existing methods for recovering depth for dynamic, non-rigid objects from monocular video impose strong assumptions on the objects' motion and may only recover sparse depth. In this paper, we take a data-driven approach and learn human depth priors from a new source of data: thousands of Internet videos of people imitating mannequins, i.e., freezing in diverse, natural poses, while a hand-held camera tours the scene (left). Because people are stationary, geometric constraints hold, thus training data can be generated using multi-view stereo reconstruction. At inference time, our method uses motion parallax cues from the static areas of the scenes to guide the depth prediction. We evaluate our method on real-world sequences of complex human actions captured by a moving hand-held camera, show improvement over state-of-the-art monocular depth prediction methods, and demonstrate various 3D effects produced using our predicted depth.
ISSN:	0162-8828 1939-3539 2160-9292
DOI:	10.1109/TPAMI.2020.2974454