Determining traversal space from single image

The model predicts the geometry of both visible and occluded transferable surfaces from the input image. The model may be trained according to a stereoscopic video sequence, using camera gestures, depth per frame, and semantic segmentation to form training data that is used to supervise an image-to-...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	FAIRMAN, MICHAEL, D, BROSTOW, GEORGE, J, WATSON JAMES, MONSPART, ANDREAS
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTING COUNTING IMAGE DATA PROCESSING OR GENERATION, IN GENERAL PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The model predicts the geometry of both visible and occluded transferable surfaces from the input image. The model may be trained according to a stereoscopic video sequence, using camera gestures, depth per frame, and semantic segmentation to form training data that is used to supervise an image-to-image network. In various embodiments, the model is applied to a single RGB image describing a scene to produce information describing a traversal space including an occluded traversal scene. The information describing the transferable space may include a partition mask of the transferable space (both visible and occluded) and a non-transferable space and a depth map indicating to an estimated depth corresponding to a transferable surface determined to correspond to each pixel of the transferable space. 模型从输入图像预测可见和被遮挡的可穿越表面两者的几何形状。该模型可以根据立体视频序列训练，使用相机姿势、每帧深度和语义分割来形成训练数据，该训练数据用于监督图像到图像网络。在各种实施例中，该模型被应用于描述场景的单个RGB图像以产生描述包括被遮挡的可穿越的场景的可穿越空间的信息。描述可穿越空间的信息可以包括可穿越空间(可见和被遮挡的两者)和不可穿越空间的分割掩码以及深度图，该深度图指示到与可穿越表面对应的估计深度，该可穿越表面