AI Methods for Transforming a Text Prompt into an Immersive Volumetric Photo or Video

A text-to-image prompt is processed using a text-to-image machine learning model to obtain a non-immersive (e.g., rectilinear image). The non-immersive image may be enhanced by a superresolution machine learning model and processed with a monoscopic depth estimation model to obtain a depthmap. The n...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Neal, Lawrence Wayne, Briggs, Forrest
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A text-to-image prompt is processed using a text-to-image machine learning model to obtain a non-immersive (e.g., rectilinear image). The non-immersive image may be enhanced by a superresolution machine learning model and processed with a monoscopic depth estimation model to obtain a depthmap. The non-immersive image and the depthmap may be converted to an immersive projection (e.g., F-theta) and corresponding depth map. The immersive projection may be out-painted. The immersive projection may be used to generate video with simulated camera movement, output on a VR headset, and/or processed to remove a background layer and displayed on an AR headset, or on an holographic glasses-free three-dimensional display.