AI Methods for Transforming a Text Prompt into an Immersive Volumetric Photo or Video
A text-to-image prompt is processed using a text-to-image machine learning model to obtain a non-immersive (e.g., rectilinear image). The non-immersive image may be enhanced by a superresolution machine learning model and processed with a monoscopic depth estimation model to obtain a depthmap. The n...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A text-to-image prompt is processed using a text-to-image machine learning model to obtain a non-immersive (e.g., rectilinear image). The non-immersive image may be enhanced by a superresolution machine learning model and processed with a monoscopic depth estimation model to obtain a depthmap. The non-immersive image and the depthmap may be converted to an immersive projection (e.g., F-theta) and corresponding depth map. The immersive projection may be out-painted. The immersive projection may be used to generate video with simulated camera movement, output on a VR headset, and/or processed to remove a background layer and displayed on an AR headset, or on an holographic glasses-free three-dimensional display. |
---|