Video generation based on text

Bibliographic Details
Main Authors:	ROUHI ALI, REZVANI BEHROOZ
Format:	Patent
Language:	chi ; eng
Subjects:	ACOUSTICS; CALCULATING; COMPUTING; COUNTING; ELECTRIC COMMUNICATION TECHNIQUE; ELECTRICITY; IMAGE DATA PROCESSING OR GENERATION, IN GENERAL; MUSICAL INSTRUMENTS; PHYSICS; SPEECH ANALYSIS OR SYNTHESIS; SPEECH OR AUDIO CODING OR DECODING; SPEECH OR VOICE PROCESSING; SPEECH RECOGNITION; TELEPHONIC COMMUNICATION

Description
Summary:	Techniques for generating a video sequence of a person based on a text sequence are disclosed herein. Based on the received text sequence, a processing device generates the video sequence of the person to simulate visual and audible emotional expressions of the person, including using an audio model of the person's voice to generate an audio portion of the video sequence. The emotional expressions in the visual portion of the video sequence are simulated based on a priori knowledge about the person. For instance, the a priori knowledge can include photos or videos of the person captured in real life.