Video generation based on text
Main authors: | , |
---|---|
Format: | Patent |
Language: | chi ; eng |
Subjects: | |
Abstract: | Techniques for generating a video sequence of a person based on a text sequence are disclosed herein. Based on the received text sequence, a processing device generates the video sequence of the person to simulate the person's visual and audible emotional expressions, including using an audio model of the person's voice to generate the audio portion of the video sequence. The emotional expressions in the visual portion of the video sequence are simulated based on a priori knowledge about the person. For instance, the a priori knowledge can include photos or videos of the person captured in real life.
本文公开了用于生成基于文本序列的人的视频序列的技术。基于接收到的文本序列,处理装置生成人的视频序列以模拟人的视觉和听觉情感表达,包括使用人的声音的音频模型生成视频序列的音频部分。基于对人的先验知识,处理装置可以模拟在视频序列的视觉部分中的情感表达。例如,先验知识可以包括在现实生活中的人的照片或视频。 |
---|