Video recording processing

Detailed description

Bibliographic details
Main authors:	YING QIANLAN, CHEN CHUANSHI, GUO JINGRU, CHEN GAOJUN, XIA XIAOBO, ZHOU ZHANGYAN, CAO WENWEN, WANG RONGZHAO
Format:	Patent
Languages:	Chinese; English
Subjects:	ACOUSTICS; CALCULATING; COMPUTING; COUNTING; ELECTRIC COMMUNICATION TECHNIQUE; ELECTRIC DIGITAL DATA PROCESSING; ELECTRICITY; MUSICAL INSTRUMENTS; PHYSICS; PICTORIAL COMMUNICATION, e.g. TELEVISION; SPEECH ANALYSIS OR SYNTHESIS; SPEECH OR AUDIO CODING OR DECODING; SPEECH OR VOICE PROCESSING; SPEECH RECOGNITION

Description
Abstract:	The present disclosure provides a method, an apparatus, a computer program product, and a non-transitory computer readable medium for processing a video recording of a target application. A video recording of the target application may be obtained. Multi-modal data for the video recording may be obtained, the multi-modal data including at least one of speech transcription, video, image, text, and event information. Multi-modal features of the video recording may be generated based on the multi-modal data, the multi-modal features including at least one of a speech transcription feature, a video feature, an image feature, a textual feature, and an event feature. Target content associated with the video recording may be determined based at least on the multi-modal features.
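
The data flow in the abstract (multi-modal data, then per-modality features, then target content) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the function names `generate_features` and `determine_target_content`, the representation of each modality as a list of strings, the use of token counts as "features", and the reading of "target content" as the most frequent keywords. The abstract does not say how features are computed or what form the target content takes.

```python
from collections import Counter

# Modalities named in the abstract; a recording may supply any non-empty subset.
MODALITIES = ("speech_transcription", "video", "image", "text", "event")


def generate_features(multimodal_data: dict[str, list[str]]) -> dict[str, Counter]:
    """Build one toy feature per modality present (hypothetical helper).

    Each modality is assumed to be pre-reduced to strings (for example,
    captions for video frames). The "feature" is a lowercase token count.
    """
    features: dict[str, Counter] = {}
    for modality, items in multimodal_data.items():
        if modality not in MODALITIES:
            raise ValueError(f"unknown modality: {modality}")
        tokens = [tok.lower() for item in items for tok in item.split()]
        features[f"{modality}_feature"] = Counter(tokens)
    return features


def determine_target_content(features: dict[str, Counter], top_k: int = 3) -> list[str]:
    """Toy stand-in for the final step: merge all counts, return top tokens."""
    merged: Counter = Counter()
    for counts in features.values():
        merged.update(counts)
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(merged.items(), key=lambda kv: (-kv[1], kv[0]))
    return [token for token, _ in ranked[:top_k]]


# Illustrative input: three of the five modalities are available.
recording_data = {
    "speech_transcription": ["budget review next week"],
    "text": ["Budget review"],
    "event": ["screen_share budget"],
}
features = generate_features(recording_data)
keywords = determine_target_content(features, top_k=2)
```

Keeping the features separate per modality until the final step mirrors the abstract's wording, in which the target content is determined "based at least on" the multi-modal features; a real system would presumably replace the token counts with learned representations.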