Speaker recognition in multimedia system

Bibliographic details
Main authors:	A LAKSHMI LATTAN, D M CHICKLIN, J D WILLIAMS, LAU P DASGI, A KOLOBOVI, N B NILE, C GARCIA JURA DUO SAREZ, G G ZWICK
Format:	Patent
Language:	Chinese; English
Subjects:	ACOUSTICS; MUSICAL INSTRUMENTS; PHYSICS; SPEECH ANALYSIS OR SYNTHESIS; SPEECH OR AUDIO CODING OR DECODING; SPEECH OR VOICE PROCESSING; SPEECH RECOGNITION

Description
Abstract:	A method for identifying a user among a plurality of users of a multimedia system, comprising extracting an i-vector for a speech utterance using total variability modeling, comparing the extracted i-vector with a collection of i-vector sets in order to identify a target set most similar to the extracted i-vector, and granting access to the multimedia system in accordance with an access profile associated with the identified target set. Further, source variation is minimized by, for each speech utterance acquired using a specific data source, re-centering first-order statistics of the speech utterance around the mean of an informative prior associated with the source, and using the covariance of the informative prior associated with the source when extracting the i-vector for the speech utterance. Domain-specific language understanding models that can be built, tested, and improved quickly and efficiently are provided. Methods, systems, and devices are provided that enable developers to build user intent detection models, language entity extraction models, and language entity resolution models quickly and without specialized machine learning knowledge. These models can be built and implemented via a single-model system, which enables the models to be built in isolation or within an end-to-end pipeline system that enables the models to be built and improved simultaneously.
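The comparison and access steps of the speaker identification method can be illustrated with a minimal sketch. It assumes the i-vector has already been extracted, and it scores each enrolled set by cosine similarity against the set's mean vector. The abstract does not name a scoring function, so cosine scoring is an assumption here, as are all names (`identify_user`, `grant_access`, `enrolled_sets`, `profiles`) and the toy three-dimensional vectors. The source-specific re-centering of first-order statistics is not covered.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def identify_user(ivector, enrolled_sets):
    """Return (user_id, score) for the enrolled set most similar to ivector.

    Assumption: each set is summarized by its mean and scored by cosine
    similarity; the abstract only says "most similar".
    """
    scored = (
        (user_id, cosine(ivector, mean_vector(vectors)))
        for user_id, vectors in enrolled_sets.items()
    )
    return max(scored, key=lambda pair: pair[1])


def grant_access(ivector, enrolled_sets, profiles):
    """Look up the access profile tied to the identified target set."""
    user_id, _score = identify_user(ivector, enrolled_sets)
    return profiles[user_id]


# Hypothetical enrollment data: a few i-vectors per user (toy dimensions).
enrolled_sets = {
    "alice": [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]],
    "bob": [[0.0, 0.9, 0.3], [0.1, 0.8, 0.4]],
}
# Hypothetical access profiles.
profiles = {
    "alice": {"parental_lock": False},
    "bob": {"parental_lock": True},
}

if __name__ == "__main__":
    test_ivector = [0.85, 0.15, 0.05]
    print(identify_user(test_ivector, enrolled_sets)[0])
    print(grant_access(test_ivector, enrolled_sets, profiles))
```

A practical system would likely also apply a score threshold so that unknown speakers are rejected rather than mapped to the nearest enrolled set; that threshold is omitted here for brevity.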