Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model

A novel online speaker clustering method based on a generative model is proposed. It employs an incremental variant of variational Bayesian learning and provides probabilistic (non-deterministic) decisions for each input utterance, on the basis of the history of preceding utterances. It can be expec...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEICE Transactions on Information and Systems 2012/10/01, Vol.E95.D(10), pp.2469-2478
Hauptverfasser:	KOSHINAKA, Takafumi, NAGATOMO, Kentaro, SHINODA, Koichi
Format:	Artikel
Sprache:	eng
Schlagworte:	Applied sciences Artificial intelligence Classification Clustering Clusters Computer science control theory systems Errors Exact sciences and technology HMM Information, signal and communications theory Learning meeting recognition model selection On-line systems Online Real time Signal and communications theory Signal processing Signal representation. Spectral analysis Signal, noise Speech and sound recognition and synthesis. Linguistics Speech processing Telecommunications and information theory variational Bayesian learning
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	A novel online speaker clustering method based on a generative model is proposed. It employs an incremental variant of variational Bayesian learning and provides probabilistic (non-deterministic) decisions for each input utterance, on the basis of the history of preceding utterances. It can be expected to be robust against errors in cluster estimation and the classification of utterances, and hence to be applicable to many real-time applications. Experimental results show that it produces 50% fewer classification errors than does a conventional online method. They also show that it is possible to reduce the number of speech recognition errors by combining the method with unsupervised speaker adaptation.
ISSN:	0916-8532 1745-1361
DOI:	10.1587/transinf.E95.D.2469