Committee-Based Active Learning for Speech Recognition

Bibliographic details
Published in:	IEICE Transactions on Information and Systems 2011/10/01, Vol.E94.D(10), pp.2015-2023
Main authors:	HAMANAKA, Yuzo, SHINODA, Koichi, TSUTAOKA, Takuya, FURUI, Sadaoki, EMORI, Tadashi, KOSHINAKA, Takafumi
Format:	Article
Language:	eng
Subjects:	active learning | Applied sciences | Artificial intelligence | Computer science; control theory; systems | Exact sciences and technology | Information, signal and communications theory | LVCSR | progressive alignment | query by committee | Sampling, quantization | Signal and communications theory | Signal processing | Speech and sound recognition and synthesis. Linguistics | Speech processing | Telecommunications and information theory
Online access:	Full text

Description
Summary:	We propose a committee-based method of active learning for large vocabulary continuous speech recognition. In this approach, multiple recognizers are trained, and their recognition results are used to select utterances. The utterances whose recognition results differ the most among the recognizers are selected and transcribed. Progressive alignment and voting entropy are used to measure the degree of disagreement among the recognizers on a recognition result. The method was evaluated on 191 hours of speech data from the Corpus of Spontaneous Japanese. It proved to be significantly better than random selection: it required only 63 h of data to achieve a word accuracy of 74%, whereas standard training (i.e., random selection) required 103 h. It also proved to be significantly better than conventional uncertainty sampling using word posterior probabilities.
ISSN:	0916-8532 1745-1361
DOI:	10.1587/transinf.E94.D.2015
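
Illustration:	The summary names voting entropy as the disagreement measure but gives no formula. The following is a minimal sketch of the commonly used vote-entropy score from the query-by-committee literature, not the authors' implementation. It assumes the committee's hypotheses for an utterance have already been aligned into slots of equal length (the alignment step itself is not shown), and the names `GAP`, `vote_entropy`, `utterance_disagreement`, and `select_utterances` are hypothetical. Averaging the per-slot entropy over the utterance is likewise an assumption; the article may normalize differently.

```python
import math
from collections import Counter

GAP = "<eps>"  # hypothetical placeholder for an alignment gap


def vote_entropy(votes):
    """Entropy of the committee's votes for one aligned word slot."""
    k = len(votes)
    counts = Counter(votes)
    # Zero when all recognizers agree; log(k) when all k disagree.
    return sum(-(c / k) * math.log(c / k) for c in counts.values())


def utterance_disagreement(aligned_hyps):
    """Mean per-slot vote entropy over one utterance (assumed scoring).

    aligned_hyps: one word list per recognizer, all of equal length,
    with GAP filling positions where a recognizer has no word.
    """
    if len({len(h) for h in aligned_hyps}) > 1:
        raise ValueError("hypotheses must be aligned to equal length")
    slots = list(zip(*aligned_hyps))
    if not slots:
        return 0.0
    return sum(vote_entropy(slot) for slot in slots) / len(slots)


def select_utterances(aligned_by_utt, n):
    """Return the ids of the n utterances with the highest disagreement."""
    ranked = sorted(
        aligned_by_utt,
        key=lambda utt: utterance_disagreement(aligned_by_utt[utt]),
        reverse=True,
    )
    return ranked[:n]
```

In this sketch, the utterances returned by `select_utterances` would be the ones sent for transcription, since they are the ones on which the committee disagrees most.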