Data-Driven and Feedback Based Spectro-Temporal Features for Speech Recognition

This paper proposes novel data-driven and feedback based discriminative spectro-temporal filters for feature extraction in automatic speech recognition (ASR). Initially a first set of spectro-temporal filters are designed to separate each phoneme from the rest of the phonemes. A hybrid Hidden Markov...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE signal processing letters 2010-11, Vol.17 (11), p.957-960
Hauptverfasser:	Sivaram, G S V S, Nemala, Sridhar Krishna, Mesgarani, Nima, Hermansky, Hynek
Format:	Artikel
Sprache:	eng
Schlagworte:	Acoustics Confusion analysis Context discriminative filters Feature extraction Hidden Markov models spectro-temporal features Speech Speech processing Speech recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper proposes novel data-driven and feedback based discriminative spectro-temporal filters for feature extraction in automatic speech recognition (ASR). Initially a first set of spectro-temporal filters are designed to separate each phoneme from the rest of the phonemes. A hybrid Hidden Markov Model/Multilayer Perceptron (HMM/MLP) phoneme recognition system is trained on the features derived using these filters. As a feedback to the feature extraction stage, top confusions of this system are identified, and a second set of filters are designed specifically to address these confusions. Phoneme recognition experiments on TIMIT show that the features derived from the combined set of discriminative filters outperform conventional speech recognition features, and also contain significant complementary information.
ISSN:	1070-9908 1558-2361
DOI:	10.1109/LSP.2010.2079930