Perceptual Properties of Current Speech Recognition Technology

In recent years, a number of feature extraction procedures for automatic speech recognition (ASR) systems have been based on models of human auditory processing, and one often hears arguments in favor of implementing knowledge of human auditory perception and cognition into machines for ASR. This pa...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Proceedings of the IEEE 2013-09, Vol.101 (9), p.1968-1985
Hauptverfasser:	Hermansky, Hynek, Cohen, Jordan R., Stern, Richard M.
Format:	Artikel
Sprache:	eng
Schlagworte:	Auditory perception Auditory system Cavity resonators Educational institutions Feature extraction Resonant frequency Speech recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In recent years, a number of feature extraction procedures for automatic speech recognition (ASR) systems have been based on models of human auditory processing, and one often hears arguments in favor of implementing knowledge of human auditory perception and cognition into machines for ASR. This paper takes a reverse route, and argues that the engineering techniques for automatic recognition of speech that are already in widespread use are often consistent with some well-known properties of the human auditory system.
ISSN:	0018-9219 1558-2256
DOI:	10.1109/JPROC.2013.2252316