Strategies for distant speech recognitionin reverberant environments

Reverberation and noise are known to severely affect the automatic speech recognition (ASR) performance of speech recorded by distant microphones. Therefore, we must deal with reverberation if we are to realize high-performance hands-free speech recognition. In this paper, we review a recognition sy...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	EURASIP journal on advances in signal processing 2015-07, Vol.2015 (1), p.1-15, Article 60
Hauptverfasser:	Delcroix, Marc, Yoshioka, Takuya, Ogawa, Atsunori, Kubo, Yotaro, Fujimoto, Masakiyo, Ito, Nobutaka, Kinoshita, Keisuke, Espi, Miquel, Araki, Shoko, Hori, Takaaki, Nakatani, Tomohiro
Format:	Artikel
Sprache:	eng
Schlagworte:	Acoustic noise Engineering Mathematical models Neural networks Quantum Information Technology Signal,Image and Speech Processing Speech Speech processing Speech recognition Spintronics Strategy Tasks ‘Silencing the Echoes’ – Processing of Reverberant Speech
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Reverberation and noise are known to severely affect the automatic speech recognition (ASR) performance of speech recorded by distant microphones. Therefore, we must deal with reverberation if we are to realize high-performance hands-free speech recognition. In this paper, we review a recognition system that we developed at our laboratory to deal with reverberant speech. The system consists of a speech enhancement (SE) front-end that employs long-term linear prediction-based dereverberation followed by noise reduction. We combine our SE front-end with an ASR back-end that uses neural networks for acoustic and language modeling. The proposed system achieved top scores on the ASR task of the REVERB challenge. This paper describes the different technologies used in our system and presents detailed experimental results that justify our implementation choices and may provide hints for designing distant ASR systems.
ISSN:	1687-6180 1687-6172 1687-6180
DOI:	10.1186/s13634-015-0245-7