Robust Speech Recognition Using Teacher-Student Learning Domain Adaptation

Recently, robust speech recognition for real-world applications has attracted much attention. This paper proposes a robust speech recognition method based on the teacher-student learning framework for domain adaptation. In particular, the student network will be trained based on a novel optimization...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEICE Transactions on Information and Systems 2022/12/01, Vol.E105.D(12), pp.2112-2118
Hauptverfasser:	MA, Han, ZHANG, Qiaoling, TANG, Roubing, ZHANG, Lu, JIA, Yubo
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptation automatic speech recognition Coders domain adaptation Domains Learning noise robustness Optimization Robustness Speech recognition teacher-student learning Teachers
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Recently, robust speech recognition for real-world applications has attracted much attention. This paper proposes a robust speech recognition method based on the teacher-student learning framework for domain adaptation. In particular, the student network will be trained based on a novel optimization criterion defined by the encoder outputs of both teacher and student networks rather than the final output posterior probabilities, which aims to make the noisy audio map to the same embedding space as clean audio, so that the student network is adaptive in the noise domain. Comparative experiments demonstrate that the proposed method obtained good robustness against noise.
ISSN:	0916-8532 1745-1361
DOI:	10.1587/transinf.2022EDP7043