A Tutorial on Pronunciation Modeling for Large Vocabulary Speech Recognition
Main author: | |
---|---|
Format: | Conference proceedings |
Language: | English |
Subjects: | |
Online access: | Full text |
Summary: | Over the past decade, automatic speech recognition (ASR) research has progressed from the recognition of read speech to the recognition of spontaneous conversational speech, prompting some in the field to re-evaluate ASR pronunciation models and their role in capturing the increased phonetic variability of unscripted speech. Two basic approaches to modeling pronunciation variation have emerged: encoding linguistic knowledge to pre-specify possible alternative pronunciations of words, and deriving alternatives directly from a pronunciation corpus. This tutorial is intended to ground the reader in the basic linguistic concepts of phonetics and phonology that guide both techniques and to outline several pronunciation modeling strategies that have been employed through the years. The chapter concludes with a summary of some promising recent research directions. |
---|---|
ISSN: | 0302-9743, 1611-3349 |
DOI: | 10.1007/978-3-540-45115-0_3 |